html - xpath怎样不包括某个节点？

Question

公司有30多个网站，因为G20怕被篡改。写了个scrapy爬下30多个网站的&lt;body&gt;内容，然后保存json对比。其中两个网站有访问人数统计，所以每次访问得到的数字都不一样，所以不能判断是否被篡改。想到的方法是去...

巴扎黑 · Answer

Regular should be fine, right? Have you tried it?

ringa_lee · Answer

怪我咯 · Answer

xpath is for matching and mismatching. You pull it down completely and then match the unnecessary parts and remove them