PHP - Snoopy crawler returns error 405 Not Allowed

Question

Code:

$httpClass = new Snoopy();
$httpClass->fetch('https://v.qq.com/');
$url = $httpClass->results;
print_r($url);
die();

Fetching https://www.baidu.com/ with this code always returns a 405 error, while fetching https://v.qq.com/ works normally...

淡淡烟草味 · Answer

Baidu probably has anti-crawler measures in place. You need to disguise the request, for example by setting a browser-like User-Agent (UA).
Reference: http://www.4wei.cn/archives/396
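
A minimal sketch of the User-Agent suggestion, assuming Snoopy's public `agent` property, which it sends as the User-Agent header. The helper name `useBrowserAgent` and the UA string are illustrative and not from the original thread, and a browser-like UA is not guaranteed to get past Baidu's checks.

```php
<?php
// Hypothetical helper: gives a Snoopy-style client a browser-like User-Agent.
// The UA string is only an example; any current browser's string could be used.
function useBrowserAgent($client)
{
    $client->agent = 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) '
        . 'AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36';
    return $client;
}

// Usage, assuming Snoopy.class.php is available on the include path:
// require 'Snoopy.class.php';
// $httpClass = useBrowserAgent(new Snoopy());
// $httpClass->fetch('https://www.baidu.com/');
// print_r($httpClass->results);
```

If the request still fails, the site is probably checking something other than the User-Agent.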

天蓬老师 · Answer

This is not a problem with Snoopy itself; it comes from not knowing how sites treat crawlers. Wherever there are crawlers, there are anti-crawler techniques. The simplest ones check request headers such as the browser identifier (User-Agent) or the Referer. Large sites like Baidu and Tencent do not want their data crawled, so they are likely to have many such measures in place. It is worth learning the basics of how crawlers work before crawling data.
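
The header-based checks described above can be approached by sending a Referer and a few browser-style headers. This is a rough sketch, assuming Snoopy's public `referer`, `rawheaders`, and `status` properties; the helper name `addBrowserHeaders` and the header values are hypothetical, and larger sites may apply checks that headers alone do not satisfy.

```php
<?php
// Hypothetical helper: adds a Referer and browser-style headers
// to a Snoopy-style client. The header values are illustrative only.
function addBrowserHeaders($client, $referer)
{
    $client->referer = $referer;

    // Keep any headers already set on the client, then add ours.
    $headers = (isset($client->rawheaders) && is_array($client->rawheaders))
        ? $client->rawheaders
        : array();
    $headers['Accept'] = 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8';
    $headers['Accept-Language'] = 'zh-CN,zh;q=0.9,en;q=0.8';
    $client->rawheaders = $headers;

    return $client;
}

// Usage, assuming Snoopy.class.php is available on the include path:
// require 'Snoopy.class.php';
// $httpClass = addBrowserHeaders(new Snoopy(), 'https://www.baidu.com/');
// $httpClass->fetch('https://www.baidu.com/');
// echo $httpClass->status; // HTTP status code of the last request
```

Printing the status after each change makes it easier to see which header, if any, changes the server's response.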