python爬虫 - python访问豆瓣突然遇到403forbidden,用过edge浏览器也无法访问?
高洛峰
高洛峰 2017-04-17 16:52:07
0
4
606

如题,使用requests访问,昨天写代码的时候两个多小时一直在访问豆瓣主页,没有什么问题,今天用相同的程序就变成了403forbidden,chrome、edge访问网址也变成了403forbidden..不知是什么原因..

高洛峰
高洛峰

拥有18年软件开发和IT教学经验。曾任多家上市公司技术总监、架构师、项目经理、高级软件工程师等职务。 网络人气名人讲师,...

reply all(4)
刘奇

It’s good to learn to crawl Douban. If you want to apply it, please use their open platform http://developers.douban.com/

巴扎黑

I caught him too hard and was blocked by Douban~

Peter_Zhu

How much effort does it take to create some UGC? How can you allow you to grab it casually?
You can use high-density proxy IP to capture. We have captured Taobao's data before, but we banned it after capturing it a few times. Later, we used a proxy to solve the problem perfectly, and the other party couldn't ban it at all.

黄舟

Douban limits the crawling frequency through cookies. After analyzing the cookies, it can be forged. Last month, 1.07 million pages were crawled in two hours.

Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template
About us Disclaimer Sitemap
php.cn:Public welfare online PHP training,Help PHP learners grow quickly!