How much effort does it take to create some UGC? How can you allow you to grab it casually? You can use high-density proxy IP to capture. We have captured Taobao's data before, but we banned it after capturing it a few times. Later, we used a proxy to solve the problem perfectly, and the other party couldn't ban it at all.
Douban limits the crawling frequency through cookies. After analyzing the cookies, it can be forged. Last month, 1.07 million pages were crawled in two hours.
It’s good to learn to crawl Douban. If you want to apply it, please use their open platform http://developers.douban.com/
I caught him too hard and was blocked by Douban~
How much effort does it take to create some UGC? How can you allow you to grab it casually?
You can use high-density proxy IP to capture. We have captured Taobao's data before, but we banned it after capturing it a few times. Later, we used a proxy to solve the problem perfectly, and the other party couldn't ban it at all.
Douban limits the crawling frequency through cookies. After analyzing the cookies, it can be forged. Last month, 1.07 million pages were crawled in two hours.