node.js - node crawler, how to use IP pool to prevent anti-crawling?

Question

The problem is this. I am a newbie who just started learning node. Of course, it is obviously for crawlers. Then I was reading a novel recently, but there were too many ads on those free novel websites, so I planned to write a crawler to crawl the entire novel. However, the number of url requests was too frequent, so that it would be reverse-crawled and blocked. ..

高洛峰 · Answer

Anti-crawling means that the control program cannot use one IP address to crawl the same website multiple times at a very fast frequency. Here comes the idea. Now that we have an IP pool, the program can use multiple IPs to initiate requests. In this case, What you do is to regularly change the IP used by the program. For example, according to your crawling frequency, half an hour, or half a day, or longer is an interval. When the time is up, replace an IP for the crawler program. Here is a link, node Agent, maybe useful/q/10...