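Suppose an article spans 2-3 pages. I want to crawl all of those content pages, join them into a single page, and then store the result in a database. The article URLs look like article_1.html, article_2.html. The item has item['title'] and item['content'], where item['content'] is the content of all pages joined together. Roughly how should this be written?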
Find the URL of the paging interface.
Find the link to the next page and add it to the list of URLs to crawl.
You can also write regular-expression patterns in the crawler's rules so that matching URLs are picked up automatically.
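The follow-the-next-link idea can be sketched in plain Python as below. This is only an illustration, not a definitive implementation: the three regex patterns, the `crawl_article` name, and the `fetch` callable are assumptions made for the example, and the markup they match (`<h1>`, `<div class="content">`, a link labelled "Next") would need to be adapted to the real pages.

```python
import re
from typing import Callable, Dict, List, Optional, Set
from urllib.parse import urljoin

# Hypothetical patterns; adjust them to the actual page markup.
TITLE_RE = re.compile(r'<h1[^>]*>(.*?)</h1>', re.S)
CONTENT_RE = re.compile(r'<div class="content">(.*?)</div>', re.S)
NEXT_LINK_RE = re.compile(r'<a[^>]+href="([^"]+)"[^>]*>\s*Next\s*</a>', re.I)


def crawl_article(first_url: str,
                  fetch: Callable[[str], str],
                  max_pages: int = 20) -> Dict[str, str]:
    """Follow next-page links from first_url and join the content of every page."""
    item = {"title": "", "content": ""}
    parts: List[str] = []
    seen: Set[str] = set()
    url: Optional[str] = first_url

    # Stop when there is no next link, a URL repeats, or the page cap is hit.
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        html = fetch(url)

        # Take the title from the first page that has one.
        if not item["title"]:
            title = TITLE_RE.search(html)
            if title:
                item["title"] = title.group(1).strip()

        # Collect this page's share of the article body.
        content = CONTENT_RE.search(html)
        if content:
            parts.append(content.group(1).strip())

        # Resolve the next-page link relative to the current URL.
        nxt = NEXT_LINK_RE.search(html)
        url = urljoin(url, nxt.group(1)) if nxt else None

    item["content"] = "\n".join(parts)
    return item
```

Here `fetch` is any function that returns the HTML for a URL, so the item is complete only after the last page has been read and can then be written to the database in one step. If the crawler is Scrapy, the same accumulation is usually done by carrying the partial item from one request to the next through the request's `meta` or `cb_kwargs`, and yielding the item only when no next-page link is found.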