网页爬虫 - Python爬虫运行内存占用过高导致电脑停止响应

Question

各位好，我写了1个非常简单的爬虫去爬取51job里的招聘信息。从下面的链接里提取出每个招聘岗位的链接（一共50个链接）http://search.51job.com/jobse...再根据每个招聘岗位的url为每个岗位生成一个id，并且爬取每...

高洛峰 · Answer

This is an idle bug

Just save the results to a file~

阿神 · Answer

I tried running your code and found that it did not occupy too much memory. The maximum memory usage was only 30M.
I suggest you try the following

Run python xxx.py directly on the command line without using Pycharm to see if it is caused by Pycharm
Confirm the memory usage and CPU usage during runtime

As you said, this code is very simple and the workload is not large, so this kind of problem should not occur

黄舟 · Answer

Pycharm occasionally has this kind of difficulty. It is recommended to run it directly in the python environment.