beautifulsoup - python requests 高频率刷新时卡顿

Question

我用requests.get()读取固定网页上的信息（网页非常简单，不超过十个字符），然后用beautifulsoup解析，我设定的是1秒读取一次，但是发现运行时非常不稳定，有时会隔十几秒才能读到内容。这是网站服务器端的问题...

PHP中文网 · Answer

It’s blocked, so it should be done asynchronously.

天蓬老师 · Answer

Restrict the timeout of requests to see if it is a network problem.
There is also this lightweight extraction, which is enough for regular use.

PHPz · Answer

Is it blocked while waiting for a response?

阿神 · Answer

This is not a problem with requests, it is blocked. You can use an asynchronous method. In addition, I personally feel that bs is actually a bit slow. You can use etree in lxml to parse it directly.