html - Why isn't the list returned by BeautifulSoup find_all sorted in the order shown on the web page?

Question

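I want to scrape jokes from Qiushibaike (糗百) and show each author with the corresponding joke, starting with just the first page {code...} print output: {code...} html.fromstring with xpath behaves the same way {code...} print output: {code...} But the actual display order on the web page is: {code...} How can I make the returned l...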

大家讲道理 · Answer

This is most likely because the ordering of the content on the page itself keeps changing. The site ranks jokes by their "funny" count. When the counts are close, the order shifts routinely, and new jokes are sometimes added to the page as well. Your browser and your crawler fetch the page at different times, so it is normal for the order of the jokes you see to differ between them.
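The point can be illustrated with a small sketch. It uses only the standard-library `html.parser` rather than BeautifulSoup, and the markup (`<h2 class="author">`), the author names, and the two snapshot strings are all made up for illustration; they are not taken from the real site. Within each snapshot the names come back in document order, and the two lists differ only because the snapshots themselves differ.

```python
from html.parser import HTMLParser


class AuthorCollector(HTMLParser):
    """Collect the text of every <h2 class="author"> in document order."""

    def __init__(self):
        super().__init__()
        self.authors = []
        self._in_author = False

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs
        if tag == "h2" and ("class", "author") in attrs:
            self._in_author = True
            self.authors.append("")

    def handle_data(self, data):
        if self._in_author:
            self.authors[-1] += data

    def handle_endtag(self, tag):
        if tag == "h2":
            self._in_author = False


def authors_in_order(html_text):
    """Return author names in the order they appear in html_text."""
    collector = AuthorCollector()
    collector.feed(html_text)
    collector.close()
    return [name.strip() for name in collector.authors]


# Hypothetical snapshots of the same page fetched at two different moments.
SNAPSHOT_BROWSER = """
<h2 class="author">alice</h2><h2 class="author">bob</h2><h2 class="author">carol</h2>
"""
SNAPSHOT_CRAWLER = """
<h2 class="author">bob</h2><h2 class="author">alice</h2><h2 class="author">dave</h2>
"""

browser_order = authors_in_order(SNAPSHOT_BROWSER)
crawler_order = authors_in_order(SNAPSHOT_CRAWLER)

print(browser_order)                   # ['alice', 'bob', 'carol']
print(crawler_order)                   # ['bob', 'alice', 'dave']
print(browser_order == crawler_order)  # False: the pages differ, not the parser
```

To check this against the real page, one option is to save the exact response text that the crawler parsed to a file and open that file in the browser, instead of reloading the live URL. If the parsed order matches the saved file, the difference comes from the page changing between requests.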