需要在网站点击保存,完成弹出的表单来定制下载:点击后,出现这两个:
然而并没有一个链接是直接打开就能下载的那种,之前只爬过静态网页,非常小白,网上介绍的案例往往是用正则匹配到文件的下载链接直接写入就可以,不清楚这种需要提交表单,并且下载的文档内容由表单决定的情况需要怎么处理。求指教,非常感谢。
学习是最好的投资!
You should first analyze the request header, submission parameter information, and returned header information of the submission form, and then use python to simulate submission and finally solve the problem. This is generally the idea
You should first analyze the request header, submission parameter information, and returned header information of the submission form, and then use python to simulate submission and finally solve the problem. This is generally the idea