If you really have no idea, here is some information for your reference, but the key point is that you still have to read it...
First of all, what is a web crawler:
You can simply take a look at wiki-web crawler
Come to this website again: The University Mathematics School has some simple teachings (and videos), which I believe are very suitable for beginners. You can start from this article: What is a web crawler
Then comes Introduction tutorial:
In fact, there is a very simple method. Just pick a crawler tool to read the document. If you have no direction, you can choose to read the Beautiful Soup Chinese document. It is a Chinese version and it is not too complicated. Take some time. You can read the whole thing.
Just now, the University Mathematics Hall has a series of introductory teaching and practical teaching. I think it should be worth referring to. Here are the first few articles of the introductory course:
Start writing a web crawler (Crawler) using Python
How to install Jupyter (Ipython Notebook)
Introduction to Jupyter operation (1)
How to use GET to crawl web content?
How to use POST to capture web content?
How to use Python package: BeautifulSoup4 to analyze web content?
How to use Python requests and BeautifulSoup4 to complete Taobao crawler?
The next step is to understand those tools and crawler framework:
This place is very complete: Python crawler tool list with Github code download link
This blog also has a lot of teachings
For discussions about crawler tools and frameworks, please refer to this article Zhihu: When writing crawlers in Python, which method and framework is better?
I think there are a lot of resources on the Internet, you can give them a try. Everything is difficult at the beginning, so come on!
If you really have no idea, here is some information for your reference, but the key point is that you still have to read it...
First of all, what is a web crawler:
You can simply take a look at wiki-web crawler
Come to this website again: The University Mathematics School has some simple teachings (and videos), which I believe are very suitable for beginners. You can start from this article: What is a web crawler
Then comes Introduction tutorial:
In fact, there is a very simple method. Just pick a crawler tool to read the document. If you have no direction, you can choose to read the Beautiful Soup Chinese document. It is a Chinese version and it is not too complicated. Take some time. You can read the whole thing.
Just now, the University Mathematics Hall has a series of introductory teaching and practical teaching. I think it should be worth referring to. Here are the first few articles of the introductory course:
Start writing a web crawler (Crawler) using Python
How to install Jupyter (Ipython Notebook)
Introduction to Jupyter operation (1)
How to use GET to crawl web content?
How to use POST to capture web content?
How to use Python package: BeautifulSoup4 to analyze web content?
How to use Python requests and BeautifulSoup4 to complete Taobao crawler?
The next step is to understand those tools and crawler framework:
This place is very complete: Python crawler tool list with Github code download link
This blog also has a lot of teachings
For discussions about crawler tools and frameworks, please refer to this article Zhihu: When writing crawlers in Python, which method and framework is better?
I think there are a lot of resources on the Internet, you can give them a try. Everything is difficult at the beginning, so come on!