Table of Contents
1. Using requests
2. Using wget
3. Downloading Redirected Files
4. Download large files in chunks
5. Download multiple files (parallel/batch download)
6. Download using the progress bar
7. Use urllib to download a web page
8. Downloading through a proxy
9. Using urllib3
10. Download files from S3 using Boto3
11. Using asyncio

11 ways to download files with Python, each more advanced than the last

May 15, 2023 pm 04:31 PM

In this tutorial, you will learn how to handle the challenges you may run into when downloading files with Python, such as redirected URLs, large files, and multi-threaded downloads.

1. Using requests

You can use the requests module to download files from a URL.

Consider the following code:
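
A minimal sketch of such a download. The function name and the URL and file name in the usage comment are illustrative assumptions, and the timeout and status check are additions made for safety:

```python
import requests


def download_file(url, path):
    """Fetch url with requests and write the response body to path."""
    # The response object is kept in a variable called myfile, as in the text.
    myfile = requests.get(url, timeout=30)
    myfile.raise_for_status()  # stop on 4xx/5xx instead of saving an error page
    with open(path, "wb") as f:
        f.write(myfile.content)
    return path


# Example call (hypothetical URL and file name):
# download_file("https://example.com/logo.png", "logo.png")
```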

You simply fetch the URL with the get method of the requests module and store the result in a variable called myfile. Then you write the contents of this variable to a file.

2. Using wget

You can also use Python's wget module to download files from a URL. Install the wget module with pip by running pip install wget.

Consider the following code, which we will use to download the logo image for Python.


In this code, the URL and path (where the image will be stored) are passed to the download method of the wget module.

3. Downloading Redirected Files

In this section, you will learn how to use requests to download a file from a URL that redirects to another URL ending in .pdf.

To download this pdf file, use the following code:
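
A minimal sketch, assuming a hypothetical redirecting URL. The function name, the timeout, and the status check are illustrative additions:

```python
import requests


def download_redirected(url, path):
    """Follow redirects from url, save the final content, return the final URL."""
    # allow_redirects=True tells requests to follow the redirect chain.
    myfile = requests.get(url, allow_redirects=True, timeout=30)
    myfile.raise_for_status()
    with open(path, "wb") as f:
        f.write(myfile.content)
    # Response.url holds the URL that was finally served after redirection.
    return myfile.url


# Example call (hypothetical URL and file name):
# download_redirected("https://example.com/docs/latest/", "guide.pdf")
```

requests already follows redirects by default for GET requests, so passing allow_redirects=True mainly makes the intent explicit.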

In this code, we first specify the URL. Then we use the get method of the requests module to fetch it. In the get method, we set allow_redirects to True, which allows redirection from the URL, and the redirected content is assigned to the variable myfile.

Finally, we open a file to write the obtained content.

4. Download large files in chunks

Consider the following code:
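
A minimal sketch of a chunked download. The function name and the URL in the usage comment are illustrative assumptions. The file name PythonBook.pdf and the 1024-byte chunk size come from the description in this section:

```python
import requests


def download_in_chunks(url, path, chunk_size=1024):
    """Stream url to path without loading the whole body into memory."""
    # stream=True defers the body download until we iterate over it.
    with requests.get(url, stream=True, timeout=30) as r:
        r.raise_for_status()
        with open(path, "wb") as pdf:
            for chunk in r.iter_content(chunk_size=chunk_size):
                if chunk:  # skip empty chunks
                    pdf.write(chunk)
    return path


# Example call (hypothetical URL):
# download_in_chunks("https://example.com/PythonBook.pdf", "PythonBook.pdf")
```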

First, we use the get method of the requests module as before, but this time we set the stream parameter to True.

Next, we create a file named PythonBook.pdf in the current working directory and open it for writing.

Then, we specify the chunk size to download each time. We set it to 1024 bytes, iterate through the chunks, and write each one to the file until no chunks remain.

Isn't it beautiful? Don't worry, we will display a progress bar of the download process later.

5. Download multiple files (parallel/batch download)

To download multiple files at the same time, we import the os and time modules, requests, and the ThreadPool class. We use os and time to check how long the downloads take. ThreadPool, from multiprocessing.pool, lets you run multiple threads using a pool.

Let's create a simple function that sends the response in chunks to a file:

The urls variable is a two-dimensional array that specifies the path and the URL of each page to download.

Just like we did in the previous section, we pass this URL to requests.get. Finally, we open the file (the path specified in the URL) and write the page content.

Now, we can call this function for each URL individually, or we can call this function for all URLs at the same time. Let’s call this function for each URL individually in a for loop, paying attention to the timer:


Now, replace the for loop with the following lines of code:


Run the script.
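
The steps in this section can be sketched together as follows. The function names other than url_response, the pool size, and the paths and URLs in the usage comment are illustrative assumptions:

```python
import os
from multiprocessing.pool import ThreadPool
from time import time

import requests


def url_response(item):
    """Download one (path, url) pair, writing the response to disk in chunks."""
    path, url = item
    with requests.get(url, stream=True, timeout=30) as r:
        r.raise_for_status()
        with open(path, "wb") as f:
            for chunk in r.iter_content(chunk_size=1024):
                f.write(chunk)
    return path, os.path.getsize(path)


def download_sequential(urls):
    """Call url_response for each pair in a for loop; return elapsed seconds."""
    start = time()
    for item in urls:
        url_response(item)
    return time() - start


def download_parallel(urls, workers=4):
    """Download all pairs through a thread pool; return results and elapsed seconds."""
    start = time()
    with ThreadPool(workers) as pool:
        # Consuming the iterator with list() blocks until every download is done.
        results = list(pool.imap_unordered(url_response, urls))
    return results, time() - start


# Example call (hypothetical paths and URLs):
# urls = [("page1.html", "https://example.com/1"), ("page2.html", "https://example.com/2")]
# print(download_sequential(urls))
# print(download_parallel(urls)[1])
```

Collecting the results of imap_unordered matters here: if the iterator is never consumed, the script can reach its end before the worker threads have finished writing their files.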

6. Download using the progress bar

The progress bar is a UI component of the clint module. Install the clint module with pip by running pip install clint.

Consider the following code:

In this code, we first import the requests module and then the progress component from clint.textui. The only difference from the chunked download is in the for loop: when writing content to the file, we wrap the chunks in the bar method of the progress module.

7. Use urllib to download a web page

In this section, we will use urllib to download a web page.

The urllib library is part of Python's standard library, so you don't need to install it.

The following lines of code can easily download a web page:
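
A minimal sketch using only the standard library. The function name and the URL and path in the usage comment are illustrative assumptions:

```python
import urllib.request


def save_page(url, path):
    """Retrieve url and store it at path; return the path that was written."""
    saved_path, _headers = urllib.request.urlretrieve(url, path)
    return saved_path


# Example call (hypothetical URL and file name):
# save_page("https://example.com/", "example.html")
```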

Here, specify the URL you want to save and the path where you want to store it.

In this code, we use the urlretrieve method and pass the URL of the file, and the path to save the file. The file extension will be .html.

8. Downloading through a proxy

If you need to use a proxy to download your files, you can use the ProxyHandler of the urllib module. Please look at the following code:
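
A minimal sketch covering both urllib and requests. The function names and the proxy address in the usage comment are illustrative assumptions:

```python
import urllib.request

import requests


def build_proxy_opener(proxy_url):
    """Create an opener that routes HTTP and HTTPS traffic through proxy_url."""
    proxy = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(proxy)


def fetch_with_urllib(url, proxy_url):
    """Fetch url through the proxy with urllib and return the body as bytes."""
    opener = build_proxy_opener(proxy_url)
    with opener.open(url, timeout=30) as response:
        return response.read()


def fetch_with_requests(url, proxy_url):
    """Fetch url through the proxy with requests and return the body as bytes."""
    proxies = {"http": proxy_url, "https": proxy_url}
    response = requests.get(url, proxies=proxies, timeout=30)
    response.raise_for_status()
    return response.content


# Example calls (hypothetical proxy address and URL):
# fetch_with_urllib("https://example.com/", "http://127.0.0.1:3128")
# fetch_with_requests("https://example.com/", "http://127.0.0.1:3128")
```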

In this code, we create the proxy object and build an opener by calling urllib.request's build_opener method with the proxy object passed in. Then we make a request to retrieve the page.

In addition, you can use the requests module, as described in its official documentation.

You only need to import the requests module and create your proxy object. Then, you can get the file.

9. Using urllib3

urllib3 is a separate third-party library that improves on the standard urllib module. Install it with pip by running pip install urllib3.

We will use urllib3 to get a web page and store it in a text file.

First, we import urllib3 together with the shutil module, which we use when working with files.

Next, we initialize the URL as a string variable and create a urllib3 PoolManager, which keeps track of the necessary connection pools.

Finally, we send a GET request to fetch the URL, open a file, and write the response to that file.
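
The steps above can be sketched together as follows. The function name and the URL and file name in the usage comment are illustrative assumptions:

```python
import shutil

import urllib3


def save_with_urllib3(url, filename):
    """Fetch url with a urllib3 PoolManager and stream the body into filename."""
    http = urllib3.PoolManager()
    # preload_content=False leaves the body unread so it can be streamed.
    response = http.request("GET", url, preload_content=False)
    try:
        with open(filename, "wb") as out_file:
            shutil.copyfileobj(response, out_file)
    finally:
        # Hand the connection back to the pool once the body has been copied.
        response.release_conn()
    return filename


# Example call (hypothetical URL and file name):
# save_with_urllib3("https://example.com/", "page.txt")
```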

10. Download files from S3 using Boto3

To download files from Amazon S3, you can use the Python boto3 module.

Before you begin, install the awscli module with pip by running pip install awscli.

To configure AWS, run aws configure and enter your details at the prompts: AWS Access Key ID, AWS Secret Access Key, default region name, and default output format.

To download files from Amazon S3, you need to import boto3 and botocore. Boto3 is the Amazon SDK that lets Python access Amazon web services such as S3. Botocore is the lower-level library on which both boto3 and the AWS CLI are built.

Botocore is installed together with awscli. To install boto3, run pip install boto3. Then import both modules.

When downloading a file from Amazon S3, we need three parameters:

  • Bucket name
  • The name of the file you need to download
  • The name of the file after downloading

First, we initialize variables for these three values.

Next, we initialize a variable for the S3 resource by calling boto3's resource() method and passing in the service name, s3.

Finally, we call the download_file method with those variables to download the file.

11. Using asyncio

The asyncio module is built around an event loop that waits for an event to occur and then reacts to it, for example by calling another function. This process is called event handling. The asyncio module uses coroutines for event handling.

To use asyncio's event handling and coroutine functionality, we first import the asyncio module.

A coroutine is defined with the async keyword, which marks the function as a native asyncio coroutine. Inside a coroutine, the await keyword pauses the coroutine until the awaited operation finishes and then evaluates to that operation's result. A coroutine can also use the return keyword.

To download files from a website, we create an asynchronous coroutine function that downloads a file and returns a message.

Then another asynchronous coroutine, main_func, queues up a download for every URL and awaits them. asyncio's wait function waits for the coroutines to complete.

Finally, to start the coroutines, we obtain the event loop with asyncio's get_event_loop() method and execute it with the run_until_complete() method.
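
The steps above can be sketched as follows. To stay within the standard library, this sketch runs a blocking urllib download in a worker thread through asyncio.to_thread (Python 3.9 or later) rather than using an asynchronous HTTP client, which is an assumption of this example. The helper names other than main_func and the URLs in the usage comment are also illustrative:

```python
import asyncio
import urllib.request


def _fetch(url, path):
    """Blocking download of url to path using the standard library."""
    urllib.request.urlretrieve(url, path)
    return path


async def download(url, path):
    """Coroutine that downloads one file and returns a message."""
    # await suspends this coroutine while the worker thread does the download.
    saved = await asyncio.to_thread(_fetch, url, path)
    return f"Successfully downloaded {saved}"


async def main_func(jobs):
    """Schedule one download per (url, path) pair and wait for all of them."""
    tasks = [asyncio.create_task(download(url, path)) for url, path in jobs]
    done, _pending = await asyncio.wait(tasks)  # jobs must not be empty
    return sorted(task.result() for task in done)


# Example call (hypothetical URLs and file names):
# jobs = [("https://example.com/a", "a.html"), ("https://example.com/b", "b.html")]
# print(asyncio.run(main_func(jobs)))
```

asyncio.run, available since Python 3.7, creates and closes the event loop itself, so it stands in here for the get_event_loop() and run_until_complete() pair.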

Downloading files using Python is fun. Hope this tutorial is useful to you!
