How to Scrape Images from a Website Using Python? - Python Tutorial - PHP中文網

How to Scrape Images from a Website Using Python?

WBOY

Published: 2024-08-25 06:01:02




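To scrape images from a website with Python, you would typically use a few popular libraries: requests for making HTTP requests, BeautifulSoup for parsing HTML, and Pillow (the updated successor to PIL) for processing images.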

Steps to scrape images from a website with Python

The following is a simple step-by-step guide showing how to scrape images from a website:

1. Install the necessary libraries

If you have not installed these libraries yet, you can install them with pip:
pip install requests beautifulsoup4 pillow

2. Send a request and get the web page content

Use the requests library to send an HTTP request and retrieve the HTML content of the page.
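
As a minimal sketch of this step, the helper below wraps the request with a timeout and a status check. The name `fetch_html`, the 10-second timeout, and the injectable `get` parameter are illustrative assumptions, not part of the original guide; `get` exists only so the function can be exercised without network access.

```python
import requests


def fetch_html(url, get=requests.get, timeout=10):
    """Return the HTML text of `url`, raising on HTTP error statuses.

    `get` defaults to requests.get; it is a parameter (an assumption of
    this sketch) so a stand-in can be supplied when testing offline.
    """
    response = get(url, timeout=timeout)
    # Raise requests.HTTPError for 4xx/5xx responses rather than
    # silently parsing an error page
    response.raise_for_status()
    return response.text
```

Calling `fetch_html('https://example.com')` would then return the page source as a string, ready to be handed to the parser.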

3. Parse the HTML and find the image links

Use BeautifulSoup to parse the page content and find the URLs of the images.
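
One way to sketch this step is shown below. The function name `extract_image_urls` and the sample HTML are hypothetical. The sketch assumes that relative `src` values should be resolved against the page URL and that inline `data:` URIs should be skipped, neither of which the original guide spells out.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup


def extract_image_urls(html, base_url):
    """Return absolute URLs for every <img> tag in `html` that has a src."""
    soup = BeautifulSoup(html, 'html.parser')
    urls = []
    for img in soup.find_all('img'):
        src = img.get('src')  # None when the tag has no src attribute
        if not src or src.startswith('data:'):
            continue  # skip missing sources and inline data URIs
        # Resolve relative paths such as /a.png against the page URL
        urls.append(urljoin(base_url, src))
    return urls


sample = '<img src="/a.png"><img alt="no source"><img src="https://cdn.example.com/b.jpg">'
print(extract_image_urls(sample, 'https://example.com/gallery/'))
# prints ['https://example.com/a.png', 'https://cdn.example.com/b.jpg']
```

Using `img.get('src')` instead of `img['src']` avoids a `KeyError` on tags that carry no `src` attribute.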

4. Download the images

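Use the requests library again to download the content of each image from its URL, then use the Pillow library to save the image locally.
Here is a simple example: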

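import os
from io import BytesIO
from urllib.parse import urljoin, urlparse

import requests
from bs4 import BeautifulSoup
from PIL import Image

# URL of the target page
url = 'https://example.com'

# Send a request and get the web page content
response = requests.get(url, timeout=10)
response.raise_for_status()  # Stop early on 4xx/5xx responses

# Parse the HTML
soup = BeautifulSoup(response.text, 'html.parser')

# Find all image tags
images = soup.find_all('img')

# Traverse the image tags and download the images
for index, img in enumerate(images):
    src = img.get('src')  # Get the URL of the image, if the tag has one
    if not src or src.startswith('data:'):
        continue  # Skip tags without a src and inline data URIs

    # Resolve relative paths such as /static/logo.png against the page URL
    img_url = urljoin(url, src)
    try:
        img_response = requests.get(img_url, timeout=10)
        img_response.raise_for_status()

        # Use PIL to decode the image data
        image = Image.open(BytesIO(img_response.content))

        # Build a file name from the URL path, ignoring any query string
        name = os.path.basename(urlparse(img_url).path) or f'image_{index}.png'

        # Save the image locally
        image.save(f'downloaded_{name}')
    except (requests.RequestException, OSError, ValueError) as exc:
        # Covers network errors, data Pillow cannot decode (for example SVG),
        # and file names without a recognised image extension
        print(f'Skipped {img_url}: {exc}')

print('Image download complete!')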
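
Image URLs do not always end in a usable file extension, so the saving step can be sketched as a helper that names the file after the format Pillow detects. The function name `save_with_detected_format` and its `stem` and `directory` parameters are hypothetical. Note that saving through Pillow re-encodes the image, and a few formats that Pillow can read cannot be written back.

```python
import os
from io import BytesIO

from PIL import Image


def save_with_detected_format(img_data, stem, directory='.'):
    """Decode `img_data` and save it as <stem>.<detected format>.

    Returns the path written. Image.open raises an OSError subclass
    (PIL.UnidentifiedImageError) when the bytes are not a known image.
    """
    image = Image.open(BytesIO(img_data))
    # Pillow sets image.format from the file contents, e.g. 'PNG' or 'JPEG'
    extension = image.format.lower()
    path = os.path.join(directory, f'{stem}.{extension}')
    image.save(path, format=image.format)
    return path
```

Inside a download loop, this could be called with something like `save_with_detected_format(response.content, f'downloaded_{index}')`, where `index` is the position of the image on the page.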


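Note that this example may need to be adjusted for the specific website you are scraping. For example, some websites load images dynamically through JavaScript, in which case you may need a tool such as Selenium to simulate browser behavior.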
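
For pages of that kind, a rough sketch with Selenium might look like the following. The function name `rendered_image_sources` and the `driver_factory` parameter are hypothetical, and the sketch assumes Selenium and a local Chrome installation are available. A real page may also need an explicit wait before its images appear in the DOM.

```python
from bs4 import BeautifulSoup


def rendered_image_sources(url, driver_factory=None):
    """Load `url` in a browser and return the src of each <img> found."""
    if driver_factory is None:
        # Imported lazily so the helper can be exercised with a stand-in
        # driver even where Selenium is not installed
        from selenium import webdriver
        driver_factory = webdriver.Chrome  # assumes Chrome is available
    driver = driver_factory()
    try:
        driver.get(url)
        # page_source reflects the DOM as the browser currently holds it,
        # including elements that scripts have inserted so far
        html = driver.page_source
    finally:
        driver.quit()  # always close the browser, even on errors
    soup = BeautifulSoup(html, 'html.parser')
    return [img['src'] for img in soup.find_all('img') if img.get('src')]
```

The returned URLs can then be downloaded with requests in the same way as in the static example.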