How to scrape images from a website using Python?

WBOY

Published: 2024-08-25 06:01:02


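To scrape images from a website with Python, you typically use a few popular libraries: requests for making HTTP requests, BeautifulSoup for parsing HTML, and Pillow (the maintained fork of PIL) for handling images.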

Steps to scrape images from a website with Python

The following is a simple step-by-step guide to scraping images from a website:

1. Install the necessary libraries

If you have not installed these libraries yet, you can install them with pip:
pip install requests beautifulsoup4 pillow

2. Send a request and get the web page content

Use the requests library to send an HTTP request and get the HTML content of the web page.
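
As a minimal sketch of this step, the request can be wrapped in a small helper. The function name `fetch_html` and the 10-second default timeout are illustrative assumptions, not part of the original example.

```python
import requests


def fetch_html(url, timeout=10):
    """Return the HTML text of `url` (hypothetical helper for illustration)."""
    # Without a timeout, requests can wait indefinitely on an unresponsive server.
    response = requests.get(url, timeout=timeout)
    # Raise requests.HTTPError for 4xx/5xx responses instead of parsing an error page.
    response.raise_for_status()
    return response.text
```

Calling `fetch_html('https://example.com')` would return the page source as a string, or raise an exception if the request fails.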

3. Parse the HTML and find the image links

Use BeautifulSoup to parse the page content and find the image URLs.
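
A minimal sketch of this step might look like the following. The helper name `extract_image_urls` is hypothetical; it assumes image addresses live in the `src` attribute of `<img>` tags and uses `urllib.parse.urljoin` to turn relative paths into absolute URLs.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup


def extract_image_urls(html, base_url):
    """Return absolute image URLs found in `html` (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    urls = []
    for img in soup.find_all("img"):
        src = img.get("src")  # None when the tag has no src attribute
        if not src or src.startswith("data:"):
            continue  # skip missing sources and inline data URIs
        urls.append(urljoin(base_url, src))  # resolve relative paths
    return urls


# Small inline sample (made up for illustration) with one relative and one absolute URL
sample = '<img src="/a.png"><img><img src="https://cdn.example.com/b.jpg">'
print(extract_image_urls(sample, "https://example.com/page"))
```

Resolving against the page URL matters because many sites write image paths such as `/a.png`, which requests cannot fetch on its own.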

4. Download the images

Use the requests library again to download each image from its URL, and use the Pillow library to save the image locally.
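
One detail worth sketching here is how to choose a local file name. The hypothetical helper below, `filename_from_url`, takes the last path segment of the image URL and ignores any query string; the fallback name `image.png` is an assumption for URLs whose path has no file name.

```python
import os
from urllib.parse import unquote, urlparse


def filename_from_url(image_url, default="image.png"):
    """Derive a local file name from an image URL (hypothetical helper)."""
    path = urlparse(image_url).path  # drops the query string and fragment
    name = os.path.basename(unquote(path))  # last path segment, percent-decoded
    return name or default


print(filename_from_url("https://example.com/img/logo.png?v=3"))  # logo.png
print(filename_from_url("https://example.com/"))  # image.png
```

The returned name can then be passed to Pillow's `Image.save`, which infers the output format from the file extension.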
Here is a simple example:

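import os
from io import BytesIO
from urllib.parse import urljoin, urlparse

import requests
from bs4 import BeautifulSoup
from PIL import Image

# URL of the target page
url = 'https://example.com'

# Send a request and get the web page content
response = requests.get(url, timeout=10)
response.raise_for_status()  # Stop early on HTTP errors such as 404 or 500

# Parse the HTML
soup = BeautifulSoup(response.text, 'html.parser')

# Find all image tags
images = soup.find_all('img')

# Traverse the image tags and download the images
for index, img in enumerate(images):
    src = img.get('src')  # Get the URL of the image; some tags have no src
    if not src or src.startswith('data:'):
        continue
    img_url = urljoin(url, src)  # Resolve relative URLs against the page URL

    try:
        img_response = requests.get(img_url, timeout=10)
        img_response.raise_for_status()
        # Use PIL to decode the image data
        image = Image.open(BytesIO(img_response.content))
        # Build a file name from the URL path, ignoring any query string
        name = os.path.basename(urlparse(img_url).path) or f'image_{index}.png'
        # Save the image locally
        image.save(f'downloaded_{name}')
    except (requests.RequestException, OSError, ValueError) as exc:
        # Skip images that cannot be fetched, decoded, or saved
        print(f'Skipped {img_url}: {exc}')

print('Image download complete!')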


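Note that this sample code may need to be adjusted for the specific website you are scraping. For example, some websites load images dynamically through JavaScript, in which case you may need a tool such as Selenium to simulate browser behavior.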