如何使用 Python 網路抓取從網站中提取日出和日落時間？-Python教學-PHP中文網

如何使用 Python 網路抓取從網站中提取日出和日落時間？

Linda Hamilton

發布： 2024-10-26 09:01:29

原創

735 人瀏覽過

How can I extract Sunrise and Sunset times from a website using Python web scraping?

使用Python 進行網頁內容抓取

網頁抓取是從線上來源擷取資料的過程，是一種無需手動即可收集資訊的寶貴技術干涉。在這個問題中，我們將探索如何使用 Python 抓取網頁內容。

用於網頁抓取的 Python 模組

Python 提供了多個模組來促進網頁抓取。兩個比較突出的是：

urllib2: 用於請求和處理網頁。
BeautifulSoup: 一個 HTML 解析庫，可以簡化從複雜的網路結構中擷取資料。

Web 內容抓取教學

為了說明使用Python 進行網頁抓取，請考慮從以下位置提取日出/日落時間的範例網站：

<code class="python">import urllib2
from BeautifulSoup import BeautifulSoup

# Open the web page containing the sunrise/sunset times
web_page = urllib2.urlopen('http://example.com')

# Parse the page using BeautifulSoup
soup = BeautifulSoup(web_page.read())

# Find the table containing the times
table = soup.find('table', {'class': 'spad'})

# Loop through the table rows
for row in table.find('tbody').find_all('tr'):
    # Extract the date and times
    tds = row.find_all('td')
    date = tds[0].string
    sunrise = tds[1].string

    # Print the results
    print(date, sunrise)</code>

登入後複製

此腳本示範如何解析包含日出/日落時間的表，使用Python 模組和適當的HTML 選擇器擷取相關資料。

以上是如何使用 Python 網路抓取從網站中提取日出和日落時間？的詳細內容。更多資訊請關注PHP中文網其他相關文章！