Parsing Web Page Information with BeautifulSoup in Python

WBOY

Published: 2016-06-10 15:16:19

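This article shows, by example, how to use BeautifulSoup in Python to parse information from a web page. The details are as follows.

This Python code finds all the links on a web page, then looks up every span tag whose class contains titletext and prints its contents.

The code is as follows: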

#import the library used to query a website
import urllib2

#specify the url you want to query
url = "http://www.python.org"

#Query the website and read the returned html into the variable 'html'
page = urllib2.urlopen(url)
html = page.read()

#import the Beautiful soup functions to parse the data returned from the website
from BeautifulSoup import BeautifulSoup

#Parse the html in the 'html' variable, and store it in Beautiful Soup format
soup = BeautifulSoup(html)

#soup.head is the head tag and soup.head.title is the title tag
print soup.head
print soup.head.title

#to print the length of the page, use the len function on the html string
#(the object returned by urlopen has no length of its own)
print len(html)

#create a new variable to store the data you want to find.
tags = soup.findAll('a')

#to print all the links
print tags

#to get all spans with class 'titletext' and print the contents of each one
titles = soup.findAll('span', attrs = { 'class' : 'titletext' })
for title in titles:
    print title.contents
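
The listing above targets Python 2 (urllib2, print statements) and the older BeautifulSoup 3 package. For readers on Python 3, a rough counterpart might look like the sketch below. It assumes the bs4 package (Beautiful Soup 4) is installed. The inline SAMPLE_HTML page and the summarize helper are hypothetical, introduced only so the example runs without network access.

```python
from bs4 import BeautifulSoup  # assumption: Beautiful Soup 4 (bs4) is installed

# Hypothetical sample page, used here instead of a live request
SAMPLE_HTML = """
<html>
  <head><title>Sample page</title></head>
  <body>
    <a href="/about">About</a>
    <a href="/downloads">Downloads</a>
    <span class="titletext">First title</span>
    <span class="other">Not a title</span>
    <span class="titletext featured">Second title</span>
  </body>
</html>
"""


def summarize(html):
    """Return the page title, all link targets, and the text of titletext spans."""
    # "html.parser" is the parser bundled with Python, so no extra parser is needed
    soup = BeautifulSoup(html, "html.parser")

    # soup.title is the <title> tag, or None if the page has none
    title = soup.title.string if soup.title else None

    # find_all is the bs4 spelling of the older findAll
    links = [a.get("href") for a in soup.find_all("a")]

    # class_ matches any span whose class list includes "titletext"
    spans = soup.find_all("span", class_="titletext")
    titles = [span.get_text(strip=True) for span in spans]
    return title, links, titles


title, links, titles = summarize(SAMPLE_HTML)
print(title)
print(links)
print(titles)
```

To run the same helper against a live page, the HTML could be fetched with urllib.request.urlopen(url).read() and passed to summarize unchanged, since Beautiful Soup accepts both bytes and str.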

I hope this article helps with your Python programming.
