Community

Learn

Tools Library

AI Tools

Leisure

English

Home > Backend Development > PHP Tutorial > 用bs4爬取标签内的text的问题

用bs4爬取标签内的text的问题

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Release： 2016-06-06 20:13:07

Original

1542 people have browsed it

def get_coursename(info):
  info = get_content(url)
  soup = BeautifulSoup(info)
  
  all_coursename = soup.find_all('h2', class_="color-primary-text headline-1-text flex-1")
  
  #print all_coursename
  
  f = open("course_coursename.txt","w")
  for coursename in all_coursename:
      detail = soup.h2.get_text()
      
      print detail
      f.write(detail + '\n' )
      f.close
  return all_coursename

Copy after login

Copy after login

以上是我的代码，使用soup.find_all（）函数后在coursera得到64个标签段，但是使用递归对象和写入文件后，controlb后得到了64个第一个课程的名字，如下，求大神解答

Buddhism and Modern Psychology
Buddhism and Modern Psychology
.
.
.
.

回复内容：

def get_coursename(info):
  info = get_content(url)
  soup = BeautifulSoup(info)
  
  all_coursename = soup.find_all('h2', class_="color-primary-text headline-1-text flex-1")
  
  #print all_coursename
  
  f = open("course_coursename.txt","w")
  for coursename in all_coursename:
      detail = soup.h2.get_text()
      
      print detail
      f.write(detail + '\n' )
      f.close
  return all_coursename

Copy after login

Copy after login

以上是我的代码，使用soup.find_all（）函数后在coursera得到64个标签段，但是使用递归对象和写入文件后，controlb后得到了64个第一个课程的名字，如下，求大神解答

Buddhism and Modern Psychology
Buddhism and Modern Psychology
.
.
.
.

你的for循环里应该使用循环变量coursename而不是soup

代码不全啊，我发现几个小问题。你的get_coursename()的参数 info 是不是多余了？另外你在get_coursename()里面直接调用get_content(url)这不是无中生有吗？

Related labels：

css html html5 php python

Previous article：javascript - ecshop登录失败直接在当前页面提示错误信息，不跳转页面 Next article：javascript - angular 控制台的模版请求

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

How LLMs Work: Pre-Training to Post-Training, Neural Networks, Hallucinations, and Inference

2025-02-26 03:58:14
I Combined the Blockchain and AI to Generate Art. Here’s What Happened Next.

2025-02-26 03:38:10
Advanced Prompt Engineering: Chain of Thought (CoT)

2025-02-26 03:17:10
Retrieval Augmented Generation in SQLite

2025-02-26 02:49:09
How to Use an LLM-Powered Boilerplate for Building Your Own Node.js API

2025-02-26 01:08:13
LLMs for Coding in 2024: Price, Performance, and the Battle for the Best

2025-02-26 00:46:10
Prompting Vision Language Models

2025-02-25 23:42:08
How to Measure the Reliability of a Large Language Model's Response

2025-02-25 22:50:13
An Illusion of Life

2025-02-25 21:54:11
Scientists Go Serious About Large Language Models Mirroring Human Thinking

2025-02-25 20:45:11

Latest Issues

python - Are there any related forums or books about Python web development?

From 1970-01-01 08:00:00

0

0

0

python - Ubuntu16.04 lxml error reporting

From 1970-01-01 08:00:00

0

0

0

python scrapy crawler error

From 1970-01-01 08:00:00

0

0

0

string - Python string case-insensitive replacement

From 1970-01-01 08:00:00

0

0

0

python3.x - Java calls python, and the python code stops automatically, and the reason cannot be found

From 1970-01-01 08:00:00

0

0

0

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template