我收到AttributeError:“非类型”对象没有属性“文本”,但在我的“HTML代码”中找不到错误

2024-03-28 22:15:47 发布

您现在位置:Python中文网/ 问答频道 /正文

我一直收到这个错误&;已多次检查html源代码,如下所附。我很确定我找到了正确的元素,因此文本部分在技术上应该存在

背景:构建webscraper;试图提取标题。奇怪的是,用于提取摘要文本的代码&;链接两个工作-只是不是标题

from bs4 import BeautifulSoup
import requests 
url = "https://www.ribbonfarm.com/"
source = requests.get(url, headers = {"User-Agent" : "Mozilla/5.0"}).text
soup = BeautifulSoup(source, 'lxml')

article = soup.find("div")

headline = soup.find("h1", class_ = "entry-title").text
print (headline)

我收到的错误消息:回溯(上次最近的呼叫): 文件“main.py”,第9行,在 headline=soup.find(“h1”,class=“条目标题”)。文本 AttributeError:“非类型”对象没有属性“文本”

HTML Source Code from Website

谢谢大家!


Tags: textfrom文本importurl标题source错误
1条回答
网友
1楼 · 发布于 2024-03-28 22:15:47

不是100%清楚你到底想要什么。不过,这是一条路

from bs4 import BeautifulSoup
import requests
url = "https://www.ribbonfarm.com/"
source = requests.get(url, headers = {"User-Agent" : "Mozilla/5.0"})
soup = BeautifulSoup(source.content, 'html.parser')



article = soup.find("div", class_="hfeed")
title_h2 = article.find_all('div')
for title in title_h2:
    ans = title.find('a')
    print(ans.text)

输出:-

MJD 59,169
Venkatesh Rao
Captain's Log
Captain's Log
6 Comments
MJD 59,163
Venkatesh Rao
Captain's Log
Captain's Log
1 Comment
MJD 59,151
Venkatesh Rao
Captain's Log
Captain's Log
3 Comments
MJD 59,145
Venkatesh Rao
Captain's Log
Captain's Log
4 Comments
MJD 59,143
Venkatesh Rao
Captain's Log
Captain's Log
5 Comments
Clockmaking: 2
Venkatesh Rao
Clockmaking
Clockmaking
4 Comments
MJD 59,128
Venkatesh Rao
Captain's Log
Captain's Log
2 Comments
Mansionism 2: Bungalows
Venkatesh Rao
Mansionism
Mansionism
6 Comments
Notes: The Marshall Plan by Benn Steil
Venkatesh Rao
Book Notes
Book Notes

2 Comments
Epistemic Reserve Notes
Mike Elias
Wittgenstein’s Revenge,
1 Comment
1

相关问题 更多 >