无法使用beautiful soup获取数据

import mechanize import urlparse from bs4 import BeautifulSoup op = mechanize.Browser() op.open("https://www.mentalhelp.net/symptoms/") for link in op.links(): print link.text print urlparse.urljoin(link.base_url, link.url) get = BeautifulSoup(urllib2.urlopen("https://www.mentalhelp.net/symptoms/").read()).findAll('p') print get print "\n"

1条回答

网友

1楼 · 发布于 2024-04-18 18:41:29

缩进在Python中很重要，它用于确定块，如for循环、if块、while循环或函数等

在您给出的代码中，for循环后面的语句在for循环中没有正确缩进，for循环希望其主体中至少有一条语句，我认为您希望for循环下面的行在for循环中，因此您应该在for循环中缩进它们。你知道吗

代码-

for link in op.links():
    print link.text
    print urlparse.urljoin(link.base_url, link.url)
    get = BeautifulSoup(urllib2.urlopen("https://www.mentalhelp.net/symptoms/").read()).findAll('p')
    print get
    print "\n"

虽然我不确定这是否会得到你想要的，它会修复你目前的错误。你知道吗

对于只获取classic symptoms及其描述的新需求，可以使用-

soup = BeautifulSoup(urllib2.urlopen("https://www.mentalhelp.net/symptoms/").read())
for div in soup.findAll('div',{'id':'page'}):
    for entrydiv in div.findAll('div',{'class':'h4 entry-title'}):
        print(entrydiv.get_text())
        print(entrydiv.next_sibling.get_text())

相关问题更多 >

编程相关推荐

热门问题

热门文章