为什么这个美丽组代码输出“无”？

import urllib2 from BeautifulSoup import BeautifulSoup contenturl = "http://espnfc.com/tables/_/league/esp.1/spanish-la-liga?cc=5901" soup = BeautifulSoup(urllib2.urlopen(contenturl).read()) table = soup.find('div id', attrs={'class': 'content'}) rows = soup.findAll('tr') for tr in rows: cols = tr.findAll('td') for td in cols: text = td.find(text=True) print text, print

2条回答

网友

1楼 · 编辑于 2024-05-17 19:51:50

如果你在网站上注意到，一些信息之间有空格，这些信息包含在每个td中。在

您可能会注意到所有的空间都有一个宽度。所以，你可以这样做：

cols = tr.findAll('td', width=None)

如果您决定在任何阶段交换到BeautifulGroup 4，请使用：

^{pr2}$

网友

2楼 · 编辑于 2024-05-17 19:51:50

当一个元素有多个子元素时，如The Docs中所示，则发生None

去除None的最简单方法如下：

for tr in rows:
    cols = tr.findAll('td')
    for td in cols:
        text = td.find(text=True)
        if text is not None:
            print text,  
    print

它将检查text = None如果是，则不会打印它

相关问题更多 >

编程相关推荐

热门问题

热门文章