Python: parsing a table with BeautifulSoup

from urllib.request import urlopen

from bs4 import BeautifulSoup

soup = BeautifulSoup(urlopen('https://personal.vanguard.com/us/FundsAllHoldings?FundId=0970&FundIntExt=INT&tableName=Equity&tableIndex=0').read(), "html.parser")
print(soup.prettify())
print(soup('tbody'))
table = soup.find("tbody", {"class": "Holding"})
print(table)
for row in table.find_all("tr"):
    cells = row.find_all("td")

2 answers

User

#1 · Edited 2024-04-16 17:18:18

You can select all the rows with the following expression:

soup.select('tbody tr')

Then, for each row, you can extract all of its columns:

row.select('td')

After that, you only need to filter for the columns you want.
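As a minimal sketch of the approach above, assuming Python 3 and a small inline HTML table standing in for the Vanguard page: the sample rows, the `SAMPLE_HTML` name and the `extract_rows` helper are made up for illustration and are not part of the original answer.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the real holdings page.
SAMPLE_HTML = """
<table>
  <thead><tr><th>Holding</th><th>Shares</th><th>Market value</th></tr></thead>
  <tbody>
    <tr><td>Alpha Corp</td><td>100</td><td>$1,000</td></tr>
    <tr><td>Beta Inc</td><td>250</td><td>$2,500</td></tr>
  </tbody>
</table>
"""


def extract_rows(html, wanted=(0, 2)):
    """Return the text of the wanted column indexes for every tbody row."""
    soup = BeautifulSoup(html, "html.parser")
    result = []
    # 'tbody tr' matches every <tr> nested anywhere inside a <tbody>.
    for row in soup.select("tbody tr"):
        cells = row.select("td")
        # Skip rows that do not have enough cells to index safely.
        if len(cells) <= max(wanted):
            continue
        result.append([cells[i].get_text(strip=True) for i in wanted])
    return result


rows = extract_rows(SAMPLE_HTML)
for holding, market_value in rows:
    print(holding, market_value)
```

Passing a different `wanted` tuple picks other columns. On the real page the column positions would need to be checked against the actual markup.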

User

#2 · Edited 2024-04-16 17:18:18

from urllib.request import urlopen

from bs4 import BeautifulSoup

url = 'https://personal.vanguard.com/us/FundsAllHoldings?FundId=0970&FundIntExt=INT&tableName=Equity&tableIndex=0'
# Name the parser explicitly so the result does not depend on which parsers are installed
soup = BeautifulSoup(urlopen(url), "html.parser")
table = soup.find("tbody", {"class": "right"})
for row in table.find_all("tr"):
    cells = row.find_all("td")
    if len(cells) > 2:  # skip header rows and rows without a third cell
        holding = cells[0]
        mv = cells[2]
        print(holding, mv)
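The loop above prints whole `<td>` tags and fails if `find` returns `None`. A hedged variant, assuming Python 3, could return plain text and tolerate a missing table. The markup in `SAMPLE_HTML` and the `holdings_with_value` helper are hypothetical, and the real page's structure and class names may differ.

```python
from bs4 import BeautifulSoup

# Hypothetical markup; the real page's structure and class names may differ.
SAMPLE_HTML = """
<table>
  <tbody class="right">
    <tr><th>Holding</th><th>Shares</th><th>Market value</th></tr>
    <tr><td>Alpha Corp</td><td>100</td><td>$1,000</td></tr>
    <tr><td>Beta Inc</td><td>250</td><td>$2,500</td></tr>
  </tbody>
</table>
"""


def holdings_with_value(html, css_class="right"):
    """Return (first cell, third cell) text pairs from the matching tbody."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("tbody", class_=css_class)
    if table is None:
        # No matching <tbody>: return nothing instead of raising AttributeError.
        return []
    pairs = []
    for row in table.find_all("tr"):
        cells = row.find_all("td")
        # Header rows use <th>, so they have no <td> cells and are skipped here.
        if len(cells) > 2:
            pairs.append((cells[0].get_text(strip=True),
                          cells[2].get_text(strip=True)))
    return pairs


pairs = holdings_with_value(SAMPLE_HTML)
print(pairs)
```

An empty result is a hint to inspect the fetched HTML, for example with `soup.prettify()`, and check whether the expected `tbody` and class are actually present in it.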
