Storing the unknown id of an HTML tag

Posted 2024-04-26 11:24:45


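I'm trying to scrape an HTML page with BeautifulSoup, but I've run into a problem finding a tag's id in Python 3.4. I know which tag it is ("tr"), but its id keeps changing, and I want to save the id each time it changes. For example: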

<div class="thisclass">
  <table id="thistable">
    <tbody>
      <tr id="what i want">
        <td class="someinfo"></td>
      </tr>
    </tbody>
  </table>
</div>

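I can find the div and the table, and I know the tr tag is inside them, but I want to extract the text of its id attribute without knowing in advance what that text will be.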

Here is the code I have so far:

soup = BeautifulSoup(url.read())
divTag = soup.find_all("table",id ="thistable")
i = 0
for i in divTag:
  trtag = soup.find("tr", id) 
  print(trtag)    
  i = i+1

I'd really appreciate it if someone could help me solve this.


1 answer

Anonymous user

Reply #1 · Posted 2024-04-26 11:24:45

You can use a CSS selector with select():

print([element.get('id') for element in soup.select('table#thistable tr[id]')])
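
For anyone who wants to try this end to end, here is a minimal sketch of the selector approach. It assumes the page looks like the snippet in the question. The inline `html` string, the built-in "html.parser" backend, and the variable names are my own stand-ins, not part of the original code. It also shows what should be an equivalent lookup with find(), where id=True matches any tag that carries an id attribute.

```python
from bs4 import BeautifulSoup

# Sample markup modelled on the question; the id value is a placeholder (assumption).
html = """
<div class="thisclass">
  <table id="thistable">
    <tbody>
      <tr id="what i want">
        <td class="someinfo"></td>
      </tr>
    </tbody>
  </table>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# CSS selector: every <tr> that has an id attribute inside table#thistable.
row_ids = [row.get("id") for row in soup.select("table#thistable tr[id]")]
print(row_ids)

# Same idea with find(): id=True matches a tag with any id value.
table = soup.find("table", id="thistable")
first_row = table.find("tr", id=True) if table is not None else None
first_id = first_row["id"] if first_row is not None else None
print(first_id)
```

Because the id comes back as a plain string, you can store it in a variable or write it to a file and compare it with the value from the next fetch to notice when it changes.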
