Python-Beautifulsoup解析htm

<tr class="odd" > <td><a href="show_result.php?id=7084083" title="Show the User ID DB records for the id '7084083'" tabindex="5" >7084083</A></td> <td><a href="show_result.php?name=bernd" title="Show the User ID DB records the name 'bernd'" >bernd</A></td> <td><a href="show_result.php?range=DDF+User" title="range_link" >DDF User</A></td> <td>mandatory</td> <td>Solaris</td> <td>valid</td>   <td><a class="navlink" href="show_detail.php?rec_id=283330130" title="show the detail for this entry [alt-E]" accesskey="E"><img src="detail.gif" alt="show the detail for this entry [alt-E]" title="show the detail for this entry [alt-E]" border="0"> </a></td> </tr>

1条回答

网友

1楼 · 发布于 2024-06-16 08:27:12

由于您正在搜索html的特定部分，因此使用re而不是bs4可能更容易：

import re
s = """
<tr  class="odd" >
<td><a href="show_result.php?id=7084083" title="Show the User ID DB records for the id '7084083'"  tabindex="5" >7084083</A></td>
<td><a href="show_result.php?name=bernd" title="Show the User ID DB records the name 'bernd'"   >bernd</A></td>
<td><a href="show_result.php?range=DDF+User" title="range_link"   >DDF User</A></td>
<td>mandatory</td>
<td>Solaris</td>
<td>valid</td>
<!  xxxx old style   >
<!  xxxx showdetail navlink  >
<td><a class="navlink" href="show_detail.php?rec_id=283330130"  title="show the detail for this entry [alt-E]" accesskey="E"><img src="detail.gif" alt="show the detail for this entry [alt-E]" title="show the detail for this entry [alt-E]" border="0">&nbsp;</a></td>
</tr>
"""
final_id = re.findall('(?<=id\=)\d+', s)[0]

输出：

'7084083'

相关问题更多 >

编程相关推荐

热门问题

热门文章