Beautiful Soup: extracting all <br/> from <strong> - Q&A

soup = BeautifulSoup(html)
emphased = soup.find_all('strong')
for single in emphased:
    children = single.children
    before = 0
    foundText = None
    after = 0
    for child in children:
        if not isinstance(child, NavigableString):
            if foundText:
                after += 1
                child.unwrap()
            else:
                before += 1
                # DOES NOT WORK
                child.unwrap()
        else:
            foundText = single.get_text().strip()

1 answer

User

#1 · Posted 2024-06-10 15:28:52

Based on your example, you can extract all the br tags from strong, prepend them, and replace the old tag with the new markup.

Here is a snippet:

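from bs4 import BeautifulSoup

soup = BeautifulSoup("<strong>Ihre Aufgaben:<br/></strong>", "html.parser")
for strong in soup.find_all("strong"):
    # Detach every <br> inside this <strong>.
    for br in strong.find_all("br"):
        br.extract()
    # Collapse the remaining content to its stripped text.
    strong.string = strong.get_text(strip=True)
    # Re-parse the cleaned tag with a leading <br/> and swap it in.
    strong.replace_with(BeautifulSoup(" %s%s " % ("<br/>", strong), "html.parser"))
print(soup)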

Which outputs:

 <br/><strong>Ihre Aufgaben:</strong> 
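
If the position of each br matters (the question counts tags before and after the text), a variant that moves the tags in place rather than re-parsing might look like the sketch below. The function name hoist_breaks is hypothetical, and the sketch assumes the br tags are direct children of strong. It iterates over a copy of the child list, since changing the tree while looping over single.children may be one reason the loop in the question misbehaves.

```python
from bs4 import BeautifulSoup
from bs4.element import NavigableString, Tag


def hoist_breaks(html):
    """Move direct-child <br> tags out of each <strong> (hypothetical helper).

    A <br> found before any visible text is placed in front of the <strong>;
    a <br> found after visible text is placed behind it.
    """
    soup = BeautifulSoup(html, "html.parser")
    for strong in soup.find_all("strong"):
        seen_text = False
        # Iterate over a copy: extracting nodes mutates strong.contents.
        for child in list(strong.contents):
            if isinstance(child, NavigableString):
                if child.strip():
                    seen_text = True
            elif isinstance(child, Tag) and child.name == "br":
                child.extract()
                if seen_text:
                    strong.insert_after(child)
                else:
                    strong.insert_before(child)
            elif isinstance(child, Tag) and child.get_text(strip=True):
                # Any other tag that carries text counts as visible text.
                seen_text = True
    return str(soup)


if __name__ == "__main__":
    print(hoist_breaks("<strong>Ihre Aufgaben:<br/></strong>"))
```

Unlike the snippet above, this keeps a trailing br behind the strong element instead of always putting it in front; which behavior is wanted depends on the surrounding markup.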
