更改HTML文本并保存回HTML

<html><head><meta http-equiv="Content-Type" content="text/html;charset=utf-8" /><link href="style.css" rel="stylesheet" type="text/css" /><title> <span>Name. Of the book.</span> </title></head> ... </div>

2条回答

网友

1楼 · 编辑于 2024-05-23 20:44:15

您可以使用wrap()方法（doc）将文本包装成<span>标记-它将更新整个HTML结构

例如：

data = '''<html><head><meta http-equiv="Content-Type" content="text/html;charset=utf-8" /><link href="style.css" rel="stylesheet" type="text/css" /><title> Name. Of the book. </title></head>'''

from bs4 import BeautifulSoup

soup = BeautifulSoup(data, 'html.parser')

print('Before:')
print('-' * 80)
print(soup.prettify())
print('-' * 80)

for text in soup.find_all(text=True):
    text.wrap(soup.new_tag("span"))     # use wrap() function to wrap the text into <span> tag

print('After:')
print('-' * 80)
print(soup.prettify())
print('-' * 80)

打印（注意<title>标记内的<span>）：

Before:
                                        
<html>
 <head>
  <meta content="text/html;charset=utf-8" http-equiv="Content-Type"/>
  <link href="style.css" rel="stylesheet" type="text/css"/>
  <title>
   Name. Of the book.
  </title>
 </head>
</html>
                                        
After:
                                        
<html>
 <head>
  <meta content="text/html;charset=utf-8" http-equiv="Content-Type"/>
  <link href="style.css" rel="stylesheet" type="text/css"/>
  <title>
   <span>
    Name. Of the book.
   </span>
  </title>
 </head>
</html>

网友

2楼 · 编辑于 2024-05-23 20:44:15

好吧，我有一个非常天真但非常有效的方法。您可以先获取整个html代码，然后将其存储在字符串中，然后对其使用Regular Expression来提取span标记的文本。

这是我现在唯一能想到的方法。希望这有帮助：）

相关问题更多 >

编程相关推荐

热门问题

热门文章

更改HTML文本并保存回HTML

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >