查找并将每个引用附加到html链接Python

网友

1楼 · 编辑于 2024-05-16 22:15:13

如果这真的是你要做的，你可以用sed和它的-i选项来重写文件：

sed -e 's,href="/wiki,href="/home/fergus/wikiget/wiki,' wiki-file.html

但是，这里有一个使用可爱的lxmlAPI的Python解决方案，以防您需要执行更复杂的操作，或者可能有格式错误的HTML等：

^{pr2}$

注意，lxml对于这类任务可能比BeautifulSoup更好，因为BeautifulSoup的作者给出了reasons。在

网友

2楼 · 编辑于 2024-05-16 22:15:13

这是使用re模块的解决方案：

#!/usr/bin/env python
import re

open('output.html', 'w').write(re.sub('href="http://en.wikipedia.org', 'href="/home/fergus/wikiget/wiki/Absinthe', open('file.html').read()))

这是另一个没有使用re的方法：

^{pr2}$

网友

3楼 · 编辑于 2024-05-16 22:15:13

可以使用函数re.sub公司公司名称：

def match(m):
    return '<a href="/home/fergus/wikiget' + m.group(1) + '">'

r = re.compile(r'<a\shref="([^"]+)">')
r.sub(match, yourtext)

例如：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

查找并将每个引用附加到html链接Python

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >