An elegant way to extract hashtags from a string in Python?

3 answers

User

Answer 1 · edited 2024-05-14 07:59:42

The findall method of regular expression objects can get them all at once:

>>> import re
>>> s = "this #is a #string with several #hashtags"
>>> pat = re.compile(r"#(\w+)")
>>> pat.findall(s)
['is', 'string', 'hashtags']
>>>
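A minimal sketch of how this could be wrapped for reuse. The names `HASHTAG_RE` and `extract_hashtags` are hypothetical and not part of the answer; the pattern itself is the one shown above.

```python
import re

# Same pattern as in the answer: "#" followed by one or more word characters.
HASHTAG_RE = re.compile(r"#(\w+)")

def extract_hashtags(text):
    """Return hashtag names (without the leading '#') in order of appearance."""
    # With a single capture group, findall returns only the captured text.
    return HASHTAG_RE.findall(text)

print(extract_hashtags("this #is a #string with several #hashtags"))
# ['is', 'string', 'hashtags']
```

Because the pattern is not anchored to whitespace, it also matches a `#` inside a token: `extract_hashtags("a#b")` returns `['b']`. Whether that is wanted depends on the input.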

User

Answer 2 · edited 2024-05-14 07:59:42

Building on @inspectorG4dget's answer, if you don't want duplicates, you can use a set comprehension instead of a list comprehension.

>>> tags="Hey guys! #stackoverflow really #rocks #rocks #announcement"
>>> {tag.strip("#") for tag in tags.split() if tag.startswith("#")}
set(['announcement', 'rocks', 'stackoverflow'])

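Note that the { } set comprehension syntax only works from Python 2.7 onward.
If you are on an older version, pass the output of the list comprehension ([ ]) to the set function instead.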
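For older versions, the fallback would look roughly like this. It is a sketch reusing the sample string from this answer; the variable name `unique_tags` is only illustrative.

```python
tags = "Hey guys! #stackoverflow really #rocks #rocks #announcement"

# Build a list first, then hand it to set() to drop duplicates.
unique_tags = set([tag.strip("#") for tag in tags.split() if tag.startswith("#")])

# Sets are unordered, so sort for a stable display order.
print(sorted(unique_tags))
# ['announcement', 'rocks', 'stackoverflow']
```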

User

Answer 3 · edited 2024-05-14 07:59:42

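[i[1:] for i in line.split() if i.startswith("#") and len(i) > 1]

This version drops any empty strings (a concern I saw raised in the comments), because split() with no arguments never returns them, and the len(i) > 1 check skips tokens that are only "#". Also, as in Bertrand Marron's code, it is better to turn the result into a set as follows (to avoid duplicates and to get O(1) lookup time):

set([i[1:] for i in line.split() if i.startswith("#") and len(i) > 1])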
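One difference between the approaches in this thread is worth checking against your own data: the split-based versions keep trailing punctuation, while the `\w+` regex stops at it. A small comparison, using a made-up sample string (the variable names are illustrative only):

```python
import re

line = "Loving #python, #regex! and # alone #python"

# Split-based: takes everything after '#' up to the next whitespace,
# skipping tokens that are only "#".
by_split = [tok[1:] for tok in line.split() if tok.startswith("#") and len(tok) > 1]

# Regex-based: takes only the word characters that follow '#'.
by_regex = re.findall(r"#(\w+)", line)

print(by_split)  # ['python,', 'regex!', 'python']
print(by_regex)  # ['python', 'regex', 'python']

# A set removes duplicates but loses order; dict.fromkeys keeps
# first-seen order (guaranteed in Python 3.7+).
unique_in_order = list(dict.fromkeys(by_regex))
print(unique_in_order)  # ['python', 'regex']
```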
