如何按制表符拆分字符串，但每次仅限一次出现

网友

1楼 · 编辑于 2024-04-24 04:23:31

在否定look-behind断言上使用re.split应该可以做到：

import re

s = ''.join(re.split(r'(?<!\t)\t', row))
print(s)
# 'Ihavea\tstring'

断言(?<!\t)阻止在\t前面有另一个{}的拆分。在

如果实际上不需要拆分中的项目，可以使用re.sub：

^{pr2}$

网友

2楼 · 编辑于 2024-04-24 04:23:31

如果您希望避免导入re模块，那么列表理解也是一种方法：

row = "I\thave\ta\t\tstring"
text = [splits if splits else "\t"  for splits in row.split("\t")]
"".join(text)
#'Ihavea\tstring'

布尔上下文中的空字符串为false，并且将为每个连续的拆分字符生成空列表元素（在本例中为“\t”）

网友

3楼 · 编辑于 2024-04-24 04:23:31

为了简单起见，可以使用^{}

from re import split
text = "I\thave\ta\t\tstring"
split_string = split(r'\t+', text)  #Gives ['I', 'have', 'a', 'string']

正则表达式r'\t+'基本上只是将所有连续的选项卡组合在一起。在