匹配以逗号分隔的key=value列表的正则表达式，其中value可以包含逗号

3条回答

网友
1楼 · 编辑于 2024-04-20 00:19:55

daramarak的答案要么非常接近工作，要么就是按原样工作；很难从示例输出的格式和步骤的模糊描述中分辨出来。但如果它是非常接近工作的版本，它很容易修复。
输入代码：
>>> bits=[x.rsplit(',', 1) for x in s.split('=')] >>> kv = [(bits[i][-1], bits[i+1][0]) for i in range(len(bits)-1)]
第一行是（我相信）达玛拉克的答案。第一行本身给您成对的(value_i, key_i+1)，而不是(key_i, value_i)。第二行是最明显的解决方案。有了更多的中间步骤和一些输出，看看它是如何工作的：
>>> s = 'foo=bar,breakfast=spam,eggs,blt=bacon,lettuce,tomato,spam=spam' >>> bits0 = s.split('=') >>> bits0 ['foo', 'bar,breakfast', 'spam,eggs,blt', 'bacon,lettuce,tomato,spam', 'spam'] >>> bits = [x.rsplit(',', 1) for x in bits0] >>> bits [('foo'), ('bar', 'breakfast'), ('spam,eggs', 'blt'), ('bacon,lettuce,tomato', 'spam'), ('spam')] >>> kv = [(bits[i][-1], bits[i+1][0]) for i in range(len(bits)-1)] >>> kv [('foo', 'bar'), ('breakfast', 'spam,eggs'), ('blt', 'bacon,lettuce,tomato'), ('spam', 'spam')]

网友
2楼 · 编辑于 2024-04-20 00:19:55

我能建议你像以前一样使用拆分操作吗。但是先在等号处拆分，然后在最右边的逗号处拆分，以生成一个左右字符串的列表。
input = "bob=whatever,king=kong,banana=herb,good,yellow,thorn=hurts"
最初的分裂会变成
first_split = input.split("=") #first_split = ['bob' 'whatever,king' 'kong,banana' 'herb,good,yellow,thorn' 'hurts']
然后在最右边的逗号处拆分可以得到：
second_split = [single_word for sublist in first_split for item in sublist.rsplit(",",1)] #second_split = ['bob' 'whatever' 'king' 'kong' 'banana' 'herb,good,yellow' 'thorn' 'hurts']
然后你就这样收集这对：
pairs = dict(zip(second_split[::2],second_split[1::2]))

网友
3楼 · 编辑于 2024-04-20 00:19:55

为了便于比较，这里有一个regex似乎也能解决这个问题：

([^=]+)    # key
=          # equals is how we tokenise the original string
([^=]+)    # value
(?:,|$)    # value terminator, either comma or end of string

这里的诀窍是限制你在第二组中捕获的内容。.+吞下=符号，这是用于区分键和值的字符。完整的regex不依赖于任何回溯（因此如果需要的话，它应该与re2之类的东西兼容），并且可以使用abarnert的示例。

用法如下：

re.findall(r'([^=]+)=([^=]+)(?:,|$)', 'foo=bar,breakfast=spam,eggs,blt=bacon,lettuce,tomato,spam=spam')

返回：

[('foo', 'bar'), ('breakfast', 'spam,eggs'), ('blt', 'bacon,lettuce,tomato'), ('spam', 'spam')]

相关问题更多 >

编程相关推荐

热门问题

热门文章