How do I separate the data in a .csv file?

User

Answer 1 · edited 2024-05-16 07:47:46

There are some good re-based solutions, but I just want to add this non-regex one:

>>> s = "Name,Gender,Age John Smith,M,23 Ashley Jones,F,18 James Smith Johns,M,20"
>>> sum((item.split(None, 1) for item in s.split(',')), list())
['Name', 'Gender', 'Age', 'John Smith', 'M', '23', 'Ashley Jones', 'F', '18', 'James Smith Johns', 'M', '20']

Instead of sum, you can also use itertools.chain, although in the end it is not any shorter.

>>> import itertools
>>> list(itertools.chain(*[item.split(None, 1) for item in s.split(',')]))

Or, better:

>>> list(itertools.chain.from_iterable(item.split(None, 1) for item in s.split(',')))
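The flat list loses the row boundaries. A minimal sketch of regrouping it into records, assuming every record has exactly three fields; the helper name `chunk` is hypothetical and not part of the answer:

```python
from itertools import chain

s = "Name,Gender,Age John Smith,M,23 Ashley Jones,F,18 James Smith Johns,M,20"

# Flatten: split on commas, then split each piece once on whitespace.
flat = list(chain.from_iterable(item.split(None, 1) for item in s.split(',')))

def chunk(items, size):
    """Yield consecutive slices of `size` items (hypothetical helper)."""
    for start in range(0, len(items), size):
        yield items[start:start + size]

# Assumption: three fields per record, with the header as the first record.
rows = list(chunk(flat, 3))
header, records = rows[0], rows[1:]
print(header)
print(records)
```

If a record could have a missing field, the fixed chunk size would misalign every row after it, so this only suits input as regular as the sample.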

User

Answer 2 · edited 2024-05-16 07:47:46

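Here is a solution using a regular expression:

import re
re.compile(r"([^,]+),([^,]+),(\d+|Age)(?:\s+|$)").findall("Name,Gender,Age John Smith,M,23 Ashley Jones,F,18 James Smith Johns,M,20")

The trailing (?:\s+|$) lets the last record, which is followed by the end of the string instead of whitespace, match as well. The result:

[('Name', 'Gender', 'Age'), ('John Smith', 'M', '23'), ('Ashley Jones', 'F', '18'), ('James Smith Johns', 'M', '20')]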
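To work with the matches by column name, the first tuple can serve as the header. A minimal sketch, assuming the first match is always the header row, and using a pattern that accepts either whitespace or the end of the string after the third field so the final record is not dropped:

```python
import re

text = "Name,Gender,Age John Smith,M,23 Ashley Jones,F,18 James Smith Johns,M,20"

# Three comma-separated fields; the third is digits (or the literal header
# "Age") followed by whitespace or the end of the string.
pattern = re.compile(r"([^,]+),([^,]+),(\d+|Age)(?:\s+|$)")

matches = pattern.findall(text)

# Assumption: the first match is the header row.
header, rows = matches[0], matches[1:]

# Pair each row with the header names.
people = [dict(zip(header, row)) for row in rows]
print(people)
```

Note that every value stays a string here; converting 'Age' to int would be a separate step.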

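User

Answer 3 · edited 2024-05-16 07:47:46

First split on the commas, then iterate over the resulting list and split each item once on whitespace. If that split returns more than one part, yield the first part and the remainder separately; otherwise yield just the single part.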

import csv

def solve(row):
    for item in row:
        # Split once, on the first run of whitespace.
        spl = item.split(None, 1)
        if len(spl) > 1:
            # Two parts: the end of one record and the start of the next.
            yield spl[0]
            yield spl[1]
        else:
            yield spl[0]

with open('abc1', newline='') as f:
    reader = csv.reader(f, delimiter=',')
    for row in reader:
        print(list(solve(row)))

Output:

['Name', 'Gender', 'Age', 'John Smith', 'M', '23', 'Ashley Jones', 'F', '18', 'James Smith Johns', 'M', '20']
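To try the same approach without a file on disk, the input can be wrapped in io.StringIO. A minimal sketch, assuming the file holds the single line used in the other answers; the variable `data` is a stand-in for the contents of 'abc1', and this `solve` simply re-yields whatever `split(None, 1)` returns, which gives the same result as the if/else version for non-empty fields:

```python
import csv
import io

def solve(row):
    for item in row:
        # split(None, 1) returns one or two parts; yield each of them.
        yield from item.split(None, 1)

# Assumption: stand-in for the contents of the file 'abc1'.
data = "Name,Gender,Age John Smith,M,23 Ashley Jones,F,18 James Smith Johns,M,20\n"

reader = csv.reader(io.StringIO(data), delimiter=',')
result = [list(solve(row)) for row in reader]
print(result[0])
```

One difference worth knowing: an empty field makes `split(None, 1)` return an empty list, so this version silently skips it, whereas indexing `spl[0]` would raise an IndexError.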
