在Pandas中解析csv文件时，如何从字符串中删除多余的空格？

1997,Ford,E350 1997, Ford , E350 1997,Ford,E350,"Super, luxurious truck" 1997,Ford,E350,"Super ""luxurious"" truck" 1997,Ford,E350," Super luxurious truck " "1997",Ford,E350 1997,Ford,E350 2000,Mercury,Cougar

Year Make Model Description 0 1997 Ford E350 None 1 1997 Ford E350 None 2 1997 Ford E350 Super, luxurious truck 3 1997 Ford E350 Super "luxurious" truck 4 1997 Ford E350 Super luxurious truck 5 1997 Ford E350 None 6 1997 Ford E350 None 7 2000 Mercury Cougar None

3条回答

网友

1楼 · 编辑于 2024-05-12 14:32:30

把参数skipinitialspace=True添加到^{}对我有效。

所以试试看：

pd.read_table("data.csv", 
              sep=r',', 
              names=["Year", "Make", "Model", "Description"], 
              skipinitialspace=True)

同样的事情也适用于pd.read_csv()。

网友

2楼 · 编辑于 2024-05-12 14:32:30

您可以使用转换器：

import pandas as pd

def strip(text):
    try:
        return text.strip()
    except AttributeError:
        return text

def make_int(text):
    return int(text.strip('" '))

table = pd.read_table("data.csv", sep=r',',
                      names=["Year", "Make", "Model", "Description"],
                      converters = {'Description' : strip,
                                    'Model' : strip,
                                    'Make' : strip,
                                    'Year' : make_int})
print(table)

收益率

   Year     Make   Model              Description
0  1997     Ford    E350                     None
1  1997     Ford    E350                     None
2  1997     Ford    E350   Super, luxurious truck
3  1997     Ford    E350  Super "luxurious" truck
4  1997     Ford    E350    Super luxurious truck
5  1997     Ford    E350                     None
6  1997     Ford    E350                     None
7  2000  Mercury  Cougar                     None

网友

3楼 · 编辑于 2024-05-12 14:32:30

好吧，空白在数据中，所以不读入空白就不能读入数据。但是，在读入之后，您可以通过执行df["Make"] = df["Make"].map(str.strip)（其中df是您的数据帧）来去掉空白。

相关问题更多 >

编程相关推荐

热门问题

热门文章