使用pandas read\u cs时跳过0xff字节

1条回答

网友

1楼 · 发布于 2024-06-01 01:47:02

我会把它读成一串。然后用python咀嚼一下，然后把它传递给熊猫.read_csv. 下面是示例代码。在

# get the data as a python string
with open ("CM120102.CSV", "r") as myfile:
    data=myfile.read()

# munge in python - get rid of the garbage in the input (lots of xff bytes)
import re
data = re.sub(r'[^a-zA-Z0-9_\.;:\n]', '', data) # get rid of the rubbish
data = data + '\n' # the very last one is missing?
data = re.sub(r';\n', r'\n', data) # last ; separator on line is problematic

# now let's suck into a pandas DataFrame
from StringIO import StringIO
import pandas as pd
df = pd.read_csv(StringIO(data), index_col=None, header=0,
    skipinitialspace=True, sep=';', parse_dates=True)

编程相关推荐

java在SWT中关闭CTabItem时如何获取警告消息？
java如何从中获取文本字符串
java带有（int[][]）的方法意味着什么？
java我在创建这个安卓浮动泡泡动画时做错了什么？
将边距属性作为列表项的java表抛出异常ClassCastException
java如何在Storm拓扑中测量延迟和吞吐量
java如何在javafx中序列化事件？
java访问main（）之外的线程
java如何强制某些方法仅对kotlin可见
java如何使用quartzscheduler启动具有多个crontrigger的作业？

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用pandas read\u cs时跳过0xff字节

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >