Python源-目的地矩阵公式中的CSV操作

import csv import numpy from pprint import pprint numpy.set_printoptions(threshold='nan') with open('./London stations.csv', 'rU') as csvfile: reader = csv.DictReader(csvfile) Stations = ['{O_Station}'.format(**row) for row in reader] print(Stations) O_D = numpy.column_stack(([Stations],[Stations])) pprint(O_D)

2条回答

网友

1楼 · 编辑于 2024-06-12 20:41:43

这是一个不完整的答案，但我会跳过numpy直接进入^{}：

csv_file = '''Station Lat Lng
Abbey Road  51.53195199 0.003737786
Abbey Wood  51.49078408 0.120286371
Acton   51.51688696 -0.267675543
Acton Central   51.50875781 -0.263415792
Acton Town  51.50307148 -0.280288296'''

这很困难，因为它不是真正的逗号分隔的，否则我们可以调用^{}：

^{pr2}$

所以我们最终得到df.head()的结果：

                       Lat           Lng
Station
Abbey Road     51.53195199   0.003737786
Abbey Wood     51.49078408   0.120286371
Acton          51.51688696  -0.267675543
Acton Central  51.50875781  -0.263415792
Acton Town     51.50307148  -0.280288296

得到排列可能意味着我们不需要把台站作为索引。。。暂时还不确定。希望这能有所帮助！在

网友

2楼 · 编辑于 2024-06-12 20:41:43

在处理这样的表格数据时，我更喜欢使用pandas。它使控制数据结构变得简单。在

import pandas as pd

#read in csv
stations = pd.read_csv('london stations.csv', index_col = 0)

#create new dataframe
O_D = pd.DataFrame(columns = ['O_Station','O_lat','O_lng','D_Station','D_lat','D_lng'])

#iterate through the stations

new_index= 0
for o_station in stations.index:
    for d_station in stations.index:
        ls = [o_station,stations.Lat.loc[o_station],stations.Lng.loc[o_station],d_station, stations.Lat.loc[d_station], stations.Lng.loc[d_station]]
        O_D.loc[new_index] = ls
        new_index+=1

#remove double stations
O_D = O_D[O_D.O_Station != O_D.D_Station]

这应该是您的数据转换的诀窍。在

相关问题更多 >

编程相关推荐

热门问题

热门文章