计算列表中给定其他项的出现次数

import glob import csv import re from collections import Counter path = "ReviewsSep2018/*.csv" mylist = [] for filename in glob.glob(path): print(filename) with open(filename, newline='', encoding='utf-16') as f: reader = csv.reader(f) for row in reader: result = re.search(r'\d+\W\d+\W\d+', row[5]) if result: line = result.group() mylist.append(tuple([line,row[9]])) print(mylist) for i in mylist: print(i[0],i[1])

2条回答

网友

1楼 · 编辑于 2024-04-24 23:02:41

把你的mylist变成Counter

mycount = Counter()

而不是附加到(date, rating)元组的列表增量计数：

mycount[(line,row[9])] += 1

最后，将其显示为：

for (date, rating), count in mycount.items():
    print(date, rating, count)

网友

2楼 · 编辑于 2024-04-24 23:02:41

如果您不介意使用pandas库，您可以在解析数据之后使用groupby。在我看来，pandas还有一个很好的.csv阅读功能。你知道吗

import pandas as pd

(pd.DataFrame([['2018-09-01', 1],
              ['2018-09-01', 5],
              ['2018-09-01', 2],
              ['2018-09-01', 1],
              ['2018-08-23', 1],
              ['2018-09-01', 4],
              ['2018-09-01', 4],
              ['2018-09-01', 5],
              ['2018-09-01', 2],
              ['2018-09-02', 1],
              ['2018-09-02', 5],
              ['2018-09-02', 5]],
             columns=['date', 'star']
            )
 .assign(count=1)
 .groupby(['date', 'star'])
 .count()
)

相关问题更多 >

编程相关推荐

热门问题

热门文章