Python或Pandas数据摘要（将表转换为行名称：[列名称，值]…]的字典）

### Final Result # IndexName [col_name, cell_value] [sum of positive numbers, result] [sum of negative numbers, result] Peralta [Rating, 40] [Score, 20] [Sum_Total_of_positive_numbers, 60] Amy [Rating, 40] [Score, 20] [Rating, 40] [Score, -20] [Rating, 40] [Sum_Total_of_positive_numbers, 140] [Sum_Total_of_negative_numbers, -20] Terry [Score, -20] [Rating, 40] [Rating, -40] [Sum_Total_of_positive_numbers, 40] [Sum_Total_of_negative_numbers, -60] Gina [Score, 20] [Score, -20] [Rating, 40] [Sum_Total_of_positive_numbers, 60] [Sum_Total_of_negative_numbers, -20]

for k, v in dff_dict.items(): # k: name of index, v: is a df check = v.columns[(v == 20).any()] if len(check) > 0: print((k, check.to_list()), file=open("output.txt", "a"))

1条回答

网友

1楼 · 发布于 2024-06-16 14:08:32

我认为如果您可以将整个数据集分成两部分，考虑到“正数和负数分别求和”的要求，这会更容易

从您的示例数据开始：

import pandas as pd
import numpy as np
data = [
{"Name": "Peralta", "Score": 0, "Rating": 40},
{"Name": "Peralta", "Score": 20, "Rating": 0},
{"Name": "Peralta", "Score": 0, "Rating": 0},
{"Name": "Amy", "Score": 0, "Rating": 40},
{"Name": "Amy", "Score": 20, "Rating": 40},
{"Name": "Amy", "Score": -20, "Rating": 40},
{"Name": "Terry", "Score": 0, "Rating": 0},
{"Name": "Terry", "Score": -20, "Rating": 40},
{"Name": "Terry", "Score": 0, "Rating": -40},
{"Name": "Gina", "Score": 20, "Rating": 0},
{"Name": "Gina", "Score": 0, "Rating": 0},
{"Name": "Gina", "Score": -20, "Rating": 40},
]
df = pd.DataFrame(data).set_index("Name")

我们可以得到正负值的预测：

df_pos = df.where(df>=0, other=0)
df_neg = df.where(df<0, other=0)

然后分组求和，以获得您想要的结果：

df_pos = df_pos.groupby(by="Name").sum()
df_pos["total_positive"] = df_pos.apply(np.sum, axis=1)

df_neg = df_neg.groupby(by="Name").sum()
df_neg["total_negative"] = df_neg.apply(np.sum, axis=1)

注意-在这个阶段，数据仍然在两个数据帧中，没有展平到您显示的[field, value]格式

相关问题更多 >

编程相关推荐

热门问题

热门文章