比较从s3读取的每个文件的数据帧

A B books [book1, book2, book3] animal [animal1, animal2, animal3] place [place1 , place2, place3] dataframe df2 A . B name [name1, name2, name3] animal [animal 5, animal 6]

1条回答

网友

1楼 · 发布于 2024-04-26 09:53:58

通过^{}将所有数据帧连接起来，B的聚合值与列表理解中lambda函数中列表的扁平列表：

df = pd.concat([df1, df2], ignore_index=True)

f = lambda x: [z for y in x for z in y]
df3 = df.groupby('A', sort=False)['B'].apply(f).reset_index()
print (df3)
        A                                                B
0   books                            [book1, book2, book3]
1  animal  [animal1, animal2, animal3, animal 5, animal 6]
2   place                         [place1, place2, place3]
3    name                            [name1, name2, name3]

或：

#slow solution in large data
df = df.groupby('A', sort=False)['B'].sum().reset_index()

编程相关推荐

Java 2D数组，查找包含元素
包含EBCDIC值的java打印字节数组未给出预期值
java应用程序重新启动，由于AndroidRuntime异常而无法运行
java在spring中对拦截器的使用
java ActiveMQ，代理接收要发送的消息的时间戳
JAVA：如何从需要启用Cookie的站点下载HTML文件？
邮件发送期间发生java证书错误
Java错误：类事务中的构造函数事务无法应用于给定类型
方法的Java对象空检查
Java如何在多个源文件夹之间使用全局变量？

相关问题更多 >

编程相关推荐

热门问题

热门文章

比较从s3读取的每个文件的数据帧

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >