How to apply groupby keys to the corresponding groups

uniqueID = 'ID_' + grouped.groups.keys().astype(str)

uniqueID  Name   Nationality  age
ID_UK28   Peter  UK           28
ID_US29   John   US           29
ID_UK28   Wiley  UK           28
ID_US29   Aster  US           29
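The snippet above appears to be trying to write each group's key back onto the rows of that group. A minimal sketch of one way to do that follows, assuming the frame is grouped by Nationality and age (the grouping itself is not shown in the question, so that part is an assumption). It relies on `grouped.groups`, which maps each group key to the index labels of its rows.

```python
import pandas as pd

# Sample data taken from the table in the question
df = pd.DataFrame({
    'Name': ['Peter', 'John', 'Wiley', 'Aster'],
    'Nationality': ['UK', 'US', 'UK', 'US'],
    'age': [28, 29, 28, 29],
})

# Assumption: the question grouped by these two columns
grouped = df.groupby(['Nationality', 'age'])

# grouped.groups maps each (Nationality, age) key to the row labels in that group,
# so the key can be formatted and written back to exactly those rows
df['uniqueID'] = ''
for (nationality, age), row_labels in grouped.groups.items():
    df.loc[row_labels, 'uniqueID'] = f'ID_{nationality}{age}'

print(df)
```

The original line fails because `grouped.groups.keys()` is a plain dictionary view, which has no `astype` method.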

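2 answers

User

#1 · Edited 2024-05-23 14:26:55

You don't need groupby to create the uniqueID. You can group by uniqueID afterwards to get the groups based on age and nationality. I used a custom function to build the text string. Here is one way to do it.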

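# df is the DataFrame from the question (columns: Name, Nationality, age)
# Build the ID directly from the two columns; no groupby is needed for this step
df1 = df.assign(uniqueID='ID_' + df.Nationality + df.age.astype(str))

def myText(x):
    # x is the sub-DataFrame holding the rows of one group
    text = ' and '.join(x.Name)  # named text to avoid shadowing the built-in str
    text += ' have a combined age of {}.'.format(x.age.sum())
    return text

df2 = df1.groupby(['uniqueID', 'Nationality', 'age']).apply(myText).reset_index().rename(columns={0: 'Text'})
print(df2)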

Output:

  uniqueID Nationality  age                                        Text
0  ID_UK28          UK   28  Peter and Wiley have a combined age of 56.
1  ID_US29          US   29   John and Aster have a combined age of 58.
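To check the first point of this answer, that grouping on uniqueID alone already separates the rows by nationality and age, here is a small sketch. It rebuilds the sample frame from the question; the name `members` is only illustrative.

```python
import pandas as pd

# Sample data taken from the table in the question
df = pd.DataFrame({
    'Name': ['Peter', 'John', 'Wiley', 'Aster'],
    'Nationality': ['UK', 'US', 'UK', 'US'],
    'age': [28, 29, 28, 29],
})

# Same ID construction as in the answer above
df1 = df.assign(uniqueID='ID_' + df['Nationality'] + df['age'].astype(str))

# Group on the ID only and collect the names that fall into each group
members = df1.groupby('uniqueID')['Name'].agg(list).to_dict()
print(members)
```

With this data, each ID collects exactly the two people who share a nationality and an age, so the extra grouping columns in the answer only serve to carry Nationality and age into the result.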

User

#2 · Edited 2024-05-23 14:26:55

Hopefully this is close enough; I couldn't get the average age:

import pandas as pd

#create dataframe
df = pd.DataFrame({'Name': ['Peter', 'John', 'Wiley', 'Aster'], 'Nationality': ['UK', 'US', 'UK', 'US'], 'age': [28, 29, 28, 29]})

#make uniqueID
df['uniqueID'] = 'ID_' + df['Nationality'] + df['age'].astype(str)

#groupby has an agg method that can take a dict and perform multiple aggregations
df = df.groupby(['uniqueID', 'Nationality']).agg({'age': 'sum', 'Name': lambda x: ' and '.join(x)})

#to get text you just combine new Name and sum of age
df['Text'] = df['Name'] + ' have a combined age of ' + df['age'].astype(str)
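For the average age that this answer leaves out, one possible sketch uses named aggregation, so that the sum and the mean of the same column can sit side by side. The column names `total_age` and `mean_age` and the variable `summary` are illustrative choices, not part of the original answer.

```python
import pandas as pd

# Same sample data as in the answer above
df = pd.DataFrame({
    'Name': ['Peter', 'John', 'Wiley', 'Aster'],
    'Nationality': ['UK', 'US', 'UK', 'US'],
    'age': [28, 29, 28, 29],
})
df['uniqueID'] = 'ID_' + df['Nationality'] + df['age'].astype(str)

# Named aggregation: each keyword becomes an output column,
# given as (input column, aggregation function)
summary = df.groupby(['uniqueID', 'Nationality']).agg(
    Name=('Name', lambda names: ' and '.join(names)),
    total_age=('age', 'sum'),
    mean_age=('age', 'mean'),
)

# Combine the aggregated pieces into one sentence per group
summary['Text'] = (
    summary['Name']
    + ' have a combined age of ' + summary['total_age'].astype(str)
    + ' and an average age of ' + summary['mean_age'].astype(str) + '.'
)
print(summary)
```

The mean comes back as a float, so it is rendered as 28.0 rather than 28; round or cast it before building the text if an integer is preferred.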
