合并列具有默认值和覆盖的数据帧

METRIC SYSTEM_NAME YELLOW RED 16 pagins NaN 500.0 1000.0 17 preadsec NaN 5000.0 10000.0 18 swapins NaN 250.0 500.0 19 cpupcent foo 30.0 90.0 20 pagins bar 456.0 123.0

SYSTEM_NAME METRIC CVAL 19886 foo cpupcent 89.281734 19887 bar swapins 41.799927 19888 bar pagins 123.92355 19889 quux preadsec 28.837423 19890 quux pagins 232.30303

SYSTEM_NAME METRIC CVAL YELLOW RED 19886 foo cpupcent 89.281734 30.0 90.0 19887 bar swapins 41.799927 250.0 500.0 19888 bar pagins 123.92355 456.0 123.0 19889 quux preadsec 28.837423 5000.0 10000.0 19890 quux pagins 232.30303 500.0 1000.0

1条回答

网友

1楼 · 发布于 2024-04-27 16:36:00

我发现了这个。这有点复杂和混乱，但它解决了一个合理的时间框架的问题

它假设这些值通过一个附加列进行加权，优先选择最小值

# Ground work, prepare the index
tmp_df = df.reset_index()
# Now, perform the merge. Use the common value, then tidy up the duplicates
tmp_df = tmp_df.merge(t_df, 'left', on='METRIC')\
         .drop('SYSTEM_NAME_y', axis=1)
         .rename(index=str, columns='SYSTEM_NAME_x':'SYSTEM_NAME'})
         .drop_duplicates(subset=['END_DATE','METRIC','SYSTEM_NAME'], keep='last')
# And restore the index
tmp_df = tmp_df.set_index(df.index.name)

相关问题更多 >

编程相关推荐

热门问题

热门文章