正在尝试从索引数据帧中两行范围内的数据帧中查找值

pricing Price_Per_Ton Wet_Fat 0 306 10 1 339 11 2 382 12 3 430 13 4 481 14 5 532 15 6 580 16 7 625 17 8 665 18 9 700 19 10 728 20 11 750 21 12 766 22 13 778 23 14 788 24 15 797 25 grower_moo Load Ticket Net Fruit Weight Net MOO Percent_MOO Fat 0 L2019000011817 56660 833 1.448872 21.92 1 L2019000011816 53680 1409 2.557679 21.12 2 L2019000011815 53560 1001 1.834644 21.36 3 L2019000011161 62320 2737 4.207080 21.41 4 L2019000011160 57940 1129 1.911324 20.06

grower_moo_with_price Load Ticket Net Fruit Weight Net MOO Percent_MOO Fat price_per_ton 0 L2019000011817 56660 833 1.448872 21.92 750 1 L2019000011816 53680 1409 2.557679 21.12 750 2 L2019000011815 53560 1001 1.834644 21.36 750 3 L2019000011161 62320 2737 4.207080 21.41 750 4 L2019000011160 57940 1129 1.911324 20.06 728

2条回答

网友

1楼 · 编辑于 2024-06-09 21:49:55

这看起来像是“截至”合并的作业，pd.merge_asof（documentation）：

This is similar to a left-join except that we match on nearest key rather than equal keys. Both DataFrames must be sorted by the key.
For each row in the left DataFrame:
A "backward" search [the default] selects the last row in the right DataFrame whose ‘on’ key is less than or equal to the left’s key.

在下面的代码中，我使用您的示例输入，但是列名使用下划线_而不是空格

# Required by merge_asof: sort keys in left DataFrame
grower_moo = grower_moo.sort_values('Fat')

# Required by merge_asof: key column data types must match
pricing['Wet_Fat'] = pricing['Wet_Fat'].astype('float') 

# Perform the asof merge
res = pd.merge_asof(grower_moo, pricing, left_on='Fat', right_on='Wet_Fat')

# Print result
res
      Load_Ticket  Net_Fruit_Weight  Net_MOO  Percent_MOO    Fat  Price_Per_Ton  Wet_Fat
0  L2019000011160             57940     1129     1.911324  20.06            728     20.0
1  L2019000011816             53680     1409     2.557679  21.12            750     21.0
2  L2019000011815             53560     1001     1.834644  21.36            750     21.0
3  L2019000011161             62320     2737     4.207080  21.41            750     21.0
4  L2019000011817             56660      833     1.448872  21.92            750     21.0

# Optional: drop the key column from the right DataFrame
res.drop(columns='Wet_Fat')
      Load_Ticket  Net_Fruit_Weight  Net_MOO  Percent_MOO    Fat  Price_Per_Ton
0  L2019000011160             57940     1129     1.911324  20.06            728
1  L2019000011816             53680     1409     2.557679  21.12            750
2  L2019000011815             53560     1001     1.834644  21.36            750
3  L2019000011161             62320     2737     4.207080  21.41            750
4  L2019000011817             56660      833     1.448872  21.92            750

网友
                    
                    

                    

                    2楼 ·

                    
                        编辑于 2024-06-09 21:49:55

concat_df = pd.concat([grower_moo, pricing], axis)
cocnat_df = concat_df[concat_df['Wet_Fat'] < concat_df['Fat']]
del cocnat_df['Wet_Fat']

`相关问题更多 >`

`编程相关推荐`

`热门问题`

`热门文章`