How do I filter DataFrame entries that have a specific value and differing values?

neighborhood  type_property  type_negotiation  price
Smallville    house          rent              2000
Oakville      apartment      for sale          100000
King Bay      house          for sale          250000
...

neighborhood  tenthpercentile  ninetiethpercentile  Quantity
King Bay      250000.0         250000.0             1
Smallville    99000.0          120000.0             8
Oakville      45000.0          160000.0             6
...

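2 Answers

User

Answer #1 · edited 2024-05-26 22:58:27

It may not be the most elegant approach, but you can join the per-neighborhood percentile aggregates onto each property row: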

df.join(df.groupby('neighborhood')['price'].quantile([0.1, 0.9]).unstack(), on='neighborhood')

I'm on my phone, so forgive me if the syntax isn't perfect.
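For anyone who wants to try the join idea from this answer, here is a minimal sketch. The sample rows and the column names `p10` and `p90` are made up for illustration, and it assumes the percentiles are taken over the `price` column only. The `unstack()` call is there because `quantile` with a list returns one row per (neighborhood, quantile) pair, which cannot be joined on `neighborhood` directly.

```python
import pandas as pd

# Hypothetical sample data using the question's column names.
df = pd.DataFrame({
    "neighborhood": ["Smallville", "Smallville", "Smallville",
                     "Oakville", "Oakville", "Oakville"],
    "price": [1000, 2000, 3000, 50000, 100000, 150000],
})

# 10th and 90th percentile of price per neighborhood. unstack() moves the
# quantile level of the MultiIndex into columns labelled 0.1 and 0.9.
bounds = df.groupby("neighborhood")["price"].quantile([0.1, 0.9]).unstack()
bounds.columns = ["p10", "p90"]  # illustrative names, not from the question

# Attach each row's neighborhood bounds, then keep rows inside them.
joined = df.join(bounds, on="neighborhood")
filtered = joined[joined["price"].between(joined["p10"], joined["p90"])]
print(filtered)
```

With this toy data only the middle-priced listing of each neighborhood survives, since `between` is inclusive at both ends and the extremes fall outside the interpolated percentiles.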

User

Answer #2 · edited 2024-05-26 22:58:27

You can give both frames the same index, broadcast the percentiles onto the listings, and then simply use .between.

So first:

df2 = df2.set_index('neighborhood')
df = df.set_index('neighborhood')

Then broadcast using loc (the assignment aligns on the shared neighborhood index):

df.loc[:, 't'], df.loc[:, 'n'] = df2.tenthpercentile, df2.ninetiethpercentile

Finally:

df.price.between(df.t, df.n)

This produces:

neighborhood
Smallville    False
Oakville       True
King Bay       True
King Bay      False
dtype: bool

So, to filter, index the frame with that boolean mask:

df[df.price.between(df.t, df.n)]
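The steps in this answer can be put together as a runnable sketch. The percentile values come from the table in the question. The listing rows are partly made up (the second King Bay price is hypothetical) so that the mask matches the output shown above. This version uses plain column assignment instead of the `.loc` form. It relies on the same index alignment, which works here because `df2` has one row per neighborhood.

```python
import pandas as pd

# Listings, indexed by neighborhood. The last row is a made-up example.
df = pd.DataFrame({
    "neighborhood": ["Smallville", "Oakville", "King Bay", "King Bay"],
    "price": [2000, 100000, 250000, 300000],
}).set_index("neighborhood")

# Per-neighborhood percentiles, as given in the question.
df2 = pd.DataFrame({
    "neighborhood": ["King Bay", "Smallville", "Oakville"],
    "tenthpercentile": [250000.0, 99000.0, 45000.0],
    "ninetiethpercentile": [250000.0, 120000.0, 160000.0],
}).set_index("neighborhood")

# Index-aligned assignment: each listing receives its neighborhood's bounds.
df["t"] = df2["tenthpercentile"]
df["n"] = df2["ninetiethpercentile"]

# Inclusive range check per row, then boolean indexing to filter.
mask = df["price"].between(df["t"], df["n"])
result = df[mask]
print(result)
```

If the helper columns `t` and `n` are not wanted afterwards, `result.drop(columns=["t", "n"])` removes them.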
