<p>我是Python新手(使用anacondaw/pythonv3.4.3),还没有找到这个答案,但这似乎是一个非常重要的问题,我一定是走错了方向</p>
<pre><code>import pandas as pd
url = 'https://raw.github.com/pydata/pandas/master/pandas/tests/data/tips.csv'
tips = pd.read_csv(url)
tips.head(5)
Out[1]:
total_bill tip sex smoker day time size
0 16.99 1.01 Female No Sun Dinner 2
1 10.34 1.66 Male No Sun Dinner 3
2 21.01 3.50 Male No Sun Dinner 3
3 23.68 3.31 Male No Sun Dinner 2
4 24.59 3.61 Female No Sun Dinner 4
</code></pre>
<p>我想选择<code>day</code>组中至少有50条记录的记录</p>
<pre><code>sel_days = tips.groupby("day").size() > 50
sel_days
Out[2]:
day
Fri False
Sat True
Sun True
Thur True
dtype: bool
</code></pre>
<p>我看到这是一个序列,但似乎不知道如何生成一个布尔序列来从原始集合<code>tips</code>中选择行</p>
<pre><code>type(sel_days)
Out[3]: pandas.core.series.Series
print(x in sel_days for x in tips["day"])
<generator object <genexpr> at 0x0000000007DBDFC0>
</code></pre>
<p>我该怎么做</p>