如何按值对多列执行有序选择问题的回答

如何按值对多列执行有序选择

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

您可以创建<code>DatetimeIndex</code>，然后按<a href="http://pandas.pydata.org/pandas-docs/stable/timeseries.html#partial-string-indexing" rel="nofollow noreferrer">^{<cd2>}</a>选择： <pre><code>stats_month_census_2 = pd.DataFrame({ 'year': [2008, 2008, 2008, 2013,2013], 'month': ['April','May','June','September','October'], 'data':[1,3,4,6,5] }) print (stats_month_census_2) year month data 0 2008 April 1 1 2008 May 3 2 2008 June 4 3 2013 September 6 4 2013 October 5 s = stats_month_census_2.pop('year').astype(str) + stats_month_census_2.pop('month') #if need year and month columns #s = stats_month_census_2['year'].astype(str) + stats_month_census_2['month'] stats_month_census_2.index = pd.to_datetime(s, format='%Y%B') print (stats_month_census_2) data 2008-04-01 1 2008-05-01 3 2008-06-01 4 2013-09-01 6 2013-10-01 5 </code></pre> <hr/> <pre><code>print (stats_month_census_2['2008':'2013']) data 2008-04-01 1 2008-05-01 3 2008-06-01 4 2013-09-01 6 2013-10-01 5 print (stats_month_census_2['2008-05':'2013-09']) data 2008-05-01 3 2008-06-01 4 2013-09-01 6 </code></pre> 或者创建列并将<a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.Series.between.html" rel="nofollow noreferrer">^{<cd3>}</a>与<a href="http://pandas.pydata.org/pandas-docs/stable/indexing.html#boolean-indexing" rel="nofollow noreferrer">^{<cd4>}</a>一起使用： <pre><code>s = stats_month_census_2['year'].astype(str) + stats_month_census_2['month'] stats_month_census_2['date'] = pd.to_datetime(s, format='%Y%B') print (stats_month_census_2) year month data date 0 2008 April 1 2008-04-01 1 2008 May 3 2008-05-01 2 2008 June 4 2008-06-01 3 2013 September 6 2013-09-01 4 2013 October 5 2013-10-01 df = stats_month_census_2[stats_month_census_2['date'].between('2008-05', '2013-09')] print (df) year month data date 1 2008 May 3 2008-05-01 2 2008 June 4 2008-06-01 3 2013 September 6 2013-09-01 </code></pre> 不幸的是，对于select between years，这种使用datetime列的方法是不可能的，因此需要使用<code>pygo</code>列的<code>year</code>解决方案： <pre><code>#wrong output df = stats_month_census_2[stats_month_census_2['date'].between('2008', '2013')] print (df) year month data date 0 2008 April 1 2008-04-01 1 2008 May 3 2008-05-01 2 2008 June 4 2008-06-01 </code></pre>

如何按值对多列执行有序选择

1 个回答

相关Python问题