我使用Quandl
python模块的get()
函数从Quandl下载了一个时间序列,它的形状是(24,19)
,但我只想使用第1到3列和第18列。我编写了这个脚本来完成这项工作,使用了一个名为ll
的列表,并在我的DataFrame
对象中应用了drop()
方法,该列表包含了我不使用的列的名称。这里的挑战是不用创建列表就可以完成。
import Quandl
import pandas as pd
UNES_HARD_COAL_BR = Quandl.get("UENG/CL_BRA", authtoken="xX6ntNSFuvq7eCZvDdvL")
UNES_HARD_COAL_BR.columns = UNES_HARD_COAL_BR.columns.str.replace(' ', '_')
ll =['Hard_coal_-_transformation_(Metric_tons,_thousand)', 'Hard_coal_-_transformation_in_coke_ovens_(Metric_tons,_thousand)','Hard_coal_-_transformation_in_electricity,_CHP_and_heat_plants_(Metric_tons,_thousand)','Hard_coal_-_transformation_in_electricity_plants_-_main_activity_producers_(Metric_tons,_thousand)','Hard_coal_-_final_energy_consumption_(Metric_tons,_thousand)','Hard_coal_-_consumption_by_manufacturing,_construction_and_non-fuel_mining_industry_(Metric_tons,_thousand)','Hard_coal_-_consumption_by_other_manuf.,_const._and_non-fuel_ind._(Metric_tons,_thousand)','Hard_coal_-_final_consumption_(Metric_tons,_thousand)','Hard_coal_-_stock_changes_(Metric_tons,_thousand)','Hard_coal_-_transformation_in_electricity_plants_-_autoproducers_(Metric_tons,_thousand)','Hard_coal_-_consumption_by_transport_(Metric_tons,_thousand)','Hard_coal_-_consumption_by_rail_(Metric_tons,_thousand)','Hrad_coal_-_consumption_by_iron_and_steel_industry_(Metric_tons,_thousand)', 'Hard_coal_-_losses_(Metric_tons,_thousand)', 'Hard_coal_-_total_energy_supply_(Metric_tons,_thousand)']
UNES_HARD_COAL_BR.drop(ll, axis=1, inplace=True)
How to select only specific columns from a DataFrame with MultiIndex columns?
pandas: Extracting specific selected columns from a DataFrame to new DataFrame
使用
.iloc[]
并注意pandas从0
开始计算列数:相关问题 更多 >
编程相关推荐