pandas: return column values that start with specific digits

import pandas as pd

url = 'https://raw.githubusercontent.com/108michael/ms_thesis/master/sic_naics_catcode.csv'
df = pd.read_csv(url, index_col=0)
df.head(3)

   SICcode  Catcode  Category                                       SICname  MultSIC  2012 NAICS Code  2002to2007 NAICS
0  111      A1500    Wheat, corn, soybeans and cash grain           Wheat    X        111140           111140
1  112      A1600    Other commodities (incl rice, peanuts, honey)           X        111160           111160
2  115      A1500    Wheat, corn, soybeans and cash grain           Corn     X        111150           111150

2 answers

User

#1 · edited 2024-05-14 11:04:20

You can use the power of regular expressions:

df.loc[df['2002to2007 NAICS'].astype(str).str.contains(r'^(?:531|92|541[6-9])')]

This returns every row whose value starts with 531, 92, or 5416 through 5419.
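To see what the pattern keeps and what it drops, here is a minimal sketch on a tiny hand-made frame. The sample values are hypothetical and are not taken from the real CSV; only the column name and the pattern come from the answer above.

```python
import pandas as pd

# Hypothetical sample values, chosen to exercise each branch of the pattern.
sample = pd.DataFrame(
    {"2002to2007 NAICS": [111140, 531110, 921110, 541611, 541511, 541990]}
)

# Convert to strings so the regex can look at the leading digits.
codes = sample["2002to2007 NAICS"].astype(str)

# ^ anchors at the start; (?: ... ) groups the alternatives without capturing,
# which avoids the pandas warning about match groups in str.contains.
pattern = r"^(?:531|92|541[6-9])"

matched = sample.loc[codes.str.contains(pattern)]
print(matched["2002to2007 NAICS"].tolist())
# → [531110, 921110, 541611, 541990]
```

Here 541511 is dropped because its first four digits, 5415, fall outside 5416 through 5419, and 111140 matches none of the prefixes.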

User

#2 · edited 2024-05-14 11:04:20

For values that start with 531 or 92:

df.loc[(df["2002to2007 NAICS"].astype(str).str.startswith("531")) | (df["2002to2007 NAICS"].astype(str).str.startswith("92"))]

For values that start with 5416 through 5419:

df.loc[df["2002to2007 NAICS"].astype(str).str.slice(0,4).isin([str(i) for i in range(5416, 5420)])]
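If all three groups are wanted in one result, the two filters can be merged into a single boolean mask. The following is only a sketch: `starts_with_any` is a hypothetical helper, not a pandas function, and the sample values are made up for illustration.

```python
import pandas as pd

def starts_with_any(series, prefixes):
    """Return a boolean mask that is True where the string form of a value
    starts with at least one of the given prefixes (hypothetical helper)."""
    text = series.astype(str)
    mask = pd.Series(False, index=series.index)
    for prefix in prefixes:
        # OR together one startswith mask per prefix.
        mask |= text.str.startswith(prefix)
    return mask

# Hypothetical sample values, not taken from the real CSV.
sample = pd.DataFrame(
    {"2002to2007 NAICS": [111140, 531110, 921110, 541611, 541511, 541990]}
)

# "531", "92", and the four-digit prefixes 5416 through 5419.
prefixes = ["531", "92"] + [str(i) for i in range(5416, 5420)]

result = sample.loc[starts_with_any(sample["2002to2007 NAICS"], prefixes)]
print(result["2002to2007 NAICS"].tolist())
# → [531110, 921110, 541611, 541990]
```

Keeping the prefixes in a list makes it easy to add or remove codes later without rewriting the filter expression.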
