处理网页中的空单元格

1条回答

网友

1楼 · 发布于 2024-04-20 08:35:41

我的建议是：使用pandas.DataFrame。它可以从许多源加载数据，包括HTML

您可以使用fillna方法轻松地处理空单元格

考虑这个例子：

import pandas as pd

# read_excel returns list of dataframes.
# In this case we know there is only one in the page
df = pd.read_html('http://www.basketball-reference.com/leagues/NBA_2015_per_poss.html',
                  attrs={'id': 'per_poss'})[0] 

# the headers repeat every 20 lines, filtering them out
df = df[df['Rk'] != 'Rk'] 

# inserting 0 to empty cells
# could also use inplace=True kwarg instead of reassigning, or pass a 
# dictionary to use different value for each column 
df = df.fillna(0)

编程相关推荐

java如何在安卓中使用动画旋转某些东西
排序如何对Java ArrayList进行排序
JAVAlang.OutOfMemoryError:使用Apache POI读取excel时的Java堆空间
java Tomcat 8.0.20内存不足错误
显式EntityManager之后@RequestScoped Bean中的java LazyInitializationException。发现
java对象到片段的通信
java DidRangeBeanConsinRegion并不总是在altBeacon库中工作
用java将xml配置文件应用到我的应用程序中的最佳方法是什么？
输入Java扫描器和字母
字符串Java解析输出的消息

相关问题更多 >

编程相关推荐

热门问题

热门文章

处理网页中的空单元格

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >