<p>The following script creates a separate DataFrame for each table found in a single Excel worksheet:</p>
<pre><code>import pandas as pd
import numpy as np
# read data from excel:
data = pd.read_excel('path/to/my/file.xlsx', header=None)
# split on rows with NaN (blank rows):
df_list = np.split(data, data[data.isnull().all(1)].index)
# iterate over each dataframe:
for df in df_list:
    # remove NaNs:
    df = df.dropna()
    # set first row as column headers:
    new_header = df.iloc[0]
    df = df[1:]
    df.columns = new_header
    # clean up dataframe (discard the old index):
    df = df.reset_index(drop=True)
    df.columns.name = ''
    # print results:
    print(df)
    print()
</code></pre>
<p>This outputs:</p>
<pre><code>   Nomes                   ID Endereço
0  Pedro  1900-01-01 00:00:00    Rua X
1   Joao  1900-01-01 00:00:00    Rua Y
2  Maria  1900-01-02 00:00:00    Rua X

  Modelo  ID Tipo
0      A  20  XYZ
1      B  30  XYS
2      C  40  XQZ

        Cor Modelo Classe Local
0      Azul      A      4  Casa
1  Vermelho      A      4  Casa
2      Roxo      A      5  Casa
3     Lilas      X         Casa
</code></pre>
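<p>A possible variation, offered as a sketch rather than a definitive implementation: <code>dropna()</code> with no arguments discards every row that contains any NaN, so a table row with a single empty cell would be lost, and calling <code>np.split</code> on a DataFrame may emit warnings or fail depending on the installed pandas and NumPy versions. The version below instead groups rows by a running count of blank rows and only drops columns that are entirely empty within each table. The function name <code>split_tables</code> and the in-memory <code>sample</code> frame are hypothetical; <code>sample</code> merely stands in for the result of <code>pd.read_excel(..., header=None)</code>.</p>

```python
import numpy as np
import pandas as pd


def split_tables(data):
    """Split a header-less sheet into one DataFrame per table.

    Rows that are entirely NaN are treated as separators between tables.
    """
    blank = data.isnull().all(axis=1)
    # Every blank row bumps the counter, so each table gets its own group id.
    group_ids = blank.astype(int).cumsum()[~blank]
    tables = []
    for _, block in data[~blank].groupby(group_ids):
        # Drop only the columns that are completely empty in this table.
        block = block.dropna(axis=1, how='all')
        # Promote the first row to column headers and renumber the rest.
        table = block.iloc[1:].reset_index(drop=True)
        table.columns = block.iloc[0].tolist()
        tables.append(table)
    return tables


# Hypothetical stand-in for pd.read_excel('path/to/my/file.xlsx', header=None)
sample = pd.DataFrame([
    ['Modelo', 'ID', 'Tipo', np.nan],
    ['A', 20, 'XYZ', np.nan],
    ['B', 30, 'XYS', np.nan],
    [np.nan, np.nan, np.nan, np.nan],
    ['Cor', 'Modelo', 'Classe', 'Local'],
    ['Azul', 'A', 4, 'Casa'],
    ['Lilas', 'X', np.nan, 'Casa'],
])

tables = split_tables(sample)
for table in tables:
    print(table)
    print()
```

<p>With this approach a row that has a missing cell, such as the <code>Lilas</code> row, is kept, and its empty cell stays as NaN in the resulting DataFrame.</p>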