擅长:python、mysql、java
<p>对于@jezrael这个非常好的解决方案,一个更冗长的替代方案是:</p>
<pre><code>import pandas as pd
infile = "test.txt" #this is your file
df = pd.DataFrame(columns=['Time','Text'])
with open(infile, "r") as inf:
for i, line in enumerate(inf):
line = line.strip()
if line.startswith("-") or line.startswith("="):
pass
else:
if len(line.split("-")) > 1:
df.loc[i] = pd.Series({'Time':line.split("-")[0], 'Text':line.split("-")[1]})
inf.close()
</code></pre>
<p>但我不确定您是否希望将Time列转换为pd Time格式。如果是,那么只需添加:</p>
^{pr2}$
<p>在剧本的结尾</p>