擅长:python、mysql、java
<p>使用<code>series.apply()</code>:</p>
<pre><code>>>> import pandas
>>> data = {'filename': ["blah-blah-blah/dss_outline.pdf", "doot-doot/helper_doc.pdf", "blah-blah-blah/help_file.pdf"]}
>>> df = pandas.DataFrame(data=data)
>>> df
filename
0 blah-blah-blah/dss_outline.pdf
1 doot-doot/helper_doc.pdf
2 blah-blah-blah/help_file.pdf
>>> def get_top_level_from(string):
... return string.split('/')[0]
...
>>> series = df["filename"]
>>> series
0 blah-blah-blah/dss_outline.pdf
1 doot-doot/helper_doc.pdf
2 blah-blah-blah/help_file.pdf
Name: filename, dtype: object
>>> series.apply(get_top_level_from)
0 blah-blah-blah
1 doot-doot
2 blah-blah-blah
Name: filename, dtype: object
</code></pre>
<p>代码:</p>
<pre><code>def get_top_level_from(string):
return string.split('/')[0]
results = df["filename"].apply(get_top_level_from)
</code></pre>