<p>If you want to count how often each word occurs, you can try this:</p>
<pre><code>>>> from collections import Counter
>>> words = "Dave, Laura, Maddy, Dave, Laura, Maddy, Dave, Laura, Dave"
>>> Counter([word.lower() for word in words.split(', ')])
Counter({'dave': 4, 'laura': 3, 'maddy': 2})
</code></pre>
<p>Documentation for <a href="https://docs.python.org/2/library/collections.html#collections.Counter" rel="nofollow"><code>collections.Counter</code></a></p>
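<p>Once the <code>Counter</code> is built, it can be queried directly. The following is a minimal sketch that reuses the sample string from above; the variable names <code>counts</code> and <code>top_two</code>, and the lookup key <code>'zoe'</code>, are illustrative assumptions rather than part of the original answer:</p>

```python
from collections import Counter

# Same comma-separated sample string as in the answer above.
words = "Dave, Laura, Maddy, Dave, Laura, Maddy, Dave, Laura, Dave"

# Build the frequency table the same way, lowercasing each name.
counts = Counter(word.lower() for word in words.split(', '))

# most_common(n) returns the n highest counts as (item, count) tuples.
top_two = counts.most_common(2)
print(top_two)  # [('dave', 4), ('laura', 3)]

# Looking up a key that was never seen returns 0 instead of raising KeyError.
print(counts['zoe'])  # 0
```

<p>Because <code>Counter</code> is a <code>dict</code> subclass, the usual dictionary operations such as <code>counts.items()</code> also work on it.</p>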
<hr/>
<p>A shorter alternative to the first answer:</p>
^{pr2}$
<p>If you want to strip punctuation as well (more than just ","):</p>
<pre><code>>>> import re
>>> words = "Dave! Laura: Maddy; Dave, Laura? Maddy, Dave, Laura, Dave."
>>> re.sub(r'[!:;,?.]', '', words).lower()
'dave laura maddy dave laura maddy dave laura dave'
</code></pre>
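<p>The cleaned string can then be split on whitespace and counted. This is a rough sketch that combines the <code>re.sub</code> call above with <code>Counter</code>; the helper name <code>count_words</code> is hypothetical, and the character class only covers the punctuation marks listed in the example:</p>

```python
import re
from collections import Counter


def count_words(text):
    """Hypothetical helper: drop the listed punctuation, lowercase, count words."""
    # Remove only the characters ! : ; , ? . as in the example above.
    cleaned = re.sub(r'[!:;,?.]', '', text).lower()
    # split() with no argument splits on any run of whitespace.
    return Counter(cleaned.split())


words = "Dave! Laura: Maddy; Dave, Laura? Maddy, Dave, Laura, Dave."
counts = count_words(words)
print(counts)  # Counter({'dave': 4, 'laura': 3, 'maddy': 2})
```

<p>Text containing other punctuation (quotes, hyphens, parentheses) would need a wider character class than the one assumed here.</p>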