擅长:python、mysql、java
<p>线路首次加载</p>
<pre><code>lines = open('movie_ratings.txt').read().splitlines()[1:]
sentences = [line.split('\t') for line in lines]
</code></pre>
<p>现在我们将注释保留在最后一个值为“1”的位置</p>
<pre><code>comments_to_keep = [
comment for rating_id, comment, flag in sentences
if flag == '1'
]
</code></pre>
<p>现在我们从这些评论中抽取一个样本</p>
<pre><code>import random
sample = random.sample(comments_to_keep, 1000)
</code></pre>