擅长:python、mysql、java
<p>你不能用熊猫数据框来做这个吗?链接:<a href="https://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.sample.html" rel="nofollow noreferrer">Pandas Dataframe Sampling</a>。这是我过去用过的一个例子:</p>
<pre><code> import pandas as pd
keeping = 0.8
source = "/path/to/some/file"
df = pd.DataFrame(source)
ones = df[df.trainlabels == 1].sample(frac=keeping)
twos = df[df.trainlabels == 2].sample(frac=keeping)
</code></pre>