有没有一种更简单的方法可以将列表随机拆分为子列表，而不必在python中重复元素？

import random import os # list files in folder files = os.listdir("C:/.../my_folder") # define the size of the sets: ~30% validation, ~20% test, ~50% training (remaining goes to training set) validation_count = int(0.3 * len(files)) test_count = int(0.2 * len(files)) training_count = len(files) - validation_count - test_count # randomly choose ~20% of files to test set test_set = random.sample(files, k = test_count) # remove already chosen files from original list files_wo_test_set = [f for f in files if f not in test_set] # randomly chose ~30% of remaining files to validation set validation_set = random.sample(files_wo_test_set, k = validation_count) # the remaining files going into the training set training_set = [f for f in files_wo_test_set if f not in validation_set]

2条回答

网友

1楼 · 编辑于 2024-04-24 10:36:12

我建议您查看sci工具包学习库，因为它包含为您执行此操作的train_test_split函数。但是，只使用random库来回答您的问题

# First shuffle the list randomly
files = os.listdir("C:/.../my_folder")
random.shuffle(files) 

# Then just slice
ratio = int(len(files)/5) # 20%
test_set = files[:ratio]
val_set = files[ratio:1.5*ratio] #30%

网友

2楼 · 编辑于 2024-04-24 10:36:12

我认为答案是不言自明的，所以我没有添加任何解释

import random
random.shuffle(files)
k = test_count
set1 = files[:k]
set2 = files[k:1.5k]
set3 = files[1.5k:]

相关问题更多 >

编程相关推荐

热门问题

热门文章