从Pytorch中加载EMNIST数据集

0 投票
1 回答
161 浏览
提问于 2025-04-12 21:35

我正在使用EMNIST数据集,想从PyTorch加载它,但出现了一个奇怪的错误,内容是:

运行时错误:文件未找到或已损坏。

这是我尝试加载数据集的方式:

trainset = torchvision.datasets.EMNIST(root="emnist",
                                   split="letters",
                                   train=True,
                                   download=True,
                                   transform=transforms.ToTensor())

可能出什么问题了呢?

1 个回答

1

我觉得这个链接不对,试着用这个下载数据集:

https://github.com/Tony-Y/pytorch_warmup/blob/master/examples/emnist/download.py

然后把你的代码改成:

import torchvision
from torchvision import transforms

# Update the path to where you've manually placed the EMNIST dataset
root_dir = "./path/to/your/emnist"  # Change this to the actual path

trainset = torchvision.datasets.EMNIST(root=root_dir,
                                   split="letters",
                                   train=True,
                                   download=False,  # Set to False since you already downloaded it 
                                   transform=transforms.ToTensor())

撰写回答