"No such file" error when loading CSV data stored in Google Drive into torchtext format with torchtext.data.TabularDataset

Posted 2024-04-26 23:57:03


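I have a CSV file stored in Google Drive and am trying to load it into torchtext's data.TabularDataset. The error message is: FileNotFoundError: [Errno 2] No such file or directory: 'https://.....'

Is it not possible to load a CSV file from Google Drive directly into a torchtext TabularDataset?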

Here is the code. I have also made a report public, with the data shared publicly.

import torch
from torchtext import data, datasets

!pip install -U -q PyDrive
from pydrive.auth import GoogleAuth
from pydrive.drive import GoogleDrive
from google.colab import auth
from oauth2client.client import GoogleCredentials

auth.authenticate_user()
gauth = GoogleAuth()
gauth.credentials = GoogleCredentials.get_application_default()
drive = GoogleDrive(gauth)

TEXT = data.Field(tokenize = 'spacy', batch_first = True, lower=False)  
LABEL = data.LabelField(sequential=False, dtype = torch.float) 

train = data.TabularDataset(path = 'https://drive.google.com/open?id=1eWMjusU3H34m0uml5SdJvYX6gQuB8zta', 
                            format = 'csv', 
                            fields = [('Insult', LABEL), (None, None), ('Comment', TEXT)], 
                            skip_header=False)

1 answer
Anonymous user
Answer 1 · Posted 2024-04-26 23:57:03

Assuming you are able to download this CSV file, I suggest using torchtext's built-in download_from_url function:

import os
import torch
from torchtext import data, datasets
from torchtext.utils import download_from_url

# download the file
CSV_FILENAME = 'data.csv'
CSV_GDRIVE_URL = 'https://drive.google.com/uc?export=download&id=1eWMjusU3H34m0uml5SdJvYX6gQuB8zta'
download_from_url(CSV_GDRIVE_URL, CSV_FILENAME)

TEXT = data.Field(tokenize = 'spacy', batch_first = True, lower=False)
LABEL = data.LabelField(sequential=False, dtype = torch.float)

# On Colab the notebook runs in /content, so that is where data.csv is expected to be
train = data.TabularDataset(path=os.path.join('/content', CSV_FILENAME),
                            format='csv',
                            fields = [('Insult', LABEL), (None, None), ('Comment', TEXT)],
                            skip_header=False )
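
For context on why the original call failed: as far as I know, TabularDataset simply opens `path` as a local file, so a URL string is looked up on disk and never fetched over the network. Below is a minimal sketch of that behaviour using only the standard library. The helper name `try_open_local` is hypothetical, and the URL is the one from the question.

```python
def try_open_local(path):
    """Try to open `path` as a local text file; return the error if it fails, else None."""
    try:
        with open(path, encoding="utf8"):
            return None
    except OSError as exc:  # FileNotFoundError is a subclass of OSError
        return exc


# The share link is treated as a (non-existent) local path, not as a URL.
url = "https://drive.google.com/open?id=1eWMjusU3H34m0uml5SdJvYX6gQuB8zta"
err = try_open_local(url)
print(type(err).__name__, err)
```

This is why the file has to be downloaded to local storage first, and only then handed to TabularDataset by its local path.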

Note that the Google Drive link should not be the open?id form; change it to the uc?export=download&id form instead.
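
If you have several share links to convert, the rewrite can be scripted. The following is only a sketch: `to_direct_download_url` is a hypothetical helper name, and it assumes the link carries the file ID in an `id` query parameter, as the `open?id=...` link in the question does. Other Google Drive link shapes are not handled.

```python
from urllib.parse import parse_qs, urlencode, urlparse


def to_direct_download_url(share_url):
    """Rewrite a Drive 'open?id=<ID>' link as 'uc?export=download&id=<ID>'."""
    ids = parse_qs(urlparse(share_url).query).get("id")
    if not ids:
        raise ValueError(f"no 'id' query parameter in {share_url!r}")
    # Rebuild the query string in the form used for direct downloads.
    query = urlencode({"export": "download", "id": ids[0]})
    return f"https://drive.google.com/uc?{query}"


share_url = "https://drive.google.com/open?id=1eWMjusU3H34m0uml5SdJvYX6gQuB8zta"
print(to_direct_download_url(share_url))
```

The result can then be passed to download_from_url in place of a hand-edited URL.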
