运行run_seq2seq.py Transformers培训脚本时出错

2024-03-28 20:08:29 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在尝试训练一个seq2seq模型。我在Colab中运行了示例代码:

!git clone https://github.com/huggingface/transformers
!git clone https://github.com/huggingface/datasets
!pip install transformers
!pip install datasets
!python transformers/examples/seq2seq/run_seq2seq.py \
    --model_name_or_path t5-small \
    --do_train \
    --do_eval \
    --task summarization \
    --dataset_name xsum \
    --output_dir /tmp/tst-summarization \
    --per_device_train_batch_size=4 \
    --per_device_eval_batch_size=4 \
    --overwrite_output_dir \
    --predict_with_generate \
    --max_train_samples 500 \
    --max_val_samples 500

我犯了这个错误

I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
Traceback (most recent call last):
  File "transformers/examples/seq2seq/run_seq2seq.py", line 47, in <module>
    from transformers.file_utils import is_offline_mode
ImportError: cannot import name 'is_offline_mode' from 'transformers.file_utils' (/usr/local/lib/python3.7/dist-packages/transformers/file_utils.py)

有什么想法吗


Tags: pipnamepyhttpsgitgithubcomclone
1条回答
网友
1楼 · 发布于 2024-03-28 20:08:29

问题是您克隆了存储库的主分支,并尝试使用主分支后面的transformers版本(4.3.3)运行run_seq2seq.py脚本

run_seq2seq.py于3月6日更新为导入is_offline_mode,使用此merge

您只需克隆用于旧变压器版本的分支:

!git clone  branch v4.3.3-release https://github.com/huggingface/transformers

注:我认为您不需要克隆数据集库

相关问题 更多 >