（OpenNMT）西班牙语到英语的模型改进

## Where the samples will be written save_data: en-sp/run/example ## Where the vocab(s) will be written src_vocab: en-sp/run/example.vocab.src tgt_vocab: en-sp/run/example.vocab.tgt ## Where the model will be saved save_model: drive/MyDrive/ESEN/model3_bpe_adam_001_layer2/model # Prevent overwriting existing files in the folder overwrite: False # Corpus opts: data: taus_corona: path_src: data/spanish_train path_tgt: data/english_train transforms: [sentencepiece, filtertoolong] weight: 1 valid: path_src: data/spanish_valid path_tgt: data/english_valid transforms: [sentencepiece] skip_empty_level: silent src_subword_model: data/esen.model tgt_subword_model: data/esen.model # General opts report_every: 100 train_steps: 5000 valid_steps: 1000 save_checkpoint_steps: 1000 world_size: 1 gpu_ranks: [0] # Optimizer optim: adam learning_rate: 0.001 # Model encoder_type: rnn decoder_type: rnn layers: 2 rnn_type: LSTM bidir_edges: True # Logging tensorboard: true tensorboard_log_dir: logs log_file: logs/log-file.txt verbose: True attn_debug: True align_debug: True global_attention: general global_attention_function: softmax

Step 1000/ 5000; acc: 27.94; ppl: 71.88; xent: 4.27; lr: 0.00100; 13103/12039 tok/s; 157 sec Validation perplexity: 136.446 Validation accuracy: 24.234 ... Step 4000/ 5000; acc: 61.25; ppl: 5.28; xent: 1.66; lr: 0.00100; 13584/12214 tok/s; 641 sec Validation accuracy: 22.1157 ...

1条回答

网友

1楼 · 发布于 2024-06-06 17:16:24

my validation accuracy goes down the more I train while my training accuracy goes up.

这听起来像是过度装修

10万句话并不多。所以你所看到的是预期的。当验证集上的结果停止改善时，您可以停止培训

同样的基本动力也可以在更大的范围内发生，只需要更长的时间

如果您的目标是培养自己的相当好的模型，我会看到以下几种选择：

将大小增加到1M左右
从预先训练好的模型开始，进行微调
两者

对于1来说，至少有100万行英语：西班牙语，即使过滤掉最嘈杂的语言，你也可以从ModelFront获得

对于2，我知道埃里温的团队在WMT20上取得了胜利，从Fairseq模型开始，使用了大约300K的翻译。他们能够用相当有限的硬件做到这一点

相关问题更多 >

编程相关推荐

热门问题

热门文章