跳转到主要内容

用于开发Task Bots的包

项目描述

Mask-Predict

下载模型

描述 数据集 模型
MASK-PREDICT [WMT14 English-German] 下载 (.tar.bz2)
MASK-PREDICT [WMT14 German-English] 下载 (.tar.bz2)
MASK-PREDICT [WMT16 English-Romanian] 下载 (.tar.bz2)
MASK-PREDICT [WMT16 Romanian-English] 下载 (.tar.bz2)
MASK-PREDICT [WMT17 English-Chinese] 下载 (.tar.bz2)
MASK-PREDICT [WMT17 Chinese-English] 下载 (.tar.bz2)

预处理

text=PATH_YOUR_DATA

output_dir=PATH_YOUR_OUTPUT

src=source_language

tgt=target_language

model_path=PATH_TO_MASKPREDICT_MODEL_DIR

python preprocess.py --source-lang srctargetlang{tgt} --trainpref text/trainvalidpreftext/valid --testpref text/testdestdir{output_dir}/data-bin --workers 60 --srcdict Missing superscript or subscript argument{src}tgt/dict.{src}.txt --tgtdict ${model_path}/maskPredictMissing superscript or subscript argument{tgt}/dict.${tgt}.txt

训练

model_dir=PLACE_TO_SAVE_YOUR_MODEL

python train.py outputdir/databinarchberttransformerseq2seqshareallembeddingscriterionlabelsmoothedlengthcrossentropylabelsmoothing0.1lr5e4warmupinitlr1e7minlr1e9lrschedulerinversesqrtwarmupupdates10000optimizeradamadambetas(0.9,0.999)adameps1e6tasktranslationselfmaxtokens8192weightdecay0.01dropout0.3encoderlayers6encoderembeddim512decoderlayers6decoderembeddim512fp16maxsourcepositions10000maxtargetpositions10000maxupdate300000seed0savedir{model_dir}

评估

python generate_cmlm.py outputdir/databinpath{model_dir}/checkpoint_best_average.pt --task translation_self --remove-bpe --max-sentences 20 --decoding-iterations 10 --decoding-strategy mask_predict

许可

MASK-PREDICT遵循CC-BY-NC 4.0许可。此许可证也适用于预训练模型。

引用

请引用如下

@inproceedings{ghazvininejad2019MaskPredict,
  title = {Mask-Predict: Parallel Decoding of Conditional Masked Language Models},
  author = {Marjan Ghazvininejad, Omer Levy, Yinhan Liu, Luke Zettlemoyer},
  booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing},
  year = {2019},
}

支持者