Improving Neural Machine Translation with the Abstract Meaning Representation by Combining Graph and Sequence Transformers

Environment set up

Install Python3.6+

Run https://github.com/jlab-nlp/amr-nmt/blob/main/install_environment.sh

Training

Download and unzip data directory into the project directory. The data directory includes processed amr data for the experimented language in the paper.

Example on en to mg: https://github.com/jlab-nlp/amr-nmt/blob/main/train_mha_concat.sh This uses the main model of the paper. There are other variations. You can find in https://github.com/jlab-nlp/amr-nmt/blob/main/onmt/models/model.py.

Caveats: note that for tokenization, you need to train the correponding sentencepiece tokenizer, and change the name to "sentencepiece.bpe.model" and put it into the project directory. We already have several trained sentencepiece tokenizers for different languages. You can see it in the project main directory named sentencepiece.bpe.model.*. When you use them just to rembember to change it to "sentencepiece.bpe.model" in the project directory. Besides, for different size of the tokenizer, you need to debug to change the code in the https://github.com/jlab-nlp/amr-nmt/blob/main/reduce_embeding_size.py line 21-68 and line 80 - 87 into the corresponding vocabulary size and language training file.

Prediction

Example on en to mg: https://github.com/jlab-nlp/amr-nmt/blob/main/predict_mha_concat.sh. This uses the main model of the paper. There are other variations.

Evaluation

Here are the prediction outputs for the experimented languages: https://github.com/jlab-nlp/amr-nmt/blob/main/pred_outs.zip. Go to https://github.com/jlab-nlp/amr-nmt/tree/main/multeval-0.5.1, unzip the prediction outputs into it and here is the example on en to mg https://github.com/jlab-nlp/amr-nmt/blob/main/multeval-0.5.1/eval_mg_tokenized.sh

Citation

@inproceedings{li-flanigan-2022-improving,
    title = "Improving Neural Machine Translation with the {A}bstract {M}eaning {R}epresentation by Combining Graph and Sequence Transformers",
    author = "Li, Changmao  and
      Flanigan, Jeffrey",
    editor = "Wu, Lingfei  and
      Liu, Bang  and
      Mihalcea, Rada  and
      Pei, Jian  and
      Zhang, Yue  and
      Li, Yunyao",
    booktitle = "Proceedings of the 2nd Workshop on Deep Learning on Graphs for Natural Language Processing (DLG4NLP 2022)",
    month = jul,
    year = "2022",
    address = "Seattle, Washington",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.dlg4nlp-1.2",
    doi = "10.18653/v1/2022.dlg4nlp-1.2",
    pages = "12--21",
    abstract = "Previous studies have shown that the Abstract Meaning Representation (AMR) can improve Neural Machine Translation (NMT). However, there has been little work investigating incorporating AMR graphs into Transformer models. In this work, we propose a novel encoder-decoder architecture which augments the Transformer model with a Heterogeneous Graph Transformer (Yao et al., 2020) which encodes source sentence AMR graphs. Experimental results demonstrate the proposed model outperforms the Transformer model and previous non-Transformer based models on two different language pairs in both the high resource setting and low resource setting. Our source code, training corpus and released models are available at \url{https://github.com/jlab-nlp/amr-nmt}.",
}

For any questions put it into the github issue or contact me at changmao.li@ucsc.edu.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
multeval-0.5.1		multeval-0.5.1
onmt		onmt
preprocess		preprocess
tools		tools
transformers		transformers
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bleu_significance.py		bleu_significance.py
bpe_dege.py		bpe_dege.py
config.json		config.json
eval_concat.sh		eval_concat.sh
install_environment.sh		install_environment.sh
org_ids_to_new_ids.json		org_ids_to_new_ids.json
postprocess.py		postprocess.py
pred_outs.zip		pred_outs.zip
predict.sh		predict.sh
predict_amr_nmt.sh		predict_amr_nmt.sh
predict_concat.sh		predict_concat.sh
predict_concat_add.sh		predict_concat_add.sh
predict_concat_single_decoder.sh		predict_concat_single_decoder.sh
predict_mha_concat.sh		predict_mha_concat.sh
predict_mha_concat_single_decoder.sh		predict_mha_concat_single_decoder.sh
predict_vanilla.sh		predict_vanilla.sh
preprocess.bpe.sh		preprocess.bpe.sh
preprocess.py		preprocess.py
preprocess.sh		preprocess.sh
preprocess_nmt_amr.sh		preprocess_nmt_amr.sh
reduce_embeding_size.py		reduce_embeding_size.py
resume_bertgraph2.py		resume_bertgraph2.py
sentencepiece-2k.model.low		sentencepiece-2k.model.low
sentencepiece-8k.bpe.model.low		sentencepiece-8k.bpe.model.low
sentencepiece.bpe.model		sentencepiece.bpe.model
sentencepiece.bpe.model.mg.4k		sentencepiece.bpe.model.mg.4k
sentencepiece.bpe.model.old		sentencepiece.bpe.model.old
sentencepiece.bpe.model.small		sentencepiece.bpe.model.small
sentencepiece.bpe.model.vi.4k		sentencepiece.bpe.model.vi.4k
sentencepiece.bpe.vi.16k.model		sentencepiece.bpe.vi.16k.model
sentencepiece.bpe.vi.8k.model		sentencepiece.bpe.vi.8k.model
sentencepiece.vocab.small		sentencepiece.vocab.small
switch-cuda.sh		switch-cuda.sh
test_gcn.py		test_gcn.py
test_rouge.py		test_rouge.py
train-multigpu.sh		train-multigpu.sh
train-mutligpu.sh		train-mutligpu.sh
train-radam-multigpu.sh		train-radam-multigpu.sh
train-radam.sh		train-radam.sh
train.py		train.py
train.sh		train.sh
train_bilstm_gat.sh		train_bilstm_gat.sh
train_concat.sh		train_concat.sh
train_concat_single_decoder.sh		train_concat_single_decoder.sh
train_concat_type_encoding.sh		train_concat_type_encoding.sh
train_mha_add_concat.sh		train_mha_add_concat.sh
train_mha_concat.sh		train_mha_concat.sh
train_mha_concat_single_decoder.sh		train_mha_concat_single_decoder.sh
train_seperate.sh		train_seperate.sh
train_vanilla.sh		train_vanilla.sh
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Improving Neural Machine Translation with the Abstract Meaning Representation by Combining Graph and Sequence Transformers

Environment set up

Training

Prediction

Evaluation

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

jlab-nlp/amr-nmt

Folders and files

Latest commit

History

Repository files navigation

Improving Neural Machine Translation with the Abstract Meaning Representation by Combining Graph and Sequence Transformers

Environment set up

Training

Prediction

Evaluation

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages