Multi-Lingual, Multi-Modal, Multi-Task

Training machine translation models on multiple language pairs leads to better generalization and helps low-resource language pairs. Moreover, the input to machine translation may be enriched with information from other modalities, such as images or speech. Finally, machine translation may be just one task of an integrated neural network that also performs other language processing tasks.

Multilingual, multimodal, and multitask translation is the main subject of 71 publications; 42 are discussed here.

Publications

Multi-language training:

Zoph et al. (2016) first train on a high-resource language pair and then adapt the resulting model towards a targeted low-resource language, and show gains over training only on the low-resource language. Nguyen and Chiang (2017) show better results when merging the vocabularies of the different input languages. Ha et al. (2016) prefix each input word with a language identifier (e.g., @en@dog, @de@Hund) and add monolingual data, both as source and target. Ha et al. (2017) observe that translation in multi-language systems with multiple target languages may switch to the wrong language. They limit word predictions to words existing in the desired target language, and also add source-side language-identifying word factors. Lakew et al. (2018) show that Transformer models perform better for multi-language pair training than previous models based on recurrent neural networks. Lakew et al. (2018) build one-to-many translation models for language varieties, i.e., closely related dialects such as Brazilian and European Portuguese or Croatian and Serbian. This requires language variety identification to separate out the training data. Lakew et al. (2018) start with a model trained on a high-resource language pair and then incrementally add low-resource language pairs, including new vocabulary items. They show much faster training convergence and slight quality gains over joint training. Neubig and Hu (2018) train a many-to-one model for 58 language pairs and fine-tune it towards each of them. Aharoni et al. (2019) scale up multi-language training to up to 103 languages, training on language pairs with English on either side, measuring average translation performance from English and into English. They show that many-to-many systems improve over many-to-one systems when translating into English but not over one-to-many systems when translating from English. They also see degradation when combining more than 5 languages. Murthy et al. (2019) identify a problem when a targeted language pair in the multi-language setup is low resource and has different word order from the other language pairs. They propose to pre-order the input to match the word order of the dominant language.
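
As a concrete illustration of the language-tagging idea described for Ha et al. (2016), the following minimal Python sketch prefixes every token with a language identifier; the function name and the exact tag format are illustrative assumptions, not the authors' code.

    def tag_tokens(sentence, lang):
        # Prefix every token with a language identifier, e.g. "dog" -> "@en@dog".
        return " ".join(f"@{lang}@{tok}" for tok in sentence.split())

    # With such tags, one shared vocabulary and one model can serve
    # several source and target languages at once.
    print(tag_tokens("the dog barks", "en"))   # @en@the @en@dog @en@barks
    print(tag_tokens("der Hund bellt", "de"))  # @de@der @de@Hund @de@bellt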

Zero-Shot:

Johnson et al. (2017) explore how well a single canonical neural translation model is able to learn to translate from multiple to multiple languages by simultaneously training on parallel corpora for several language pairs. They show small benefits for several input languages with the same output language, and mixed results for translating into multiple output languages (indicated by an additional input language token). The most interesting result is the ability of such a model to translate in language directions for which no parallel corpus is provided ("zero-shot"), thus demonstrating that some interlingual meaning representation is learned, although this works less well than traditional pivot methods. Mattoni et al. (2017) explore zero-shot training for Indian languages with sparse training data, achieving limited success. Al-Shedivat and Parikh (2019) extend the training objective for zero-shot translation in the scenario of English-X parallel corpora so that, given an English-French sentence pair, the French-Russian and English-Russian translations are consistent.
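
The target-language token behind this zero-shot setup, as described for Johnson et al. (2017), can be sketched in a few lines of Python; the "<2xx>" token format and the helper function are assumptions for illustration only.

    def to_example(src, tgt, tgt_lang):
        # Prepend a target-language token to the source sentence.
        return (f"<2{tgt_lang}> {src}", tgt)

    # Training covers only English->French and English->Russian ...
    train = [
        to_example("the weather is nice", "il fait beau", "fr"),
        to_example("the weather is nice", "погода хорошая", "ru"),
    ]
    # ... yet at test time the same model can be asked for French->Russian
    # ("zero-shot") simply by switching the token, with no fr-ru parallel
    # data ever seen during training.
    zero_shot_input = "<2ru> il fait beau"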

Multi-Language Training with Language-Specific Components:

There have been a few suggestions to alter the model for multi-language pair training. Dong et al. (2015) use different decoders for each target language. Firat et al. (2016) support multi-language input and output by training language-specific encoders and decoders with a shared attention mechanism. Firat et al. (2016) evaluate how well this model works for zero-shot translation. Lu et al. (2018) add an additional interlingua layer between specialized encoders and decoders that is shared across all language pairs. Conversely, Blackwood et al. (2018) use shared encoders and decoders but language-pair-specific attention. Sachan and Neubig (2018) investigate which parameters in a Transformer model should be shared during one-to-many training and find that partial sharing of components outperforms no sharing or full sharing, although the best configuration depends on the languages involved. Wang et al. (2018) add language-dependent positional embeddings and split the decoder state into a general and a language-dependent part. Platanios et al. (2018) generate the language-pair-specific parameters for the encoder and decoder with a parameter generator that takes embeddings of the input and output language identifiers as input.
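
The following toy numpy sketch illustrates the idea of such a parameter generator in the spirit of Platanios et al. (2018): language-pair-specific layer weights are produced from embeddings of the language identifiers. All sizes, names, and the simple linear generator are illustrative assumptions, not the published architecture.

    import numpy as np

    rng = np.random.default_rng(0)
    d_lang, d_in, d_out = 8, 16, 16      # toy sizes

    # One embedding per language; the generator maps a (source, target)
    # pair of language embeddings to the flattened weights of one layer.
    lang_emb = {l: rng.normal(size=d_lang) for l in ("en", "de", "fr")}
    W_gen = 0.01 * rng.normal(size=(2 * d_lang, d_in * d_out))

    def layer_weights(src_lang, tgt_lang):
        # Language-pair-specific parameters produced from language embeddings.
        z = np.concatenate([lang_emb[src_lang], lang_emb[tgt_lang]])
        return (z @ W_gen).reshape(d_in, d_out)

    W_en_de = layer_weights("en", "de")  # weights used when translating en->de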
Gu et al. (2018) frame the multi-language training setup as meta learning, which they define as either learning a policy for updating model parameters or learning a good parameter initialization for fast adaptation. Their approach falls under the second definition and is similar to multi-language training with adaptation via fine-tuning, except that the first training phase is optimized towards parameters that can be quickly adapted.
Gu et al. (2018) focus on the problem of word representation in multi-lingual training. They map the tokens of every language into a universal embedding space, aided by monolingual data. Wang et al. (2019) have the same goal in mind and use language-specific and language-independent character-based word representations to map into a shared word embedding space. This is done for the input words of a 58-language-to-English translation model.
Tan et al. (2019) change the training objective for multi-language training. In addition to matching the training data for each language pair, the model is trained to match the predictions of a "teacher" model that was trained on the corresponding single-language-pair data.
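
A minimal sketch of such a distillation-style objective follows, assuming a single-token toy case and a simple interpolation weight alpha; both are assumptions for illustration, not the exact formulation of Tan et al. (2019).

    import numpy as np

    def softmax(x):
        e = np.exp(x - x.max())
        return e / e.sum()

    def distill_loss(student_logits, gold_index, teacher_probs, alpha=0.5):
        # Mix the usual cross-entropy against the training data with a term
        # that matches the distribution predicted by the single-pair teacher.
        p = softmax(student_logits)
        data_loss = -np.log(p[gold_index])
        teacher_loss = -np.sum(teacher_probs * np.log(p))
        return (1 - alpha) * data_loss + alpha * teacher_loss

    loss = distill_loss(np.array([2.0, 0.5, -1.0]), 0, np.array([0.8, 0.15, 0.05]))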
Malaviya et al. (2017) use the embedding associated with the language indicator token in massively multi-language models to predict typological properties of a language.
Ren et al. (2018) address the challenge of pivot translation (training an X-Z model by using a pivot language Y with large corpora X-Y and Y-Z) in a neural model by setting up training objectives that relate translation through the pivot path to direct translation, as well as other paths in this language triangle.

Multiple Inputs:

Zoph and Knight (2016) augment a translation model to consume two meaning-equivalent sentences in different languages as input. Zhou et al. (2017) apply this idea to the task of system combination, i.e., obtaining a consensus translation from multiple machine translation outputs. Garmash and Monz (2016) train multiple single-language systems, feed each one the corresponding meaning-equivalent input sentence, and combine the predictions of the models in an ensemble approach during decoding. Nishimura et al. (2018) explore how a multi-source model works when the input for some languages is missing. In their experiments, the multi-encoder approach more often works better than the ensemble. Nishimura et al. (2018) fill in the missing sentences in the training data with (multi-source) back-translation. Dabre et al. (2017) concatenate the input sentences and also use training data in the same format (which requires intersecting overlapping parallel corpora).
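
A toy sketch of the ensemble-style combination described for Garmash and Monz (2016): next-token distributions from several single-source models, each reading its own input language, are combined during decoding. The uniform log-linear weighting is an assumption.

    import numpy as np

    def ensemble_next_token(distributions, weights=None):
        # Combine next-token distributions from several single-source models
        # with a log-linear mixture (uniform weights by default).
        distributions = np.asarray(distributions)      # (n_models, vocab_size)
        if weights is None:
            weights = np.full(len(distributions), 1.0 / len(distributions))
        log_mix = (np.asarray(weights)[:, None] * np.log(distributions + 1e-12)).sum(axis=0)
        probs = np.exp(log_mix - log_mix.max())
        return probs / probs.sum()

    # One model reads the French input, another the German input; both vote
    # on the next English token.
    p_fr = np.array([0.7, 0.2, 0.1])
    p_de = np.array([0.6, 0.3, 0.1])
    print(ensemble_next_token([p_fr, p_de]))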

Pre-trained word embeddings:

Di Gangi and Federico (2017) do not observe improvements when using monolingual word embeddings in a gated network that trains additional word embeddings purely on parallel data. Abdou et al. (2017) show worse performance on a WMT news translation task with pre-trained word embeddings. They argue, as Hill et al. (2014) and Hill et al. (2017) did previously, that neural machine translation requires word embeddings that are based on semantic similarity of words (teacher and professor) rather than other kinds of relatedness (teacher and student), and demonstrate that word embeddings trained for translation score better on standard semantic similarity tasks. Artetxe et al. (2018) use monolingually trained word embeddings in a neural machine translation system without using any parallel corpus. Qi et al. (2018) do show gains with pre-trained word embeddings in low-resource conditions, but find that the benefits decrease with larger data sizes.
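
A minimal sketch of the common setup for using pre-trained word embeddings in a neural translation model: the embedding matrix is initialized from monolingual vectors where available and randomly otherwise. This is a generic illustration, not the recipe of any particular paper cited above.

    import numpy as np

    def init_embeddings(vocab, pretrained, dim=512, seed=0):
        # Start from small random vectors and copy in pre-trained monolingual
        # vectors for the words that have one; the rest stay randomly initialized.
        rng = np.random.default_rng(seed)
        emb = 0.01 * rng.normal(size=(len(vocab), dim))
        for i, word in enumerate(vocab):
            if word in pretrained:
                emb[i] = pretrained[word]
        return emb

    vocab = ["the", "dog", "<unk>"]
    pretrained = {"the": 0.1 * np.ones(512), "dog": 0.2 * np.ones(512)}
    E = init_embeddings(vocab, pretrained)   # shape (3, 512)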

Multi-task training:

Niehues and Cho (2017) tackle multiple tasks (translation, part-of-speech tagging, and named entity recognition) with shared components of a sequence-to-sequence model, showing that training on several tasks improves performance on each individual task. Zaremoodi and Haffari (2018) refine this approach with adversarial training that enforces task-independent representations in intermediate layers, and apply it to joint training with syntactic and semantic parsing. Li et al. (2019) add as auxiliary tasks the prediction of hierarchical word classes obtained by Brown clustering. In the first layer of the decoder of a Transformer model, the coarsest word classes are predicted, and in later layers more fine-grained word classes are predicted. The authors argue that this increases the generalization ability of intermediate representations and show improvements in translation quality.
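
A toy numpy sketch of the shared-component setup described for Niehues and Cho (2017): one shared encoder projection with task-specific output heads. Tasks, sizes, and the parameterization are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)
    d = 16                                   # toy hidden size

    # One shared encoder projection, plus one output head per task.
    W_shared = 0.1 * rng.normal(size=(d, d))
    heads = {
        "translation": 0.1 * rng.normal(size=(d, 32000)),  # target vocabulary
        "pos":         0.1 * rng.normal(size=(d, 17)),     # POS tag set
        "ner":         0.1 * rng.normal(size=(d, 9)),      # NER label set
    }

    def forward(x, task):
        # Encode with the shared parameters, predict with the task-specific head.
        h = np.tanh(x @ W_shared)
        return h @ heads[task]

    logits = forward(rng.normal(size=(1, d)), "pos")       # shape (1, 17)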

Benchmarks

Discussion

Related Topics

New Publications

  • Pham et al. (2019)
  • Calixto et al. (2019)
  • Chen et al. (2019)
  • Ive et al. (2019)
  • Kim et al. (2019)
  • Leng et al. (2019)
  • Liu et al. (2019)
  • Sen et al. (2019)
  • Wang et al. (2019)
  • Wang and Neubig (2019)
  • Domhan and Hieber (2017)

Multi-Lingual

  • Lakew et al. (2017)
  • Zhou et al. (2018)
  • Gu et al. (2019)

Multi-Source, Multi-Target

  • Libovický et al. (2018)

Multi-modal (speech, vision)

  • Caglayan et al. (2016)
  • Lala et al. (2019)
  • Singhal et al. (2019)
  • Chowdhury et al. (2018)
  • Schamoni et al. (2018)
  • Shah et al. (2016)
  • Elliott and Kádár (2017)
  • Delbrouck and Dupont (2017)
  • Calixto and Liu (2017)
  • Hitschler et al. (2016)
  • Calixto et al. (2017)
  • Hewitt et al. (2018)
  • Zhou et al. (2018)

Multi-Task

  • Kiperwasser and Ballesteros (2018)
