Neural Network Models
Neural network models received little attention until an explosion of research in the 2010s, triggered by their success in vision and speech recognition. Such models allow for clustering of related words and flexible use of context.
Neural Network Models and its 15 sub-topics are the main subject of 801 publications.
Publications
Basic models to use neural networks for machine translation were already proposed in the 20th century
Alex Waibel and A. N. Jain and A. E. McNair and H. Saito and A.G. Hauptmann and J. Tebelskis (1991):
JANUS: A Speech-to-Speech Translation System using Connectionist and Symbolic Processing Strategies, Proceedings of the 1991 International Conference on Acoustics, Speech and Signal Processing (ICASSP)

@inproceedings{janus,
author = {Alex Waibel and A. N. Jain and A. E. McNair and H. Saito and A.G. Hauptmann and J. Tebelskis},
title = {JANUS: A Speech-to-Speech Translation System using Connectionist and Symbolic Processing Strategies},
booktitle = {Proceedings of the 1991 International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages = {793-796},
location = {Toronto, Canada},
year = 1991
}
(Waibel et al., 1991), but not seriously pursued due to a lack of computational resources. In fact, models quite similar to the ones currently in use date back to that era
Forcada, Mikel L and Ñeco, Ramón P (1997):
Recursive hetero-associative memories for translation, Biological and Artificial Computation: From Neuroscience to Technology
mentioned in Neural Network Models and Computer Aided Translation
@incollection{forcada1997recursive,
author = {Forcada, Mikel L and {\~N}eco, Ram{\'o}n P},
title = {Recursive hetero-associative memories for translation},
booktitle = {Biological and Artificial Computation: From Neuroscience to Technology},
pages = {453--462},
publisher = {Springer},
year = 1997
}
(Forcada and Ñeco, 1997;
M. Asunción Castaño and Francisco Casacuberta and Enrique Vidal (1997):
Machine Translation using Neural Networks and Finite-State Models, Theoretical and Methodological Issues in Machine Translation

@inproceedings{castano-tmi-1997,
author = {M. Asunci{\'o}n Casta{\~n}o and Francisco Casacuberta and Enrique Vidal},
title = {Machine Translation using Neural Networks and Finite-State Models},
booktitle = {Theoretical and Methodological Issues in Machine Translation},
url = {
http://www.mt-archive.info/TMI-1997-Castano.pdf},
pages = {160-167},
year = 1997
}
Castaño et al., 1997).
Schwenk, Holger and Dechelotte, Daniel and Gauvain, Jean-Luc (2006):
Continuous Space Language Models for Statistical Machine Translation, Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
mentioned in Neural Network Models and Neural Language Models
@InProceedings{schwenk-dechelotte-gauvain:2006:POS,
author = {Schwenk, Holger and Dechelotte, Daniel and Gauvain, Jean-Luc},
title = {Continuous Space Language Models for Statistical Machine Translation},
booktitle = {Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {723--730},
url = {
http://www.aclweb.org/anthology/P/P06/P06-2093},
year = 2006
}
Schwenk et al. (2006) introduce neural language models (also called "continuous space language models") to machine translation and use them in re-ranking, similar to earlier work in speech recognition.
The first competitive fully neural machine translation system participated in the WMT evaluation campaign in 2015
Jean, Sébastien and Firat, Orhan and Cho, Kyunghyun and Memisevic, Roland and Bengio, Yoshua (2015):
Montreal Neural Machine Translation Systems for WMT'15, Proceedings of the Tenth Workshop on Statistical Machine Translation

@InProceedings{jean-EtAl:2015:WMT,
author = {Jean, S\'{e}bastien and Firat, Orhan and Cho, Kyunghyun and Memisevic, Roland and Bengio, Yoshua},
title = {Montreal Neural Machine Translation Systems for WMT'15},
booktitle = {Proceedings of the Tenth Workshop on Statistical Machine Translation},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {134--140},
url = {
http://aclweb.org/anthology/W15-3014},
year = 2015
}
(Jean et al., 2015), reaching state-of-the-art performance at IWSLT 2015
Minh-Thang Luong and Christopher Manning (2015):
Stanford neural machine translation systems for spoken language domains, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
mentioned in Neural Network Models and Adaptation
@inproceedings{IWSLT-2015-Luong,
author = {Minh-Thang Luong and Christopher Manning},
title = {Stanford neural machine translation systems for spoken language domains},
pages = {76-79},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Da Nang, Vietnam},
url = {
http://www.mt-archive.info/15/IWSLT-2015-luong.pdf},
month = {December},
year = 2015
}
(Luong and Manning, 2015) and WMT 2016
Sennrich, Rico and Haddow, Barry and Birch, Alexandra (2016):
Edinburgh Neural Machine Translation Systems for WMT 16, Proceedings of the First Conference on Machine Translation
mentioned in Neural Network Models and Inference
@InProceedings{sennrich-haddow-birch:2016:WMT,
author = {Sennrich, Rico and Haddow, Barry and Birch, Alexandra},
title = {Edinburgh Neural Machine Translation Systems for WMT 16},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {371--376},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2323},
year = 2016
}
(Sennrich et al., 2016). The same year, Systran
Josep Maria Crego and Jungi Kim and Guillaume Klein and Anabel Rebollo and Kathy Yang and Jean Senellart and Egor Akhanov and Patrice Brunelle and Aurelien Coquard and Yongchao Deng and Satoshi Enoue and Chiyo Geiss and Joshua Johanson and Ardas Khalsa and Raoum Khiari and Byeongil Ko and Catherine Kobus and Jean Lorieux and Leidiana Martins and Dang-Chuan Nguyen and Alexandra Priori and Thomas Riccardi and Natalia Segal and Christophe Servan and Cyril Tiquet and Bo Wang and Jin Yang and Dakun Zhang and Jing Zhou and Peter Zoldan (2016):
SYSTRAN's Pure Neural Machine Translation Systems, CoRR

@article{DBLP:journals/corr/CregoKKRYSABCDE16,
author = {Josep Maria Crego and Jungi Kim and Guillaume Klein and Anabel Rebollo and Kathy Yang and Jean Senellart and Egor Akhanov and Patrice Brunelle and Aurelien Coquard and Yongchao Deng and Satoshi Enoue and Chiyo Geiss and Joshua Johanson and Ardas Khalsa and Raoum Khiari and Byeongil Ko and Catherine Kobus and Jean Lorieux and Leidiana Martins and Dang{-}Chuan Nguyen and Alexandra Priori and Thomas Riccardi and Natalia Segal and Christophe Servan and Cyril Tiquet and Bo Wang and Jin Yang and Dakun Zhang and Jing Zhou and Peter Zoldan},
title = {SYSTRAN's Pure Neural Machine Translation Systems},
journal = {CoRR},
volume = {abs/1610.05540},
url = {
http://arxiv.org/abs/1610.05540},
timestamp = {Wed, 02 Nov 2016 09:51:26 +0100},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/CregoKKRYSABCDE16},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2016
}
(Crego et al., 2016), Google
Yonghui Wu and Mike Schuster and Zhifeng Chen and Quoc V. Le and Mohammad Norouzi and Wolfgang Macherey and Maxim Krikun and Yuan Cao and Qin Gao and Klaus Macherey and Jeff Klingner and Apurva Shah and Melvin Johnson and Xiaobing Liu and Lukasz Kaiser and Stephan Gouws and Yoshikiyo Kato and Taku Kudo and Hideto Kazawa and Keith Stevens and George Kurian and Nishant Patil and Wei Wang and Cliff Young and Jason Smith and Jason Riesa and Alex Rudnick and Oriol Vinyals and Greg Corrado and Macduff Hughes and Jeffrey Dean (2016):
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, CoRR
mentioned in Neural Network Models and Attention Model
@article{DBLP:journals/corr/WuSCLNMKCGMKSJL16,
author = {Yonghui Wu and Mike Schuster and Zhifeng Chen and Quoc V. Le and Mohammad Norouzi and Wolfgang Macherey and Maxim Krikun and Yuan Cao and Qin Gao and Klaus Macherey and Jeff Klingner and Apurva Shah and Melvin Johnson and Xiaobing Liu and Lukasz Kaiser and Stephan Gouws and Yoshikiyo Kato and Taku Kudo and Hideto Kazawa and Keith Stevens and George Kurian and Nishant Patil and Wei Wang and Cliff Young and Jason Smith and Jason Riesa and Alex Rudnick and Oriol Vinyals and Greg Corrado and Macduff Hughes and Jeffrey Dean},
title = {Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation},
journal = {CoRR},
volume = {abs/1609.08144},
url = {
http://arxiv.org/abs/1609.08144.pdf},
timestamp = {Mon, 03 Oct 2016 17:51:10 +0200},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/WuSCLNMKCGMKSJL16},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2016
}
(Wu et al., 2016), and WIPO
Marcin Junczys-Dowmunt and Tomasz Dwojak and Hieu Hoang (2016):
Is Neural Machine Translation Ready for Deployment? A Case Study on 30 Translation Directions, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)

@inproceedings{IWSLT-2016-Junczys-Dowmunt,
author = {Marcin Junczys-Dowmunt and Tomasz Dwojak and Hieu Hoang},
title = {Is Neural Machine Translation Ready for Deployment? A Case Study on 30 Translation Directions},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Seattle, Washington, USA},
url = {
http://workshop2016.iwslt.org/downloads/IWSLT\_2016\_paper\_4.pdf},
month = {December},
year = 2016
}
(Junczys-Dowmunt et al., 2016) reported large-scale deployments.
Graham Neubig (2017):
Neural Machine Translation and Sequence-to-sequence Models: A Tutorial, CoRR

@article{DBLP:journals/corr/Neubig17,
author = {Graham Neubig},
title = {Neural Machine Translation and Sequence-to-sequence Models: {A} Tutorial},
journal = {CoRR},
volume = {abs/1703.01619},
url = {
http://arxiv.org/abs/1703.01619},
timestamp = {Wed, 07 Jun 2017 14:40:22 +0200},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/Neubig17},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2017
}
Neubig (2017) presents a hands-on tutorial on neural machine translation models.
Technical Background:
A good introduction to modern neural network research is the textbook
Deep Learning
Ian Goodfellow and Yoshua Bengio and Aaron Courville (2016):
Deep Learning

@book{Goodfellow-et-al-2016,
author = {Ian Goodfellow and Yoshua Bengio and Aaron Courville},
title = {Deep Learning},
publisher = {MIT Press},
note = {\url{
http://www.deeplearningbook.org}},
year = 2016
}
(Goodfellow et al., 2016). There is also a book on neural network methods applied to natural language processing in general
Goldberg, Yoav (2017):
Neural Network Methods for Natural Language Processing

@book{Goldberg17,
author = {Goldberg, Yoav},
title = {Neural Network Methods for Natural Language Processing},
address = {San Rafael, CA},
doi = {10.2200/S00762ED1V01Y201703HLT037},
groups = {public},
isbn = {978-1-62705-298-6},
issn = {1947-4040},
publisher = {Morgan \& Claypool},
series = {Synthesis Lectures on Human Language Technologies},
volume = {37},
year = 2017
}
(Goldberg, 2017).
Toolkits:
There are several toolkits that implement various neural translation models.
- OpenNMT
Klein, Guillaume and Kim, Yoon and Deng, Yuntian and Senellart, Jean and Rush, Alexander (2017):
OpenNMT: Open-Source Toolkit for Neural Machine Translation, Proceedings of ACL 2017, System Demonstrations

@InProceedings{klein-EtAl:2017:ACL-2017-System-Demonstrations,
author = {Klein, Guillaume and Kim, Yoon and Deng, Yuntian and Senellart, Jean and Rush, Alexander},
title = {OpenNMT: Open-Source Toolkit for Neural Machine Translation},
booktitle = {Proceedings of ACL 2017, System Demonstrations},
month = {July},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages = {67--72},
url = {
http://aclweb.org/anthology/P17-4012},
year = 2017
}
(Klein et al., 2017;
Klein, Guillaume and Kim, Yoon and Deng, Yuntian and Nguyen, Vincent and Senellart, Jean and Rush, Alexander (2018):
OpenNMT: Neural Machine Translation Toolkit, Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers)

@inproceedings{W18-1817,
author = {Klein, Guillaume and Kim, Yoon and Deng, Yuntian and Nguyen, Vincent and Senellart, Jean and Rush, Alexander},
title = {OpenNMT: Neural Machine Translation Toolkit},
booktitle = {Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers)},
month = {mar},
address = {Boston, MA},
publisher = {Association for Machine Translation in the Americas},
url = {
https://www.aclweb.org/anthology/W18-1817},
pages = {177--184},
year = 2018
}
Klein et al., 2018) is a more recent popular toolkit based on PyTorch
- fairseq
Ott, Myle and Edunov, Sergey and Baevski, Alexei and Fan, Angela and Gross, Sam and Ng, Nathan and Grangier, David and Auli, Michael (2019):
fairseq: A Fast, Extensible Toolkit for Sequence Modeling, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations)

@inproceedings{ott-etal-2019-fairseq,
author = {Ott, Myle and Edunov, Sergey and Baevski, Alexei and Fan, Angela and Gross, Sam and Ng, Nathan and Grangier, David and Auli, Michael},
title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
booktitle = {Proceedings of the 2019 Conference of the North {A}merican Chapter of the Association for Computational Linguistics (Demonstrations)},
month = {jun},
address = {Minneapolis, Minnesota},
publisher = {Association for Computational Linguistics},
url = {
https://www.aclweb.org/anthology/N19-4009},
pages = {48--53},
year = 2019
}
(Ott et al., 2019) is based on PyTorch and supported by Facebook
- Sockeye
Felix Hieber and Tobias Domhan and Michael Denkowski and David Vilar and Artem Sokolov and Ann Clifton and Matt Post (2018):
The Sockeye Neural Machine Translation Toolkit at AMTA 2018, Annual Meeting of the Association for Machine Translation in the Americas (AMTA)

@inproceedings{AMTA2018-Hieber,
author = {Felix Hieber and Tobias Domhan and Michael Denkowski and David Vilar and Artem Sokolov and Ann Clifton and Matt Post},
title = {The Sockeye Neural Machine Translation Toolkit at AMTA 2018},
booktitle = {Annual Meeting of the Association for Machine Translation in the Americas (AMTA)},
location = {Boston, USA},
year = 2018
}
(Hieber et al., 2018) is based on MXNet and supported by Amazon
- Marian
Junczys-Dowmunt, Marcin and Grundkiewicz, Roman and Dwojak, Tomasz and Hoang, Hieu and Heafield, Kenneth and Neckermann, Tom and Seide, Frank and Germann, Ulrich and Aji, Alham Fikri and Bogoychev, Nikolay and Martins, André F. T. and Birch, Alexandra (2018):
Marian: Fast Neural Machine Translation in C++, Proceedings of ACL 2018, System Demonstrations

@InProceedings{P18-4020,
author = {Junczys-Dowmunt, Marcin and Grundkiewicz, Roman and Dwojak, Tomasz and Hoang, Hieu and Heafield, Kenneth and Neckermann, Tom and Seide, Frank and Germann, Ulrich and Aji, Alham Fikri and Bogoychev, Nikolay and Martins, Andr{\'e} F. T. and Birch, Alexandra},
title = {Marian: Fast Neural Machine Translation in C++},
booktitle = {Proceedings of ACL 2018, System Demonstrations},
publisher = {Association for Computational Linguistics},
pages = {116--121},
location = {Melbourne, Australia},
url = {
http://aclweb.org/anthology/P18-4020},
year = 2018
}
(Junczys-Dowmunt et al., 2018a;
Junczys-Dowmunt, Marcin and Heafield, Kenneth and Hoang, Hieu and Grundkiewicz, Roman and Aue, Anthony (2018):
Marian: Cost-effective High-Quality Neural Machine Translation in C++, Proceedings of the 2nd Workshop on Neural Machine Translation and Generation

@InProceedings{W18-2716,
author = {Junczys-Dowmunt, Marcin and Heafield, Kenneth and Hoang, Hieu and Grundkiewicz, Roman and Aue, Anthony},
title = {Marian: Cost-effective High-Quality Neural Machine Translation in C++},
booktitle = {Proceedings of the 2nd Workshop on Neural Machine Translation and Generation},
publisher = {Association for Computational Linguistics},
pages = {129--135},
location = {Melbourne, Australia},
url = {
http://aclweb.org/anthology/W18-2716},
year = 2018
}
Junczys-Dowmunt et al., 2018b) is a C++ implementation focused on fast training and decoding
- XNMT
Neubig, Graham and Sperber, Matthias and Wang, Xinyi and Felix, Matthieu and Matthews, Austin and Padmanabhan, Sarguna and Qi, Ye and Sachan, Devendra and Arthur, Philip and Godard, Pierre and Hewitt, John and Riad, Rachid and Wang, Liming (2018):
XNMT: The eXtensible Neural Machine Translation Toolkit, Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers)

@inproceedings{W18-1818,
author = {Neubig, Graham and Sperber, Matthias and Wang, Xinyi and Felix, Matthieu and Matthews, Austin and Padmanabhan, Sarguna and Qi, Ye and Sachan, Devendra and Arthur, Philip and Godard, Pierre and Hewitt, John and Riad, Rachid and Wang, Liming},
title = {XNMT: The eXtensible Neural Machine Translation Toolkit},
booktitle = {Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers)},
month = {mar},
address = {Boston, MA},
publisher = {Association for Machine Translation in the Americas},
url = {
https://www.aclweb.org/anthology/W18-1818},
pages = {185--192},
year = 2018
}
(Neubig et al., 2018) is also a self-contained toolkit with Python and C++ hooks for extension
- Tensor2Tensor
Ashish Vaswani and Samy Bengio and Eugene Brevdo and Francois Chollet and Aidan N. Gomez and Stephan Gouws and Llion Jones and Łukasz Kaiser and Nal Kalchbrenner and Niki Parmar and Ryan Sepassi and Noam Shazeer and Jakob Uszkoreit (2018):
Tensor2Tensor for Neural Machine Translation, Annual Meeting of the Association for Machine Translation in the Americas (AMTA)

@inproceedings{AMTA2018-Vaswani,
author = {Ashish Vaswani and Samy Bengio and Eugene Brevdo and Francois Chollet and Aidan N. Gomez and Stephan Gouws and Llion Jones and Łukasz Kaiser and Nal Kalchbrenner and Niki Parmar and Ryan Sepassi and Noam Shazeer and Jakob Uszkoreit},
title = {Tensor2Tensor for Neural Machine Translation},
booktitle = {Annual Meeting of the Association for Machine Translation in the Americas (AMTA)},
location = {Boston, USA},
year = 2018
}
(Vaswani et al., 2018) is the original implementation of the Transformer model by Google
- Neural Monkey
Jindřich Helcl and Jindřich Libovický and Tom Kocmi and Tomáš Musil and Ondřej Cífka and Dusan Varis and Ondřej Bojar (2018):
Neural Monkey: The Current State and Beyond, Annual Meeting of the Association for Machine Translation in the Americas (AMTA)

@inproceedings{AMTA2018-Helcl,
author = {Jind\v{r}ich Helcl and Jind\v{r}ich Libovický and Tom Kocmi and Tom\'{a}\v{s} Musil and Ond\v{r}ej Cífka and Dusan Varis and Ond\v{r}ej Bojar},
title = {Neural Monkey: The Current State and Beyond},
booktitle = {Annual Meeting of the Association for Machine Translation in the Americas (AMTA)},
location = {Boston, USA},
url = {
https://www.aclweb.org/anthology/W18-1816},
year = 2018
}
(Helcl et al., 2018) is based on TensorFlow
- CytonMT
Wang, Xiaolin and Utiyama, Masao and Sumita, Eiichiro (2018):
CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

@inproceedings{D18-2023,
author = {Wang, Xiaolin and Utiyama, Masao and Sumita, Eiichiro},
title = {CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++},
booktitle = {Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations},
address = {Brussels, Belgium},
publisher = {Association for Computational Linguistics},
url = {
https://www.aclweb.org/anthology/D18-2023},
pages = {133--138},
year = 2018
}
(Wang et al., 2018) is an efficient toolkit implemented in C++
- Nematus
Sennrich, Rico and Firat, Orhan and Cho, Kyunghyun and Birch, Alexandra and Haddow, Barry and Hitschler, Julian and Junczys-Dowmunt, Marcin and Läubli, Samuel and Miceli Barone, Antonio Valerio and Mokry, Jozef and Nadejde, Maria (2017):
Nematus: a Toolkit for Neural Machine Translation, Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics

@InProceedings{sennrich-EtAl:2017:EACLDemo,
author = {Sennrich, Rico and Firat, Orhan and Cho, Kyunghyun and Birch, Alexandra and Haddow, Barry and Hitschler, Julian and Junczys-Dowmunt, Marcin and L\"{a}ubli, Samuel and Miceli Barone, Antonio Valerio and Mokry, Jozef and Nadejde, Maria},
title = {Nematus: a Toolkit for Neural Machine Translation},
booktitle = {Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {65--68},
url = {
http://aclweb.org/anthology/E17-3017},
year = 2017
}
(Sennrich et al., 2017) is an early influential toolkit based on Theano
- Kyoto-NMT
Cromieres, Fabien (2016):
Kyoto-NMT: a Neural Machine Translation implementation in Chainer, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations

@InProceedings{cromieres:2016:COLINGDEMO,
author = {Cromieres, Fabien},
title = {Kyoto-NMT: a Neural Machine Translation implementation in Chainer},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {307--311},
url = {
http://aclweb.org/anthology/C16-2064},
year = 2016
}
(Cromieres, 2016) is an implementation in Chainer
- SGNMT
Stahlberg, Felix and Saunders, Danielle and Iglesias, Gonzalo and Byrne, Bill (2018):
Why not be Versatile? Applications of the SGNMT Decoder for Machine Translation, Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers)

@inproceedings{W18-1821,
author = {Stahlberg, Felix and Saunders, Danielle and Iglesias, Gonzalo and Byrne, Bill},
title = {Why not be Versatile? Applications of the SGNMT Decoder for Machine Translation},
booktitle = {Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers)},
month = {mar},
address = {Boston, MA},
publisher = {Association for Machine Translation in the Americas},
url = {
https://www.aclweb.org/anthology/W18-1821},
pages = {208--216},
year = 2018
}
(Stahlberg et al., 2018) is a decoder that allows the combination of models implemented with different toolkits
Benchmarks
Discussion
Related Topics
New Publications
Ojha, Atul Kr and Kumar, Ritesh and Bansal, Akanksha and Rani, Priya (2019):
Panlingua-KMI MT System for Similar Language Translation Task at WMT 2019, Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)
mentioned in Phrase Based Models and Neural Network Models
@inproceedings{ojha2019panlingua,
author = {Ojha, Atul Kr and Kumar, Ritesh and Bansal, Akanksha and Rani, Priya},
title = {Panlingua-KMI MT System for Similar Language Translation Task at WMT 2019},
booktitle = {Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)},
pages = {213--218},
year = 2019
}
Ojha et al. (2019)
Johanes Effendi and Sakriani Sakti and Katsuhito Sudoh and Satoshi Nakamura (2018):
Multiparaphrase Augmentation to Leverage Neural Caption Translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)

@inproceedings{iwslt18-Multiparaphrase-Effendi,
author = {Johanes Effendi and Sakriani Sakti and Katsuhito Sudoh and Satoshi Nakamura},
title = {Multiparaphrase Augmentation to Leverage Neural Caption Translation},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2018
}
Effendi et al. (2018)
Yuto Takebayashi and Chenhui Chu and Yuki Arase and Masaaki Nagata (2018):
Word Rewarding for Adequate Neural Machine Translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)

@inproceedings{iwslt18-Word-Rewarding-Takebayashi,
author = {Yuto Takebayashi and Chenhui Chu and Yuki Arase and Masaaki Nagata},
title = {Word Rewarding for Adequate Neural Machine Translation},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2018
}
Takebayashi et al. (2018)
Pinnis, Mārcis and Krišlauks, Rihards and Miks, Toms and Deksne, Daiga and Šics, Valters (2017):
Tilde's Machine Translation Systems for WMT 2017, Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers

@InProceedings{pinnis-EtAl:2017:WMT,
author = {Pinnis, M\={a}rcis and Kri\v{s}lauks, Rihards and Miks, Toms and Deksne, Daiga and \v{S}ics, Valters},
title = {Tilde's Machine Translation Systems for WMT 2017},
booktitle = {Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers},
month = {September},
address = {Copenhagen, Denmark},
publisher = {Association for Computational Linguistics},
pages = {374--381},
url = {
http://www.aclweb.org/anthology/W17-4737},
year = 2017
}
Pinnis et al. (2017)
Rikters, Matīss and Amrhein, Chantal and Del, Maksym and Fishel, Mark (2017):
C-3MA: Tartu-Riga-Zurich Translation Systems for WMT17, Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers

@InProceedings{rikters-EtAl:2017:WMT,
author = {Rikters, Mat\={\i}ss and Amrhein, Chantal and Del, Maksym and Fishel, Mark},
title = {C-3MA: Tartu-Riga-Zurich Translation Systems for WMT17},
booktitle = {Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers},
month = {September},
address = {Copenhagen, Denmark},
publisher = {Association for Computational Linguistics},
pages = {382--388},
url = {
http://www.aclweb.org/anthology/W17-4738},
year = 2017
}
Rikters et al. (2017)
Junczys-Dowmunt, Marcin (2018):
Microsoft's Submission to the WMT2018 News Translation Task: How I Learned to Stop Worrying and Love the Data, Proceedings of the Third Conference on Machine Translation: Shared Task Papers

@inproceedings{W18-6415,
author = {Junczys-Dowmunt, Marcin},
title = {Microsoft{'}s Submission to the WMT2018 News Translation Task: How I Learned to Stop Worrying and Love the Data},
booktitle = {Proceedings of the Third Conference on Machine Translation: Shared Task Papers},
month = {oct},
address = {Belgium, Brussels},
publisher = {Association for Computational Linguistics},
url = {
https://www.aclweb.org/anthology/W18-6415},
pages = {425--430},
year = 2018
}
Junczys-Dowmunt (2018)
Pinnis, Marcis and Rikters, Matiss and Krišlauks, Rihards (2018):
Tilde's Machine Translation Systems for WMT 2018, Proceedings of the Third Conference on Machine Translation: Shared Task Papers

@inproceedings{W18-6423,
author = {Pinnis, Marcis and Rikters, Matiss and Kri{\v{s}}lauks, Rihards},
title = {Tilde{'}s Machine Translation Systems for WMT 2018},
booktitle = {Proceedings of the Third Conference on Machine Translation: Shared Task Papers},
month = {oct},
address = {Belgium, Brussels},
publisher = {Association for Computational Linguistics},
url = {
https://www.aclweb.org/anthology/W18-6423},
pages = {473--481},
year = 2018
}
Pinnis et al. (2018)
Weng, Rongxiang and Huang, Shujian and Zheng, Zaixiang and Dai, Xin-Yu and Chen, Jiajun (2017):
Neural Machine Translation with Word Predictions, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

@InProceedings{D17-1013,
author = {Weng, Rongxiang and Huang, Shujian and Zheng, Zaixiang and Dai, Xin-Yu and Chen, Jiajun},
title = {Neural Machine Translation with Word Predictions},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {136--145},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1013},
year = 2017
}
Weng et al. (2017)
Sperber, Matthias and Neubig, Graham and Niehues, Jan and Waibel, Alex (2017):
Neural Lattice-to-Sequence Models for Uncertain Inputs, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

@InProceedings{D17-1146,
author = {Sperber, Matthias and Neubig, Graham and Niehues, Jan and Waibel, Alex},
title = {Neural Lattice-to-Sequence Models for Uncertain Inputs},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {1391--1400},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1146},
year = 2017
}
Sperber et al. (2017)
Feng, Yang and Zhang, Shiyue and Zhang, Andi and Wang, Dong and Abel, Andrew (2017):
Memory-augmented Neural Machine Translation, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

@InProceedings{D17-1147,
author = {Feng, Yang and Zhang, Shiyue and Zhang, Andi and Wang, Dong and Abel, Andrew},
title = {Memory-augmented Neural Machine Translation},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {1401--1410},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1147},
year = 2017
}
Feng et al. (2017)
Dahlmann, Leonard and Matusov, Evgeny and Petrushkov, Pavel and Khadivi, Shahram (2017):
Neural Machine Translation Leveraging Phrase-based Models in a Hybrid Search, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

@InProceedings{D17-1149,
author = {Dahlmann, Leonard and Matusov, Evgeny and Petrushkov, Pavel and Khadivi, Shahram},
title = {Neural Machine Translation Leveraging Phrase-based Models in a Hybrid Search},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {1422--1431},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1149},
year = 2017
}
Dahlmann et al. (2017)
Wang, Xing and Tu, Zhaopeng and Xiong, Deyi and Zhang, Min (2017):
Translating Phrases in Neural Machine Translation, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

@InProceedings{D17-1150,
author = {Wang, Xing and Tu, Zhaopeng and Xiong, Deyi and Zhang, Min},
title = {Translating Phrases in Neural Machine Translation},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {1432--1442},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1150},
year = 2017
}
Wang et al. (2017)
Zhang, Xiaowei and Chen, Wei and Wang, Feng and Xu, Shuang and Xu, Bo (2017):
Towards Compact and Fast Neural Machine Translation Using a Combined Method, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
mentioned in Neural Network Models and Inference
@InProceedings{D17-1154,
author = {Zhang, Xiaowei and Chen, Wei and Wang, Feng and Xu, Shuang and Xu, Bo},
title = {Towards Compact and Fast Neural Machine Translation Using a Combined Method},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {1476--1482},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1154},
year = 2017
}
Zhang et al. (2017)
Stahlberg, Felix and Byrne, Bill (2017):
Unfolding and Shrinking Neural Machine Translation Ensembles, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
mentioned in Neural Network Models and Inference
@InProceedings{D17-1207,
author = {Stahlberg, Felix and Byrne, Bill},
title = {Unfolding and Shrinking Neural Machine Translation Ensembles},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {1936--1946},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1207},
year = 2017
}
Stahlberg and Byrne (2017)
Devlin, Jacob (2017):
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
mentioned in Neural Network Models and Inference
@InProceedings{D17-1299,
author = {Devlin, Jacob},
title = {Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {2810--2815},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1300},
year = 2017
}
Devlin (2017)
Wang, Longyue and Tu, Zhaopeng and Way, Andy and Liu, Qun (2017):
Exploiting Cross-Sentence Context for Neural Machine Translation, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

@InProceedings{D17-1300,
author = {Wang, Longyue and Tu, Zhaopeng and Way, Andy and Liu, Qun},
title = {Exploiting Cross-Sentence Context for Neural Machine Translation},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
pages = {2816--2821},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-1300},
year = 2017
}
Wang et al. (2017)
Stahlberg, Felix and Hasler, Eva and Saunders, Danielle and Byrne, Bill (2017):
SGNMT -- A Flexible NMT Decoding Platform for Quick Prototyping of New Models and Search Strategies, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

@InProceedings{D17-2005,
author = {Stahlberg, Felix and Hasler, Eva and Saunders, Danielle and Byrne, Bill},
title = {SGNMT -- A Flexible {NMT} Decoding Platform for Quick Prototyping of New Models and Search Strategies},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations},
publisher = {Association for Computational Linguistics},
pages = {25--30},
location = {Copenhagen, Denmark},
url = {
http://aclweb.org/anthology/D17-2005},
year = 2017
}
Stahlberg et al. (2017)
- Melo (2015)
Marta R. Costa-jussà and Alexandre Allauzen and Loïc Barrault and Kyunghyun Cho and Holger Schwenk (2017):
Introduction to the special issue on deep learning approaches for machine translation, Computer Speech & Language

@article{COSTAJUSSA2017367,
author = {Marta R. Costa-juss{\`a} and Alexandre Allauzen and Lo\"{i}c Barrault and Kyunghyun Cho and Holger Schwenk},
title = {Introduction to the special issue on deep learning approaches for machine translation},
journal = {Computer Speech \& Language},
volume = {46},
pages = {367--373},
issn = {0885-2308},
doi = {
http://dx.doi.org/10.1016/j.csl.2017.03.001},
url = {
http://www.sciencedirect.com/science/article/pii/S0885230816303965},
keywords = {Deep learning},
year = 2017
}
Costa-jussà et al. (2017)
Gupta, Rohit and Orasan, Constantin and van Genabith, Josef (2015):
Machine Translation Evaluation using Recurrent Neural Networks, Proceedings of the Tenth Workshop on Statistical Machine Translation

@InProceedings{gupta-orasan-vangenabith:2015:WMT,
author = {Gupta, Rohit and Orasan, Constantin and van Genabith, Josef},
title = {Machine Translation Evaluation using Recurrent Neural Networks},
booktitle = {Proceedings of the Tenth Workshop on Statistical Machine Translation},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {380--384},
url = {
http://aclweb.org/anthology/W15-3047},
year = 2015
}
Gupta et al. (2015)
Markus Müller and Sebastian Stücker and Zaid Sheikh and Florian Metze and Alex Waibel (2014):
Multilingual Deep Bottle Neck Features - A Study on Language Selection and Training Techniques, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)

@inproceedings{Mueller:iwslt:2014,
author = {Markus M{\"u}ller and Sebastian St{\"u}cker and Zaid Sheikh and Florian Metze and Alex Waibel},
title = {Multilingual Deep Bottle Neck Features - A Study on Language Selection and Training Techniques},
pages = {257--264},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2014
}
Müller et al. (2014)
Rico Sennrich and Barry Haddow and Alexandra Birch (2015):
Neural Machine Translation of Rare Words with Subword Units, CoRR

@article{DBLP:journals/corr/SennrichHB15,
author = {Rico Sennrich and Barry Haddow and Alexandra Birch},
title = {Neural Machine Translation of Rare Words with Subword Units},
journal = {CoRR},
volume = {abs/1508.07909},
url = {
http://arxiv.org/abs/1508.07909},
timestamp = {Tue, 01 Sep 2015 14:42:40 +0200},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/SennrichHB15},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2015
}
Sennrich et al. (2015)
Rico Sennrich and Barry Haddow and Alexandra Birch (2015):
Improving Neural Machine Translation Models with Monolingual Data, CoRR

@article{DBLP:journals/corr/SennrichHB15a,
author = {Rico Sennrich and Barry Haddow and Alexandra Birch},
title = {Improving Neural Machine Translation Models with Monolingual Data},
journal = {CoRR},
volume = {abs/1511.06709},
url = {
http://arxiv.org/abs/1511.06709},
timestamp = {Tue, 01 Dec 2015 19:22:34 +0100},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/SennrichHB15a},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2015
}
Sennrich et al. (2015)
Kai Zhao and Hany Hassan and Michael Auli (2015):
Learning Translation Models from Monolingual Continuous Representations, Proceedings of NAACL

@inproceedings{zhao:2015:naacl,
author = {Kai Zhao and Hany Hassan and Michael Auli},
title = {Learning Translation Models from Monolingual Continuous Representations},
booktitle = {Proceedings of NAACL},
url = {
http://michaelauli.github.io/papers/dist\_phrase\_learn.pdf},
year = 2015
}
Zhao et al. (2015)
Heyman, Geert and Vulić, Ivan and Moens, Marie-Francine (2017):
Bilingual Lexicon Induction by Learning to Combine Word-Level and Character-Level Representations, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers

@InProceedings{heyman-vulic-moens:2017:EACLlong,
author = {Heyman, Geert and Vuli\'{c}, Ivan and Moens, Marie-Francine},
title = {Bilingual Lexicon Induction by Learning to Combine Word-Level and Character-Level Representations},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {1085--1095},
url = {
http://www.aclweb.org/anthology/E17-1102},
year = 2017
}
Heyman et al. (2017)
Silva de Carvalho, Danilo and Nguyen, Minh Le (2017):
Building Lexical Vector Representations from Concept Definitions, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers

@InProceedings{silvadecarvalho-nguyen:2017:EACLlong,
author = {Silva de Carvalho, Danilo and Nguyen, Minh Le},
title = {Building Lexical Vector Representations from Concept Definitions},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {905--915},
url = {
http://www.aclweb.org/anthology/E17-1085},
year = 2017
}
Carvalho and Nguyen (2017)
Carpuat, Marine and Vyas, Yogarshi and Niu, Xing (2017):
Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation, Proceedings of the First Workshop on Neural Machine Translation
mentioned in Neural Network Models and Corpus Cleaning
@InProceedings{carpuat-vyas-niu:2017:NMT,
author = {Carpuat, Marine and Vyas, Yogarshi and Niu, Xing},
title = {Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation},
booktitle = {Proceedings of the First Workshop on Neural Machine Translation},
month = {August},
address = {Vancouver},
publisher = {Association for Computational Linguistics},
pages = {69--79},
url = {
http://www.aclweb.org/anthology/W17-3209},
year = 2017
}
Carpuat et al. (2017)
Denkowski, Michael and Neubig, Graham (2017):
Stronger Baselines for Trustable Results in Neural Machine Translation, Proceedings of the First Workshop on Neural Machine Translation

@InProceedings{denkowski-neubig:2017:NMT,
author = {Denkowski, Michael and Neubig, Graham},
title = {Stronger Baselines for Trustable Results in Neural Machine Translation},
booktitle = {Proceedings of the First Workshop on Neural Machine Translation},
month = {August},
address = {Vancouver},
publisher = {Association for Computational Linguistics},
pages = {18--27},
url = {
http://www.aclweb.org/anthology/W17-3203},
year = 2017
}
Denkowski and Neubig (2017)
Goto, Isao and Tanaka, Hideki (2017):
Detecting Untranslated Content for Neural Machine Translation, Proceedings of the First Workshop on Neural Machine Translation

@InProceedings{goto-tanaka:2017:NMT,
author = {Goto, Isao and Tanaka, Hideki},
title = {Detecting Untranslated Content for Neural Machine Translation},
booktitle = {Proceedings of the First Workshop on Neural Machine Translation},
month = {August},
address = {Vancouver},
publisher = {Association for Computational Linguistics},
pages = {47--55},
url = {
http://www.aclweb.org/anthology/W17-3206},
year = 2017
}
Goto and Tanaka (2017)
Morishita, Makoto and Oda, Yusuke and Neubig, Graham and Yoshino, Koichiro and Sudoh, Katsuhito and Nakamura, Satoshi (2017):
An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation, Proceedings of the First Workshop on Neural Machine Translation

@InProceedings{morishita-EtAl:2017:NMT,
author = {Morishita, Makoto and Oda, Yusuke and Neubig, Graham and Yoshino, Koichiro and Sudoh, Katsuhito and Nakamura, Satoshi},
title = {An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation},
booktitle = {Proceedings of the First Workshop on Neural Machine Translation},
month = {August},
address = {Vancouver},
publisher = {Association for Computational Linguistics},
pages = {61--68},
url = {
http://www.aclweb.org/anthology/W17-3208},
year = 2017
}
Morishita et al. (2017)
Shu, Raphael and Nakayama, Hideki (2017):
An Empirical Study of Adequate Vision Span for Attention-Based Neural Machine Translation, Proceedings of the First Workshop on Neural Machine Translation

@InProceedings{shu-nakayama:2017:NMT,
author = {Shu, Raphael and Nakayama, Hideki},
title = {An Empirical Study of Adequate Vision Span for Attention-Based Neural Machine Translation},
booktitle = {Proceedings of the First Workshop on Neural Machine Translation},
month = {August},
address = {Vancouver},
publisher = {Association for Computational Linguistics},
pages = {1--10},
url = {
http://www.aclweb.org/anthology/W17-3201},
year = 2017
}
Shu and Nakayama (2017)
Overcoming Low Resource
Fadaee, Marzieh and Bisazza, Arianna and Monz, Christof (2017):
Data Augmentation for Low-Resource Neural Machine Translation, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
mentioned in Neural Network Models and Corpus Cleaning
@InProceedings{fadaee-bisazza-monz:2017:Short2,
author = {Fadaee, Marzieh and Bisazza, Arianna and Monz, Christof},
title = {Data Augmentation for Low-Resource Neural Machine Translation},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {July},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages = {567--573},
url = {
http://aclweb.org/anthology/P17-2090},
year = 2017
}
Fadaee et al. (2017)
Adams, Oliver and Makarucha, Adam and Neubig, Graham and Bird, Steven and Cohn, Trevor (2017):
Cross-Lingual Word Embeddings for Low-Resource Language Modeling, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers

@InProceedings{adams-EtAl:2017:EACLlong,
author = {Adams, Oliver and Makarucha, Adam and Neubig, Graham and Bird, Steven and Cohn, Trevor},
title = {Cross-Lingual Word Embeddings for Low-Resource Language Modeling},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {937--947},
url = {
http://www.aclweb.org/anthology/E17-1088},
year = 2017
}
Adams et al. (2017)
Chen, Yun and Liu, Yang and Cheng, Yong and Li, Victor O.K. (2017):
A Teacher-Student Framework for Zero-Resource Neural Machine Translation, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
mentioned in Neural Network Models and Training
@InProceedings{chen-EtAl:2017:Long5,
author = {Chen, Yun and Liu, Yang and Cheng, Yong and Li, Victor O.K.},
title = {A Teacher-Student Framework for Zero-Resource Neural Machine Translation},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {July},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages = {1925--1935},
url = {
http://aclweb.org/anthology/P17-1176},
year = 2017
}
Chen et al. (2017)
Zhang, Jiajun and Zong, Chengqing (2016):
Exploiting Source-side Monolingual Data in Neural Machine Translation, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing

@InProceedings{zhang-zong:2016:EMNLP2016,
author = {Zhang, Jiajun and Zong, Chengqing},
title = {Exploiting Source-side Monolingual Data in Neural Machine Translation},
booktitle = {Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing},
month = {November},
address = {Austin, Texas},
publisher = {Association for Computational Linguistics},
pages = {1535--1545},
url = {
https://aclweb.org/anthology/D16-1160},
year = 2016
}
Zhang and Zong (2016)
System Descriptions (incomplete)
Junczys-Dowmunt, Marcin and Dwojak, Tomasz and Sennrich, Rico (2016):
The AMU-UEDIN Submission to the WMT16 News Translation Task: Attention-based NMT Models as Feature Functions in Phrase-based SMT, Proceedings of the First Conference on Machine Translation

@InProceedings{junczysdowmunt-dwojak-sennrich:2016:WMT,
author = {Junczys-Dowmunt, Marcin and Dwojak, Tomasz and Sennrich, Rico},
title = {The AMU-UEDIN Submission to the WMT16 News Translation Task: Attention-based {NMT} Models as Feature Functions in Phrase-based SMT},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {319--325},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2316},
year = 2016
}
Junczys-Dowmunt et al. (2016)
Chung, Junyoung and Cho, Kyunghyun and Bengio, Yoshua (2016):
NYU-MILA Neural Machine Translation Systems for WMT'16, Proceedings of the First Conference on Machine Translation

@InProceedings{chung-cho-bengio:2016:WMT,
author = {Chung, Junyoung and Cho, Kyunghyun and Bengio, Yoshua},
title = {NYU-MILA Neural Machine Translation Systems for WMT'16},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {268--271},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2309},
year = 2016
}
Chung et al. (2016)
Rodríguez Guasch, Sergio and Costa-jussà, Marta R. (2016):
WMT 2016 Multimodal Translation System Description based on Bidirectional Recurrent Neural Networks with Double-Embeddings, Proceedings of the First Conference on Machine Translation

@InProceedings{rodriguezguasch-costajussa:2016:WMT,
author = {Rodr\'{i}guez Guasch, Sergio and Costa-juss\`{a}, Marta R.},
title = {WMT 2016 Multimodal Translation System Description based on Bidirectional Recurrent Neural Networks with Double-Embeddings},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {655--659},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2362},
year = 2016
}
Guasch and Costa-jussà (2016)
Sánchez-Cartagena, Víctor M. and Toral, Antonio (2016):
Abu-MaTran at WMT 2016 Translation Task: Deep Learning, Morphological Segmentation and Tuning on Character Sequences, Proceedings of the First Conference on Machine Translation

@InProceedings{sanchezcartagena-toral:2016:WMT,
author = {S\'{a}nchez-Cartagena, V\'{i}ctor M. and Toral, Antonio},
title = {Abu-MaTran at WMT 2016 Translation Task: Deep Learning, Morphological Segmentation and Tuning on Character Sequences},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {362--370},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2322},
year = 2016
}
Sánchez-Cartagena and Toral (2016)
Bradbury, James and Socher, Richard (2016):
MetaMind Neural Machine Translation System for WMT 2016, Proceedings of the First Conference on Machine Translation

@InProceedings{bradbury-socher:2016:WMT,
author = {Bradbury, James and Socher, Richard},
title = {MetaMind Neural Machine Translation System for WMT 2016},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {264--267},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2308},
year = 2016
}
Bradbury and Socher (2016)
Other
Mallinson, Jonathan and Sennrich, Rico and Lapata, Mirella (2017):
Paraphrasing Revisited with Neural Machine Translation, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers

@InProceedings{mallinson-sennrich-lapata:2017:EACLlong,
author = {Mallinson, Jonathan and Sennrich, Rico and Lapata, Mirella},
title = {Paraphrasing Revisited with Neural Machine Translation},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {881--893},
url = {
http://www.aclweb.org/anthology/E17-1083},
year = 2017
}
Mallinson et al. (2017)
Jakubina, Laurent and Langlais, Philippe (2017):
Reranking Translation Candidates Produced by Several Bilingual Word Similarity Sources, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers

@InProceedings{jakubina-langlais:2017:EACLshort,
author = {Jakubina, Laurent and Langlais, Philippe},
title = {Reranking Translation Candidates Produced by Several Bilingual Word Similarity Sources},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {605--611},
url = {
http://www.aclweb.org/anthology/E17-2096},
year = 2017
}
Jakubina and Langlais (2017)
Östling, Robert and Tiedemann, Jörg (2017):
Continuous multilinguality with language vectors, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers

@InProceedings{ostling-tiedemann:2017:EACLshort,
author = {\"{O}stling, Robert and Tiedemann, J\"{o}rg},
title = {Continuous multilinguality with language vectors},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {644--649},
url = {
http://www.aclweb.org/anthology/E17-2102},
year = 2017
}
Östling and Tiedemann (2017)
Yang, Jie and Zhang, Yue and Dong, Fei (2017):
Neural Word Segmentation with Rich Pretraining, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

@InProceedings{yang-zhang-dong:2017:Long,
author = {Yang, Jie and Zhang, Yue and Dong, Fei},
title = {Neural Word Segmentation with Rich Pretraining},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {July},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages = {839--849},
url = {
http://aclweb.org/anthology/P17-1078},
year = 2017
}
Yang et al. (2017)
Zhang, Jiacheng and Liu, Yang and Luan, Huanbo and Xu, Jingfang and Sun, Maosong (2017):
Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

@InProceedings{zhang-EtAl:2017:Long2,
author = {Zhang, Jiacheng and Liu, Yang and Luan, Huanbo and Xu, Jingfang and Sun, Maosong},
title = {Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {July},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages = {1514--1523},
url = {
http://aclweb.org/anthology/P17-1139},
year = 2017
}
Zhang et al. (2017)
Marie, Benjamin and Fujita, Atsushi (2017):
Efficient Extraction of Pseudo-Parallel Sentences from Raw Monolingual Data Using Word Embeddings, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

@InProceedings{marie-fujita:2017:Short,
author = {Marie, Benjamin and Fujita, Atsushi},
title = {Efficient Extraction of Pseudo-Parallel Sentences from Raw Monolingual Data Using Word Embeddings},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {July},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages = {392--398},
url = {
http://aclweb.org/anthology/P17-2062},
year = 2017
}
Marie and Fujita (2017)
See, Abigail and Luong, Minh-Thang and Manning, Christopher D. (2016):
Compression of Neural Machine Translation Models via Pruning, Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning

@InProceedings{see-luong-manning:2016:CoNLL,
author = {See, Abigail and Luong, Minh-Thang and Manning, Christopher D.},
title = {Compression of Neural Machine Translation Models via Pruning},
booktitle = {Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {291--301},
url = {
http://www.aclweb.org/anthology/K16-1029},
year = 2016
}
See et al. (2016)
Zhang, Biao and Xiong, Deyi and Su, Jinsong and Duan, Hong and Zhang, Min (2016):
Bilingual Autoencoders with Global Descriptors for Modeling Parallel Sentences, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers

@InProceedings{zhang-EtAl:2016:COLING5,
author = {Zhang, Biao and Xiong, Deyi and Su, Jinsong and Duan, Hong and Zhang, Min},
title = {Bilingual Autoencoders with Global Descriptors for Modeling Parallel Sentences},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {2548--2558},
url = {
http://aclweb.org/anthology/C16-1240},
year = 2016
}
Zhang et al. (2016)
Pal, Santanu and Naskar, Sudip Kumar and Vela, Mihaela and van Genabith, Josef (2016):
A Neural Network based Approach to Automatic Post-Editing, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

@InProceedings{pal-EtAl:2016:P16-2,
author = {Pal, Santanu and Naskar, Sudip Kumar and Vela, Mihaela and van Genabith, Josef},
title = {A Neural Network based Approach to Automatic Post-Editing},
booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {281--286},
url = {
http://anthology.aclweb.org/P16-2046},
year = 2016
}
Pal et al. (2016)
Duong, Long and Anastasopoulos, Antonios and Chiang, David and Bird, Steven and Cohn, Trevor (2016):
An Attentional Model for Speech Translation Without Transcription, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

@InProceedings{duong-EtAl:2016:N16-1,
author = {Duong, Long and Anastasopoulos, Antonios and Chiang, David and Bird, Steven and Cohn, Trevor},
title = {An Attentional Model for Speech Translation Without Transcription},
booktitle = {Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {San Diego, California},
publisher = {Association for Computational Linguistics},
pages = {949--959},
url = {
http://www.aclweb.org/anthology/N16-1109},
year = 2016
}
Duong et al. (2016)
Jonathan H. Clark and Chris Dyer and Alon Lavie (2014):
Locally Non-Linear Learning for Statistical Machine Translation via Discretization and Structured Regularization, Transactions of the Association for Computational Linguistics (TACL)

@article{tacl14-Clark,
author = {Jonathan H. Clark and Chris Dyer and Alon Lavie},
title = {Locally Non-Linear Learning for Statistical Machine Translation via Discretization and Structured Regularization},
volume = {2},
pages = {393--404},
url = {
http://www.aclweb.org/anthology/Q/Q14/Q14-1031.pdf},
journal = {Transactions of the Association for Computational Linguistics (TACL)},
year = 2014
}
Clark et al. (2014)
Unpublished ArXiv
Mohammad Pezeshki (2015):
Sequence Modeling using Gated Recurrent Neural Networks, CoRR

@article{DBLP:journals/corr/Pezeshki15,
author = {Mohammad Pezeshki},
title = {Sequence Modeling using Gated Recurrent Neural Networks},
journal = {CoRR},
volume = {abs/1501.00299},
url = {
http://arxiv.org/abs/1501.00299},
timestamp = {Mon, 02 Feb 2015 14:12:25 +0100},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/Pezeshki15},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2015
}
Pezeshki (2015)
Will Williams and Niranjani Prasad and David Mrva and Tom Ash and Tony Robinson (2015):
Scaling Recurrent Neural Network Language Models, CoRR

@article{DBLP:journals/corr/WilliamsPMAR15,
author = {Will Williams and Niranjani Prasad and David Mrva and Tom Ash and Tony Robinson},
title = {Scaling Recurrent Neural Network Language Models},
journal = {CoRR},
volume = {abs/1502.00512},
url = {
http://arxiv.org/abs/1502.00512},
timestamp = {Mon, 02 Mar 2015 14:17:34 +0100},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/WilliamsPMAR15},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2015
}
Williams et al. (2015)
Jiajun Zhang (2015):
Local Translation Prediction with Global Sentence Representation, CoRR

@article{DBLP:journals/corr/Zhang15c,
author = {Jiajun Zhang},
title = {Local Translation Prediction with Global Sentence Representation},
journal = {CoRR},
volume = {abs/1502.07920},
url = {
http://arxiv.org/abs/1502.07920},
timestamp = {Mon, 02 Mar 2015 14:17:34 +0100},
biburl = {
http://dblp.uni-trier.de/rec/bib/journals/corr/Zhang15c},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2015
}
Zhang (2015)
Mingxuan Wang and Zhengdong Lu and Hang Li and Wenbin Jiang and Qun Liu (2015):
genCNN: A Convolutional Architecture for Word Sequence Prediction

@techreport{Wang:2015:unpublished,
author = {Mingxuan Wang and Zhengdong Lu and Hang Li and Wenbin Jiang and Qun Liu},
title = {genCNN: A Convolutional Architecture for Word Sequence Prediction},
url = {
http://arxiv.org/pdf/1503.05034.pdf},
year = 2015
}
Wang et al. (2015)
Zhaopeng Tu and Baotian Hu and Zhengdong Lu and Hang Li (2015):
Context-Dependent Translation Selection Using Convolutional Neural Network

@techreport{Tu:2015:unpublished,
author = {Zhaopeng Tu and Baotian Hu and Zhengdong Lu and Hang Li},
title = {Context-Dependent Translation Selection Using Convolutional Neural Network},
url = {
http://arxiv.org/pdf/1503.02357v1.pdf},
year = 2015
}
Tu et al. (2015)
Shujian Huang and Huadong Chen and Xinyu Dai and Jiajun Chen (2015):
Non-linear Learning for Statistical Machine Translation

@techreport{Huang:2015:unpublished,
author = {Shujian Huang and Huadong Chen and Xinyu Dai and Jiajun Chen},
title = {Non-linear Learning for Statistical Machine Translation},
url = {
http://arxiv.org/pdf/1503.00107v1.pdf},
year = 2015
}
Huang et al. (2015)