Neural Components in Statistical Machine Translation
Especially early work on neural networks for machine translation was aimed at building neural components to be used in traditional statistical machine translation systems.
Neural Components In Statistical Machine Translation is the main subject of 43 publications. 10 are discussed here.
Publications
Translation Models: By including aligned source words in the conditioning context,
Devlin, Jacob and Zbib, Rabih and Huang, Zhongqiang and Lamar, Thomas and Schwartz, Richard and Makhoul, John (2014):
Fast and Robust Neural Network Joint Models for Statistical Machine Translation, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{devlin-EtAl:2014:P14-1,
author = {Devlin, Jacob and Zbib, Rabih and Huang, Zhongqiang and Lamar, Thomas and Schwartz, Richard and Makhoul, John},
title = {Fast and Robust Neural Network Joint Models for Statistical Machine Translation},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {1370--1380},
url = {
http://www.aclweb.org/anthology/P14-1129},
year = 2014
}
Devlin et al. (2014) enrich a feed-forward neural network language model with source context
Jiajun Zhang and Dakun Zhang and Jie Hao (2015):
Local Translation Prediction with Global Sentence Representation, Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI)
@inproceedings{JiajunZhang:2015:ijcai,
author = {Jiajun Zhang and Dakun Zhang and Jie Hao},
title = {Local Translation Prediction with Global Sentence Representation},
pages = {1398--1404},
booktitle = {Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI)},
url = {
http://ijcai.org/papers15/Papers/IJCAI15-201.pdf},
location = {Buenos Aires, Argentina},
year = 2015
}
Zhang et al. (2015) add a sentence embedding to the conditional context of this model, which are learned using a variant of convolutional neural networks and mapping them across languages.
Meng, Fandong and Lu, Zhengdong and Wang, Mingxuan and Li, Hang and Jiang, Wenbin and Liu, Qun (2015):
Encoding Source Language with Convolutional Neural Network for Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
mentioned in Research Groups and Neural Components In Statistical Machine Translation@InProceedings{meng-EtAl:2015:ACL-IJCNLP,
author = {Meng, Fandong and Lu, Zhengdong and Wang, Mingxuan and Li, Hang and Jiang, Wenbin and Liu, Qun},
title = {Encoding Source Language with Convolutional Neural Network for Machine Translation},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {20--30},
url = {
http://www.aclweb.org/anthology/P15-1003},
year = 2015
}
Meng et al. (2015) use a more complex convolutional neural network to encode the input sentence that uses gated layers and also incorporates information about the output context.
Reordering Models: Lexicalized reordering models struggle with sparse data problems when conditioned on rich context.
Li, Peng and Liu, Yang and Sun, Maosong and Izuha, Tatsuya and Zhang, Dakun (2014):
A Neural Reordering Model for Phrase-based Translation, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
@InProceedings{li-EtAl:2014:Coling3,
author = {Li, Peng and Liu, Yang and Sun, Maosong and Izuha, Tatsuya and Zhang, Dakun},
title = {A Neural Reordering Model for Phrase-based Translation},
booktitle = {Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers},
month = {August},
address = {Dublin, Ireland},
publisher = {Dublin City University and Association for Computational Linguistics},
pages = {1897--1907},
url = {
http://www.aclweb.org/anthology/C14-1179},
year = 2014
}
Li et al. (2014) show that a neural reordering model can be conditioned on current and previous phrase pair (encoded with a recursive neural network auto-encoder) to make the same classification decisions for orientation type.
Pre-Ordering: Instead of handing reordering within the decoding process, we may pre-order the input sentence into output word order.
de Gispert, Adrià and Iglesias, Gonzalo and Byrne, Bill (2015):
Fast and Accurate Preordering for SMT using Neural Networks, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
mentioned in Syntactic Prereordering and Neural Components In Statistical Machine Translation@InProceedings{degispert-iglesias-byrne:2015:NAACL-HLT,
author = {de Gispert, Adri\`{a} and Iglesias, Gonzalo and Byrne, Bill},
title = {Fast and Accurate Preordering for {SMT} using Neural Networks},
booktitle = {Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {May--June},
address = {Denver, Colorado},
publisher = {Association for Computational Linguistics},
pages = {1012--1017},
url = {
http://www.aclweb.org/anthology/N15-1105},
year = 2015
}
Gispert et al. (2015) use an input dependency tree to learn a model that swaps children nodes and implement it using a feed-forward neural network.
Miceli Barone, Antonio Valerio and Attardi, Giuseppe (2015):
Non-projective Dependency-based Pre-Reordering with Recurrent Neural Network for Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
mentioned in Syntactic Prereordering and Neural Components In Statistical Machine Translation@InProceedings{micelibarone-attardi:2015:ACL-IJCNLP,
author = {Miceli Barone, Antonio Valerio and Attardi, Giuseppe},
title = {Non-projective Dependency-based Pre-Reordering with Recurrent Neural Network for Machine Translation},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {846--856},
url = {
http://www.aclweb.org/anthology/P15-1082},
year = 2015
}
Barone and Attardi (2015) formulate a top-down left-to-right walk through the dependency tree and make reordering decisions at any node. They model this process with a recurrent neural network that includes past decisions in the conditioning context.
N-Gram Translation Models: An alternative view of the phrase based translation model is to break up phrase translations into minimal translation units, and employing a n-gram model over these units to condition each minimal translation units on the previous ones.
Schwenk, Holger and Costa-jussa, Marta Ruiz and Fonollosa, Jose A. R. (2007):
Smooth Bilingual N-Gram Translation, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)
mentioned in Smoothing and Neural Components In Statistical Machine Translation@InProceedings{schwenk-rcostajussa-rfonollosa:2007:EMNLP-CoNLL2007,
author = {Schwenk, Holger and Costa-jussa, Marta Ruiz and Fonollosa, Jose A. R.},
title = {Smooth Bilingual $N$-Gram Translation},
booktitle = {Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)},
pages = {430--438},
url = {
http://www.aclweb.org/anthology/D/D07/D07-1045},
year = 2007
}
Schwenk et al. (2007) treat each minimal translation unit as an atomic symbol and train a neural language model over it. Alternatively,
Hu, Yuening and Auli, Michael and Gao, Qin and Gao, Jianfeng (2014):
Minimum Translation Modeling with Recurrent Neural Networks, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
@InProceedings{hu-EtAl:2014:EACL,
author = {Hu, Yuening and Auli, Michael and Gao, Qin and Gao, Jianfeng},
title = {Minimum Translation Modeling with Recurrent Neural Networks},
booktitle = {Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {20--29},
url = {
http://www.aclweb.org/anthology/E14-1003},
year = 2014
}
(Hu et al., 2014) represent the minimal translation units as bag of words,
Wu, Haiyang and Dong, Daxiang and Hu, Xiaoguang and Yu, Dianhai and He, Wei and Wu, Hua and Wang, Haifeng and Liu, Ting (2014):
Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
@InProceedings{wu-EtAl:2014:EMNLP2014,
author = {Wu, Haiyang and Dong, Daxiang and Hu, Xiaoguang and Yu, Dianhai and He, Wei and Wu, Hua and Wang, Haifeng and Liu, Ting},
title = {Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {142--146},
url = {
http://www.aclweb.org/anthology/D14-1015},
year = 2014
}
(Wu et al., 2014) break them even further into single input words, single output words, or single input-output word pairs, and
Yu, Heng and Zhu, Xuan (2015):
Recurrent Neural Network based Rule Sequence Model for Statistical Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
@InProceedings{yu-zhu:2015:ACL-IJCNLP,
author = {Yu, Heng and Zhu, Xuan},
title = {Recurrent Neural Network based Rule Sequence Model for Statistical Machine Translation},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {132--138},
url = {
http://www.aclweb.org/anthology/P15-2022},
year = 2015
}
Yu and Zhu (2015) use phrase embeddings leaned with an auto-encoder.
Benchmarks
Discussion
Related Topics
New Publications
Nadir Durrani and Fahim Dalvi (2017):
Continuous Space Reordering Models for Phrase-based MT, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{IWSLT2017:Durrani,
author = {Nadir Durrani and Fahim Dalvi},
title = {Continuous Space Reordering Models for Phrase-based {MT}},
url = {
http://workshop2017.iwslt.org/downloads/O03-1-Paper.pdf},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Tokyo, Japan},
year = 2017
}
Durrani and Dalvi (2017)
Neural Models as Statistical Machine Translation Components
Wang, Longyue and Tu, Zhaopeng and Zhang, Xiaojun and Li, Hang and Way, Andy and Liu, Qun (2016):
A Novel Approach to Dropped Pronoun Translation, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
mentioned in Document Level Aspects and Neural Components In Statistical Machine Translation@InProceedings{wang-EtAl:2016:N16-13,
author = {Wang, Longyue and Tu, Zhaopeng and Zhang, Xiaojun and Li, Hang and Way, Andy and Liu, Qun},
title = {A Novel Approach to Dropped Pronoun Translation},
booktitle = {Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {San Diego, California},
publisher = {Association for Computational Linguistics},
pages = {983--993},
url = {
http://www.aclweb.org/anthology/N16-1113},
year = 2016
}
Wang et al. (2016)
Wang, Longyue and Tu, Zhaopeng and Zhang, Xiaojun and Liu, Siyou and Li, Hang and Way, Andy and Liu, Qun (2017):
A novel and robust approach for pro-drop language translation, Machine Translation
mentioned in Document Level Aspects and Neural Components In Statistical Machine Translation@Article{Wang2017,
author = {Wang, Longyue and Tu, Zhaopeng and Zhang, Xiaojun and Liu, Siyou and Li, Hang and Way, Andy and Liu, Qun},
title = {A novel and robust approach for pro-drop language translation},
journal = {Machine Translation},
pages = {1--23},
issn = {1573-0573},
doi = {10.1007/s10590-016-9184-9},
url = {
http://dx.doi.org/10.1007/s10590-016-9184-9},
year = 2017
}
Wang et al. (2017)
Rico Sennrich (2015):
Modelling and Optimizing on Syntactic N-Grams for Statistical Machine Translation, Transactions of the Association for Computational Linguistics
@article{Sennrich:2015:TACL,
author = {Rico Sennrich},
title = {Modelling and Optimizing on Syntactic N-Grams for Statistical Machine Translation},
journal = {Transactions of the Association for Computational Linguistics},
volume = {3},
issn = {2307-387X},
url = {
https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/510},
pages = {169--182},
year = 2015
}
Sennrich (2015)
Zhang, Jingyi and Utiyama, Masao and Sumita, Eiichro and Neubig, Graham and Nakamura, Satoshi (2016):
A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{zhang-EtAl:2016:P16-11,
author = {Zhang, Jingyi and Utiyama, Masao and Sumita, Eiichro and Neubig, Graham and Nakamura, Satoshi},
title = {A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation},
booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {1372--1381},
url = {
http://www.aclweb.org/anthology/P16-1130},
year = 2016
}
Zhang et al. (2016)
Peter, Jan-Thorsten and Wang, Weiyue and Ney, Hermann (2016):
Exponentially Decaying Bag-of-Words Input Features for Feed-Forward Neural Network in Statistical Machine Translation, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{peter-wang-ney:2016:P16-2,
author = {Peter, Jan-Thorsten and Wang, Weiyue and Ney, Hermann},
title = {Exponentially Decaying Bag-of-Words Input Features for Feed-Forward Neural Network in Statistical Machine Translation},
booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {293--298},
url = {
http://anthology.aclweb.org/P16-2048},
year = 2016
}
Peter et al. (2016)
Zhang, Jingyi and Utiyama, Masao and Sumita, Eiichiro and Neubig, Graham and Nakamura, Satoshi (2015):
A Binarized Neural Network Joint Model for Machine Translation, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
@InProceedings{zhang-EtAl:2015:EMNLP3,
author = {Zhang, Jingyi and Utiyama, Masao and Sumita, Eiichiro and Neubig, Graham and Nakamura, Satoshi},
title = {A Binarized Neural Network Joint Model for Machine Translation},
booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {2094--2099},
url = {
http://aclweb.org/anthology/D15-1250},
year = 2015
}
Zhang et al. (2015)
Setiawan, Hendra and Huang, Zhongqiang and Devlin, Jacob and Lamar, Thomas and Zbib, Rabih and Schwartz, Richard and Makhoul, John (2015):
Statistical Machine Translation Features with Multitask Tensor Networks, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
@InProceedings{setiawan-EtAl:2015:ACL-IJCNLP,
author = {Setiawan, Hendra and Huang, Zhongqiang and Devlin, Jacob and Lamar, Thomas and Zbib, Rabih and Schwartz, Richard and Makhoul, John},
title = {Statistical Machine Translation Features with Multitask Tensor Networks},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {31--41},
url = {
http://www.aclweb.org/anthology/P15-1004},
year = 2015
}
Setiawan et al. (2015)
Lu, Shixiang and Chen, Zhenbiao and Xu, Bo (2014):
Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machine Translation, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{lu-chen-xu:2014:P14-1,
author = {Lu, Shixiang and Chen, Zhenbiao and Xu, Bo},
title = {Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machine Translation},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {122--132},
url = {
http://www.aclweb.org/anthology/P14-1012},
year = 2014
}
Lu et al. (2014)
Zhai, Feifei and Zhang, Jiajun and Zhou, Yu and Zong, Chengqing (2014):
RNN-based Derivation Structure Prediction for SMT, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{zhai-EtAl:2014:P14-2,
author = {Zhai, Feifei and Zhang, Jiajun and Zhou, Yu and Zong, Chengqing},
title = {RNN-based Derivation Structure Prediction for SMT},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {779--784},
url = {
http://www.aclweb.org/anthology/P14-2126},
year = 2014
}
Zhai et al. (2014)
Liu, Lemao and Watanabe, Taro and Sumita, Eiichiro and Zhao, Tiejun (2013):
Additive Neural Networks for Statistical Machine Translation, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{liu-EtAl:2013:ACL20131,
author = {Liu, Lemao and Watanabe, Taro and Sumita, Eiichiro and Zhao, Tiejun},
title = {Additive Neural Networks for Statistical Machine Translation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {791--801},
url = {
http://www.aclweb.org/anthology/P13-1078},
year = 2013
}
Liu et al. (2013)
Liu, Shujie and Yang, Nan and Li, Mu and Zhou, Ming (2014):
A Recursive Recurrent Neural Network for Statistical Machine Translation, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{liu-EtAl:2014:P14-1,
author = {Liu, Shujie and Yang, Nan and Li, Mu and Zhou, Ming},
title = {A Recursive Recurrent Neural Network for Statistical Machine Translation},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {1491--1500},
url = {
http://www.aclweb.org/anthology/P14-1140},
year = 2014
}
Liu et al. (2014)
Reordering Models
Kanouchi, Shin and Sudoh, Katsuhito and Komachi, Mamoru (2016):
Neural Reordering Model Considering Phrase Translation and Word Alignment for Phrase-based Translation, Proceedings of the 3rd Workshop on Asian Translation (WAT2016)
@InProceedings{kanouchi-sudoh-komachi:2016:WAT2016,
author = {Kanouchi, Shin and Sudoh, Katsuhito and Komachi, Mamoru},
title = {Neural Reordering Model Considering Phrase Translation and Word Alignment for Phrase-based Translation},
booktitle = {Proceedings of the 3rd Workshop on Asian Translation (WAT2016)},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {94--103},
url = {
http://aclweb.org/anthology/W16-4607},
year = 2016
}
Kanouchi et al. (2016)
Cui, Yiming and Wang, Shijin and Li, Jianfeng (2016):
LSTM Neural Reordering Feature for Statistical Machine Translation, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{cui-wang-li:2016:N16-1,
author = {Cui, Yiming and Wang, Shijin and Li, Jianfeng},
title = {LSTM Neural Reordering Feature for Statistical Machine Translation},
booktitle = {Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {San Diego, California},
publisher = {Association for Computational Linguistics},
pages = {977--982},
url = {
http://www.aclweb.org/anthology/N16-1112},
year = 2016
}
Cui et al. (2016)
Translation Models
Stahlberg, Felix and Hasler, Eva and Waite, Aurelien and Byrne, Bill (2016):
Syntactically Guided Neural Machine Translation, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{stahlberg-EtAl:2016:P16-2,
author = {Stahlberg, Felix and Hasler, Eva and Waite, Aurelien and Byrne, Bill},
title = {Syntactically Guided Neural Machine Translation},
booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {299--305},
url = {
http://anthology.aclweb.org/P16-2049},
year = 2016
}
Stahlberg et al. (2016)
Sundermeyer, Martin and Alkhouli, Tamer and Wuebker, Joern and Ney, Hermann (2014):
Translation Modeling with Bidirectional Recurrent Neural Networks, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
@InProceedings{sundermeyer-EtAl:2014:EMNLP2014,
author = {Sundermeyer, Martin and Alkhouli, Tamer and Wuebker, Joern and Ney, Hermann},
title = {Translation Modeling with Bidirectional Recurrent Neural Networks},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {14--25},
url = {
http://www.aclweb.org/anthology/D14-1003},
year = 2014
}
Sundermeyer et al. (2014)
Schwenk, Holger (2012):
Continuous Space Translation Models for Phrase-Based Statistical Machine Translation, Proceedings of COLING 2012: Posters
mentioned in Smoothing and Neural Components In Statistical Machine Translation@InProceedings{schwenk:2012:POSTERS,
author = {Schwenk, Holger},
title = {Continuous Space Translation Models for Phrase-Based Statistical Machine Translation},
booktitle = {Proceedings of COLING 2012: Posters},
month = {December},
address = {Mumbai, India},
publisher = {The COLING 2012 Organizing Committee},
pages = {1071--1080},
url = {
http://www.aclweb.org/anthology/C12-2104},
year = 2012
}
Schwenk (2012)
Le, Hai-Son and Allauzen, Alexandre and Yvon, François (2012):
Continuous Space Translation Models with Neural Networks, Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
mentioned in Phrase Based Model Refinement and Neural Components In Statistical Machine Translation@InProceedings{le-allauzen-yvon:2012:NAACL-HLT,
author = {Le, Hai-Son and Allauzen, Alexandre and Yvon, Fran\c{c}ois},
title = {Continuous Space Translation Models with Neural Networks},
booktitle = {Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Montr\'{e}al, Canada},
publisher = {Association for Computational Linguistics},
pages = {39--48},
url = {
http://www.aclweb.org/anthology/N12-1005},
year = 2012
}
Le et al. (2012)
Addanki, Karteek and Wu, Dekai (2014):
Transduction Recursive Auto-Associative Memory: Learning Bilingual Compositional Distributed Vector Representations of Inversion Transduction Grammars, Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation
@InProceedings{addanki-wu:2014:SSST-8,
author = {Addanki, Karteek and Wu, Dekai},
title = {Transduction Recursive Auto-Associative Memory: Learning Bilingual Compositional Distributed Vector Representations of Inversion Transduction Grammars},
booktitle = {Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {112--121},
url = {
http://www.aclweb.org/anthology/W14-4013},
year = 2014
}
Addanki and Wu (2014)
Quoc Khanh Do and Alexandre Allauzen and François Yvon (2014):
Discriminative Adaptation of Continuous Space Translation Models, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{Khanh:iwslt:2014,
author = {Quoc Khanh Do and Alexandre Allauzen and Fran{\,c}ois Yvon},
title = {Discriminative Adaptation of Continuous Space Translation Models},
pages = {192--199},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2014
}
Do et al. (2014)
Li, Peng and Liu, Yang and Sun, Maosong (2013):
Recursive Autoencoders for ITG-Based Translation, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
@InProceedings{li-liu-sun:2013:EMNLP,
author = {Li, Peng and Liu, Yang and Sun, Maosong},
title = {Recursive Autoencoders for {ITG}-Based Translation},
booktitle = {Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Seattle, Washington, USA},
publisher = {Association for Computational Linguistics},
pages = {567--577},
url = {
http://www.aclweb.org/anthology/D13-1054},
year = 2013
}
Li et al. (2013)
Wu, Youzheng and Watanabe, Taro and Hori, Chiori (2014):
Recurrent Neural Network-based Tuple Sequence Model for Machine Translation, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
@InProceedings{wu-watanabe-hori:2014:Coling,
author = {Wu, Youzheng and Watanabe, Taro and Hori, Chiori},
title = {Recurrent Neural Network-based Tuple Sequence Model for Machine Translation},
booktitle = {Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers},
month = {August},
address = {Dublin, Ireland},
publisher = {Dublin City University and Association for Computational Linguistics},
pages = {1908--1917},
url = {
http://www.aclweb.org/anthology/C14-1180},
year = 2014
}
Wu et al. (2014)
Auli, Michael and Galley, Michel and Quirk, Chris and Zweig, Geoffrey (2013):
Joint Language and Translation Modeling with Recurrent Neural Networks, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
@InProceedings{auli-EtAl:2013:EMNLP,
author = {Auli, Michael and Galley, Michel and Quirk, Chris and Zweig, Geoffrey},
title = {Joint Language and Translation Modeling with Recurrent Neural Networks},
booktitle = {Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Seattle, Washington, USA},
publisher = {Association for Computational Linguistics},
pages = {1044--1054},
url = {
http://www.aclweb.org/anthology/D13-1106},
year = 2013
}
Auli et al. (2013)
Word Alignment
Legrand, Joël and Auli, Michael and Collobert, Ronan (2016):
Neural Network-based Word Alignment through Score Aggregation, Proceedings of the First Conference on Machine Translation
@InProceedings{legrand-auli-collobert:2016:WMT,
author = {Legrand, Jo\"{e}l and Auli, Michael and Collobert, Ronan},
title = {Neural Network-based Word Alignment through Score Aggregation},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {66--73},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2207},
year = 2016
}
Legrand et al. (2016)
Jalili Sabet, Masoud and Faili, Heshaam and Haffari, Gholamreza (2016):
Improving Word Alignment of Rare Words with Word Embeddings, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
@InProceedings{jalilisabet-faili-haffari:2016:COLING,
author = {Jalili Sabet, Masoud and Faili, Heshaam and Haffari, Gholamreza},
title = {Improving Word Alignment of Rare Words with Word Embeddings},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {3209--3215},
url = {
http://aclweb.org/anthology/C16-1302},
year = 2016
}
Sabet et al. (2016)
Tamura, Akihiro and Watanabe, Taro and Sumita, Eiichiro (2014):
Recurrent Neural Networks for Word Alignment Model, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{tamura-watanabe-sumita:2014:P14-1,
author = {Tamura, Akihiro and Watanabe, Taro and Sumita, Eiichiro},
title = {Recurrent Neural Networks for Word Alignment Model},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {1470--1480},
url = {
http://www.aclweb.org/anthology/P14-1138},
year = 2014
}
Tamura et al. (2014)
Yang, Nan and Liu, Shujie and Li, Mu and Zhou, Ming and Yu, Nenghai (2013):
Word Alignment Modeling with Context Dependent Deep Neural Network, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{yang-EtAl:2013:ACL2013,
author = {Yang, Nan and Liu, Shujie and Li, Mu and Zhou, Ming and Yu, Nenghai},
title = {Word Alignment Modeling with Context Dependent Deep Neural Network},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {166--175},
url = {
http://www.aclweb.org/anthology/P13-1017},
year = 2013
}
Yang et al. (2013)
Morphology
Soricut, Radu and Och, Franz (2015):
Unsupervised Morphology Induction Using Word Embeddings, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{soricut-och:2015:NAACL-HLT,
author = {Soricut, Radu and Och, Franz},
title = {Unsupervised Morphology Induction Using Word Embeddings},
booktitle = {Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {May--June},
address = {Denver, Colorado},
publisher = {Association for Computational Linguistics},
pages = {1627--1637},
url = {
http://www.aclweb.org/anthology/N15-1186},
year = 2015
}
Soricut and Och (2015)
Tran, Ke M. and Bisazza, Arianna and Monz, Christof (2014):
Word Translation Prediction for Morphologically Rich Languages with Bilingual Neural Networks, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
@InProceedings{tran-bisazza-monz:2014:EMNLP2014,
author = {Tran, Ke M. and Bisazza, Arianna and Monz, Christof},
title = {Word Translation Prediction for Morphologically Rich Languages with Bilingual Neural Networks},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {1676--1688},
url = {
http://www.aclweb.org/anthology/D14-1175},
year = 2014
}
Tran et al. (2014)
Topic Models
Cui, Lei and Zhang, Dongdong and Liu, Shujie and Chen, Qiming and Li, Mu and Zhou, Ming and Yang, Muyun (2014):
Learning Topic Representation for SMT with Neural Networks, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{cui-EtAl:2014:P14-1,
author = {Cui, Lei and Zhang, Dongdong and Liu, Shujie and Chen, Qiming and Li, Mu and Zhou, Ming and Yang, Muyun},
title = {Learning Topic Representation for {SMT} with Neural Networks},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {133--143},
url = {
http://www.aclweb.org/anthology/P14-1013},
year = 2014
}
Cui et al. (2014)
Evaluation Metrics based on Neural Models
Guzmán, Francisco and Bouamor, Houda and Baly, Ramy and Habash, Nizar (2016):
Machine Translation Evaluation for Arabic using Morphologically-enriched Embeddings, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
@InProceedings{guzman-EtAl:2016:COLING,
author = {Guzm\'{a}n, Francisco and Bouamor, Houda and Baly, Ramy and Habash, Nizar},
title = {Machine Translation Evaluation for Arabic using Morphologically-enriched Embeddings},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {1398--1408},
url = {
http://aclweb.org/anthology/C16-1132},
year = 2016
}
Guzmán et al. (2016)
Gupta, Rohit and Orasan, Constantin and van Genabith, Josef (2015):
ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
mentioned in Trained Metrics and Neural Components In Statistical Machine Translation@InProceedings{gupta-orasan-vangenabith:2015:EMNLP,
author = {Gupta, Rohit and Orasan, Constantin and van Genabith, Josef},
title = {ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks},
booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {1066--1072},
url = {
http://aclweb.org/anthology/D15-1124},
year = 2015
}
Gupta et al. (2015)
Guzmán, Francisco and Joty, Shafiq and Màrquez, Lluís and Nakov, Preslav (2015):
Pairwise Neural Machine Translation Evaluation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
@InProceedings{guzman-EtAl:2015:ACL-IJCNLP,
author = {Guzm\'{a}n, Francisco and Joty, Shafiq and M\`{a}rquez, Llu\'{i}s and Nakov, Preslav},
title = {Pairwise Neural Machine Translation Evaluation},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {805--814},
url = {
http://www.aclweb.org/anthology/P15-1078},
year = 2015
}
Guzmán et al. (2015)