Decoding for Word Based Models
When decoding was first encountered in word-based models, a quite diverse set of decoding algorithms was explored.
Word Based Decoding is the main subject of 5 publications. 4 are discussed here.
Publications
Several decoding methods for word-based models are compared by
Ulrich Germann and Michael Jahr and Kevin Knight and Daniel Marcu and Kenji Yamada (2001):
Fast Decoding and Optimal Decoding for Machine Translation, Proceedings of the 39th Annual Meeting of the Association of Computational Linguistics (ACL)
@Inproceedings{Germann:2001,
author = {Ulrich Germann and Michael Jahr and Kevin Knight and Daniel Marcu and Kenji Yamada},
title = {Fast Decoding and Optimal Decoding for Machine Translation},
url = {
http://acl.ldc.upenn.edu/acl2001/MAIN/GERMANN.PDF},
booktitle = {Proceedings of the 39th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 2001
}
Germann et al. (2001), who introduce a greedy search
Ulrich Germann (2003):
Greedy Decoding for Statistical Machine Translation in Almost Linear Time, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computation Linguistics (HLT-NAACL)
@InProceedings{Germann:2003,
author = {Ulrich Germann},
title = {Greedy Decoding for Statistical Machine Translation in Almost Linear Time},
url = {
http://acl.ldc.upenn.edu/N/N03/N03-1010.pdf},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computation Linguistics (HLT-NAACL)},
year = 2003
}
(Germann, 2003) and integer programming search method. Search errors of the greedy decoder may be reduces by a better initialization, for instance using an example-based machine translation system for seeding the search
Michael Paul and Eiichiro Sumita and Seiichi Yamamoto (2004):
Example-based Rescoring of Statistical Machine Translation Output, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)
@Inproceedings{Paul:2004,
author = {Michael Paul and Eiichiro Sumita and Seiichi Yamamoto},
title = {Example-based Rescoring of Statistical Machine Translation Output},
url = {
http://acl.ldc.upenn.edu/hlt-naacl2004/shorts/pdf/198\_Paper.pdf},
googlescholar = {13438954660923401880},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)},
year = 2004
}
(Paul et al., 2004). A decoding algorithm based on alternately optimizing alignment (given translation) and translation (given alignment) is proposed by
Udupa, Raghavendra and Faruquie, Tanveer A. and Maji, Hemanta K. (2004):
An Algorithmic Framework for Solving the Decoding Problem in Statistical Machine Translation , Proceedings of Coling 2004
@inproceedings{Udupa:2004,
author = {Udupa, Raghavendra and Faruquie, Tanveer A. and Maji, Hemanta K.},
title = {An Algorithmic Framework for Solving the Decoding Problem in Statistical Machine Translation },
booktitle = {Proceedings of Coling 2004 },
editor = {{}},
month = {Aug 23--Aug 27},
address = {Geneva, Switzerland},
publisher = {COLING},
pages = {631--637},
year = 2004
}
Udupa et al. (2004).
Publications
Benchmarks
Discussion
Related Topics
New Publications
Riedel, Sebastian and Clarke, James (2009):
Revisiting Optimal Decoding for Machine Translation IBM Model 4, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
@InProceedings{riedel-clarke:2009:NAACLHLT09-Short,
author = {Riedel, Sebastian and Clarke, James},
title = {Revisiting Optimal Decoding for Machine Translation IBM Model 4},
booktitle = {Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers},
month = {June},
address = {Boulder, Colorado},
publisher = {Association for Computational Linguistics},
pages = {5--8},
url = {
http://www.aclweb.org/anthology/N/N09/N09-2002},
year = 2009
}
Riedel and Clarke (2009)