Stack Decoding
An important principle for efficient search is the organization of the search space to take advantage of dynamic programming and the early pruning of comparable hypothesis (that are placed in the same stack).
Stack Decoding is the main subject of 15 publications. 11 are discussed here.
Publications
The stack decoding algorithm has it roots in work by
Christoph Tillmann and Stephan Vogel and Hermann Ney and Alex Zubiaga (1997):
A DP-based Search Using Monotone Alignments in Statistical Translation, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)
mentioned in Stack Decoding and Edit Rate Metrics@Inproceedings{Tillmann:1997,
author = {Christoph Tillmann and Stephan Vogel and Hermann Ney and Alex Zubiaga},
title = {A DP-based Search Using Monotone Alignments in Statistical Translation},
url = {
http://acl.ldc.upenn.edu/P/P97/P97-1037.pdf},
googlescholar = {16581080566414205368},
booktitle = {Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1997
}
Tillmann et al. (1997), who describe a dynamic programming algorithm, similar to techniques for hidden Markov models, for monotone decoding of word-based models. Allowing for reordering makes decoding more complex.
Efficient A* search makes the translation of short sentences possible
Franz Josef Och and Nicola Ueffing and Hermann Ney (2001):
An Efficient A* Search Algorithm for Statistical Machine Translation, Workshop on Data-Driven Machine Translation at 39th Annual Meeting of the Association of Computational Linguistics (ACL)
@inproceedings{Och:2001a,
author = {Franz Josef Och and Nicola Ueffing and Hermann Ney},
title = {An Efficient {A}* Search Algorithm for Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/W/W01/W01-1408.pdf?origin=publicationDetail},
googlescholar = {13901395445887072916},
booktitle = {Workshop on Data-Driven Machine Translation at 39th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 2001
}
(Och et al., 2001). However, longer sentences require pruning
Ye-Yi Wang and Alex Waibel (1997):
Decoding Algorithm in Statistical Machine Translation, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)
@Inproceedings{Wang:1997,
author = {Ye-Yi Wang and Alex Waibel},
title = {Decoding Algorithm in Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/P/P97/P97-1047.pdf},
googlescholar = {2733198255365933192},
booktitle = {Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1997
}
Wang and Waibel (1997);
Sonja Nießen and Stephan Vogel and Hermann Ney and Christoph Tillmann (1998):
A DP based Search Algorithm for Statistical Machine Translation, Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)
@Inproceedings{Niessen:1998,
author = {Sonja Nie{\ss}en and Stephan Vogel and Hermann Ney and Christoph Tillmann},
title = {A DP based Search Algorithm for Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/C/C98/C98-2153.pdf},
googlescholar = {17201987687279997101},
booktitle = {Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 1998
}
Nießen et al. (1998) or restrictions on reordering
Christoph Tillmann and Hermann Ney (2000):
Word Re-ordering and DP-based Search in Statistical Machine Translation, Proceedings of the International Conference on Computational Linguistics (COLING)
@InProceedings{Tillmann:2000,
author = {Christoph Tillmann and Hermann Ney},
title = {Word Re-ordering and DP-based Search in Statistical Machine Translation},
url = {
http://www.aclweb.org/anthology/C00-2123},
googlescholar = {9015618188518603000},
booktitle = {Proceedings of the International Conference on Computational Linguistics (COLING)},
year = 2000
}
(Tillmann and Ney, 2000) or both
Christoph Tillmann and Hermann Ney (2003):
Word Reordering and a Dynamic Programming Beam Search Algorithm for Statistical Machine Translation, Computational Linguistics
@Article{Tillmann:2003j,
author = {Christoph Tillmann and Hermann Ney},
title = {Word Reordering and a Dynamic Programming Beam Search Algorithm for Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/J/J03/J03-1005.pdf},
googlescholar = {4047567465539950833},
journal = {Computational Linguistics},
volume = {29},
number = {1},
year = 2003
}
(Tillmann and Ney, 2003).
Ortiz-Martínez, Daniel and García-Varea, Ismael and Casacuberta, Francisco (2006):
Generalized Stack Decoding Algorithms for Statistical Machine Translation, Proceedings on the Workshop on Statistical Machine Translation
@InProceedings{ortizmartinez-garciavarea-casacuberta:2006:WMT,
author = {Ortiz-Mart\'{i}nez, Daniel and Garc\'{i}a-Varea, Ismael and Casacuberta, Francisco},
title = {Generalized Stack Decoding Algorithms for Statistical Machine Translation},
booktitle = {Proceedings on the Workshop on Statistical Machine Translation},
month = {June},
address = {New York City},
publisher = {Association for Computational Linguistics},
pages = {64--71},
url = {
http://www.aclweb.org/anthology/W/W06/W06-3109},
year = 2006
}
Ortiz-Martínez et al. (2006) compare different type of stack decoding with varying number of stacks.
Brian Delaney and Wade Shen and Timothy Anderson (2006):
An efficient graph search decoder for phrase-based statistical machine translation, Proc. of the International Workshop on Spoken Language Translation
@inproceedings{Delaney:2006:IWSLT,
author = {Brian Delaney and Wade Shen and Timothy Anderson},
title = {An efficient graph search decoder for phrase-based statistical machine translation},
url = {
http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_06/papers/slt6\_197.pdf},
googlescholar = {9663394719199124569},
month = {November},
booktitle = {Proc. of the International Workshop on Spoken Language Translation},
address = {Kyoto, Japan},
year = 2006
}
Delaney et al. (2006) speed up stack decoding with A* pruning.
Robert C. Moore and Chris Quirk (2007):
Faster Beam-Search Decoding for Phrasal Statistical Machine Translation, Proceedings of the MT Summit XI
@inproceedings{Moore:2007:MTSummit,
author = {Robert C. Moore and Chris Quirk},
title = {Faster Beam-Search Decoding for Phrasal Statistical Machine Translation},
url = {
http://mt-archive.info/MTS-2007-Moore.pdf},
googlescholar = {16008390157426171074},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
Moore and Quirk (2007) stress the importance of threshold pruning and the avoidance of expanding doomed hypotheses
Robert C. Moore and Chris Quirk (2007):
Faster Beam-Search Decoding for Phrasal Statistical Machine Translation, Proceedings of the MT Summit XI
@inproceedings{Moore:2007:MTSummit,
author = {Robert C. Moore and Chris Quirk},
title = {Faster Beam-Search Decoding for Phrasal Statistical Machine Translation},
url = {
http://mt-archive.info/MTS-2007-Moore.pdf},
googlescholar = {16008390157426171074},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
(Moore and Quirk, 2007).
The rest cost (or future cost) estimation may be improved by taking expected reordering costs into account
Green, Spence and Galley, Michel and Manning, Christopher D. (2010):
Improved Models of Distortion Cost for Statistical Machine Translation, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
mentioned in Reordering Models and Stack Decoding@InProceedings{green-galley-manning:2010:NAACLHLT,
author = {Green, Spence and Galley, Michel and Manning, Christopher D.},
title = {Improved Models of Distortion Cost for Statistical Machine Translation},
booktitle = {Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
month = {June},
address = {Los Angeles, California},
publisher = {Association for Computational Linguistics},
pages = {867--875},
url = {
http://www.aclweb.org/anthology/N10-1129},
year = 2010
}
(Green et al., 2010).
Additional efficiency of dynamic programming is gained by collapsing contexts that are invariant to the language model
Li, Zhifei and Khudanpur, Sanjeev (2008):
A Scalable Decoder for Parsing-Based Machine Translation with Equivalent Language Model State Maintenance, Proceedings of the ACL-08: HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)
mentioned in Stack Decoding and Syntax Model Decoding@InProceedings{li-khudanpur:2008:SSST,
author = {Li, Zhifei and Khudanpur, Sanjeev},
title = {A Scalable Decoder for Parsing-Based Machine Translation with Equivalent Language Model State Maintenance},
booktitle = {Proceedings of the ACL-08:~HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {10--18},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0402},
year = 2008
}
(Li and Khudanpur, 2008).
Benchmarks
Discussion
Related Topics
New Publications
Heafield, Kenneth and Kayser, Michael and Manning, Christopher D. (2014):
Faster Phrase-Based Decoding by Refining Feature State, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{heafield-kayser-manning:2014:P14-2,
author = {Heafield, Kenneth and Kayser, Michael and Manning, Christopher D.},
title = {Faster Phrase-Based Decoding by Refining Feature State},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {130--135},
url = {
http://www.aclweb.org/anthology/P14-2022},
year = 2014
}
Heafield et al. (2014)
Wuebker, Joern and Ney, Hermann and Zens, Richard (2012):
Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{wuebker-ney-zens:2012:ACL2012short,
author = {Wuebker, Joern and Ney, Hermann and Zens, Richard},
title = {Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation},
booktitle = {Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {July},
address = {Jeju Island, Korea},
publisher = {Association for Computational Linguistics},
pages = {28--32},
url = {
http://www.aclweb.org/anthology/P12-2006},
year = 2012
}
Wuebker et al. (2012)
Richard Zens and Hermann Ney (2008):
Improvements in dynamic programming beam search for phrase-based statistical machine translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{zens:iwslt:2008,
author = {Richard Zens and Hermann Ney},
title = {Improvements in dynamic programming beam search for phrase-based statistical machine translation},
url = {
http://www-i6.informatik.rwth-aachen.de/publications/download/618/Zens-IWSLT-2008.pdf},
googlescholar = {3947364333513060547},
pages = {195--205},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2008
}
Zens and Ney (2008)
Bach, Nguyen and Vogel, Stephan and Cherry, Colin (2009):
Cohesive Constraints in A Beam Search Phrase-based Decoder, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
@InProceedings{bach-vogel-cherry:2009:NAACLHLT09-Short,
author = {Bach, Nguyen and Vogel, Stephan and Cherry, Colin},
title = {Cohesive Constraints in A Beam Search Phrase-based Decoder},
booktitle = {Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers},
month = {June},
address = {Boulder, Colorado},
publisher = {Association for Computational Linguistics},
pages = {1--4},
url = {
http://www.aclweb.org/anthology/N/N09/N09-2001},
year = 2009
}
Bach et al. (2009)