Re-Ranking

Instead of relying solely on an integrated search for the best translation, we may introduce a second decoding pass in which the best translation is chosen from the set of most likely candidates (an n-best list) generated by a traditional decoder. This allows the use of additional features or alternative decision rules.
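The second pass described above can be sketched as follows. This is a minimal illustration, assuming each candidate arrives with a dictionary of feature values and that candidates are scored by a weighted feature sum (a log-linear model). The feature names `decoder` and `extra_lm`, the weights, and the 3-best list are hypothetical and not taken from any cited system.

```python
from typing import Dict, List, Tuple

# A candidate is a translation string plus its feature values.
Candidate = Tuple[str, Dict[str, float]]


def rerank(nbest: List[Candidate], weights: Dict[str, float]) -> List[Tuple[float, str]]:
    """Score each candidate with a weighted feature sum and sort best-first."""
    scored = []
    for text, feats in nbest:
        # Features without a weight contribute nothing to the score.
        score = sum(weights.get(name, 0.0) * value for name, value in feats.items())
        scored.append((score, text))
    # sorted() is stable, so candidates with equal scores keep decoder order.
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical 3-best list in decoder order: the decoder score plus one
# extra feature that is only computed in the second pass.
nbest = [
    ("the house is small", {"decoder": -2.0, "extra_lm": -5.0}),
    ("the house is little", {"decoder": -2.5, "extra_lm": -3.0}),
    ("small is the house", {"decoder": -3.0, "extra_lm": -6.0}),
]
weights = {"decoder": 1.0, "extra_lm": 0.5}

best_score, best_text = rerank(nbest, weights)[0]
```

In this toy example the extra feature moves the decoder's second candidate to the top, which is the kind of change a second pass is meant to allow.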

Re-ranking is the main subject of 12 publications, 8 of which are discussed here.

Publications

Minimum error rate training has been used for re-ranking (Och et al., 2004). Other proposed methods are based on ordinal regression to separate good translations from bad ones (Shen et al., 2004) and on simultaneous perturbation stochastic approximation, SPSA (Lambert and Banchs, 2006).

Franz Josef Och and Daniel Gildea and Sanjeev Khudanpur and Anoop Sarkar and Kenji Yamada and Alexander Fraser and Shankar Kumar and Libin Shen and David A. Smith and Katherine Eng and Viren Jain and Zhen Jin and Dragomir Radev (2004): A Smorgasbord of Features for Statistical Machine Translation, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL)

Libin Shen and Anoop Sarkar and Franz Josef Och (2004): Discriminative Reranking for Machine Translation, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL)

Patrik Lambert and Rafael E. Banchs (2006): Tuning machine translation parameters with SPSA, Proc. of the International Workshop on Spoken Language Translation
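To make the SPSA idea concrete, here is a minimal, generic sketch of the update rule. It is not the setup of Lambert and Banchs (2006). The function name `spsa_minimize`, the gain constants, and the quadratic `toy_loss` are assumptions for illustration; in re-ranking, the loss would instead be an error measure of the top-ranked candidates under the current weights.

```python
import random
from typing import Callable, List, Sequence


def spsa_minimize(
    loss: Callable[[Sequence[float]], float],
    theta: Sequence[float],
    iterations: int = 200,
    a: float = 0.1,
    c: float = 0.1,
    seed: int = 0,
) -> List[float]:
    """Minimize `loss` using simultaneous-perturbation gradient estimates."""
    rng = random.Random(seed)
    theta = list(theta)
    for k in range(iterations):
        # Decaying gain sequences; the exponents 0.602 and 0.101 are
        # commonly used defaults and an assumption of this sketch.
        a_k = a / (k + 1) ** 0.602
        c_k = c / (k + 1) ** 0.101
        # Perturb all weights at once with random +/-1 directions.
        delta = [rng.choice((-1.0, 1.0)) for _ in theta]
        plus = [t + c_k * d for t, d in zip(theta, delta)]
        minus = [t - c_k * d for t, d in zip(theta, delta)]
        # Two loss evaluations per iteration, regardless of dimensionality.
        diff = loss(plus) - loss(minus)
        # Gradient estimate for weight i is diff / (2 * c_k * delta_i).
        theta = [t - a_k * diff / (2.0 * c_k * d) for t, d in zip(theta, delta)]
    return theta


# Toy stand-in objective: squared distance to a hypothetical optimum.
TARGET = [1.0, -2.0]


def toy_loss(weights: Sequence[float]) -> float:
    return sum((w - t) ** 2 for w, t in zip(weights, TARGET))


start = [0.0, 0.0]
tuned = spsa_minimize(toy_loss, start)
```

The method needs only loss values and no analytic gradient, which is why it can be applied to objectives such as translation error metrics.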

Duh and Kirchhoff (2008) use boosting to improve over the log-linear model without any additional features.

Kevin Duh and Katrin Kirchhoff (2008): Beyond Log-Linear Models: Boosted Minimum Error Rate Training for N-best Re-ranking, Proceedings of ACL-08: HLT, Short Papers

Hasan et al. (2007) examine the required size of n-best lists, considering both oracle BLEU and actual re-ranking performance, and see gains with n-best lists of up to 10,000 entries. The use of a log-linear model imposes certain restrictions on features, which may be relaxed using other machine learning approaches such as kernel methods or Gaussian mixture models (Nguyen et al., 2007).

Saša Hasan and Richard Zens and Hermann Ney (2007): Are Very Large N-Best Lists Useful for SMT?, Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers

Patrick Nguyen and Milind Mahajan and Xiaodong He (2007): Training Non-Parametric Features for Statistical Machine Translation, Proceedings of the Second Workshop on Statistical Machine Translation
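The oracle measurement mentioned above asks how good the best candidate within the top n entries is, as a function of n. The sketch below illustrates this on a hypothetical 4-best list. It uses a simple unigram F1 score as a stand-in for BLEU, which is a corpus-level metric; the function names and data are assumptions for illustration.

```python
from collections import Counter
from typing import List


def unigram_f1(hyp: str, ref: str) -> float:
    """Stand-in sentence-level quality score (not BLEU)."""
    h, r = Counter(hyp.split()), Counter(ref.split())
    # Multiset intersection counts each shared word at most min(count) times.
    overlap = sum((h & r).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(h.values())
    recall = overlap / sum(r.values())
    return 2 * precision * recall / (precision + recall)


def oracle_score(nbest: List[str], ref: str, n: int) -> float:
    """Best achievable score when only the top-n candidates are available."""
    return max(unigram_f1(h, ref) for h in nbest[:n])


reference = "the house is small"
# Hypothetical n-best list in decoder order (rank 1 first).
nbest = [
    "the home is little",
    "the house is little",
    "house small",
    "the house is small",
]

# The oracle score can only stay the same or improve as n grows.
curve = [oracle_score(nbest, reference, n) for n in (1, 2, 4)]
```

The oracle curve is an upper bound. Whether a re-ranker actually realizes gains from a longer list depends on its features and training, which is why both measurements are reported.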

See Chen et al. (2007) and Patry et al. (2007) for features used in re-ranking.

Boxing Chen and Jun Sun and Hongfei Jiang and Min Zhang and Aiti Aw (2007): I^2R Chinese-English Translation System for IWSLT 2007, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)

Alexandre Patry and Philippe Langlais and Frédéric Béchet (2007): MISTRAL: A Lattice Translation System for IWSLT 2007, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)

Benchmarks

Discussion

New Publications

Simon Carter and Christof Monz (2010): Discriminative Syntactic Reranking for Statistical Machine Translation, Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas
@inproceedings{AMTA-2010-Carter,
author = {Simon Carter and Christof Monz},
title = {Discriminative Syntactic Reranking for Statistical Machine Translation},
url = {http://www.mt-archive.info/AMTA-2010-Carter.pdf},
booktitle = {Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas},
location = {Denver, Colorado},
year = 2010
}
Artem Sokolov and Guillaume Wisniewski and François Yvon (2012): Non-linear n-best List Reranking with Few Features, Proceedings of the Tenth Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2012-Sokolov,
author = {Artem Sokolov and Guillaume Wisniewski and Fran{\c{c}}ois Yvon},
title = {Non-linear n-best List Reranking with Few Features},
url = {http://www.mt-archive.info/AMTA-2012-Sokolov.pdf},
booktitle = {Proceedings of the Tenth Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {San Diego, California},
year = 2012
}
Erik Velldal and Stephan Oepen (2005): Maximum Entropy Models for Realization Ranking, Proceedings of the Tenth Machine Translation Summit (MT Summit X)
@InProceedings{Velldal:2005:MTS,
author = {Erik Velldal and Stephan Oepen},
title = {Maximum Entropy Models for Realization Ranking},
url = {http://heim.ifi.uio.no/~erikve/pubs/VelOep05.pdf},
googlescholar = {17230566164713489421},
booktitle = {Proceedings of the Tenth Machine Translation Summit (MT Summit X)},
month = {September},
address = {Phuket, Thailand},
year = 2005
}
Kevin Duh and Katsuhito Sudoh and Hajime Tsukada and Hideki Isozaki and Masaaki Nagata (2010): N-Best Reranking by Multitask Learning, Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
@InProceedings{duh-EtAl:2010:WMT,
author = {Duh, Kevin and Sudoh, Katsuhito and Tsukada, Hajime and Isozaki, Hideki and Nagata, Masaaki},
title = {N-Best Reranking by Multitask Learning},
booktitle = {Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {375--383},
url = {http://www.aclweb.org/anthology/W10-1757},
year = 2010
}

MT Research Survey Wiki

A Comprehensive Survey of Neural and Statistical Machine Translation Research Publications
