Syntactic Re-Ranking
After a decoder generates the most promising translations, these can be checked for syntactic adherence in a separate stage.
Syntactic Reranking is the main subject of 9 publications. 5 are discussed here.
Publications
Linguistic features may be added to an n-best of candidate translation to be exploited by a re-ranking approach. This was explored by
Franz Josef Och and Daniel Gildea and Sanjeev Khudanpur and Anoop Sarkar and Kenji Yamada and Alexander Fraser and Shankar Kumar and Libin Shen and David A. Smith and Katherine Eng and Viren Jain and Zhen Jin and Dragomir Radev (2004):
A Smorgasbord of Features for Statistical Machine Translation, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)
mentioned in Syntactic Reranking and Reranking@Inproceedings{Och:2004,
author = {Franz Josef Och and Daniel Gildea and Sanjeev Khudanpur and Anoop Sarkar and Kenji Yamada and Alexander Fraser and Shankar Kumar and Libin Shen and David A. Smith and Katherine Eng and Viren Jain and Zhen Jin and Dragomir Radev},
title = {A Smorgasbord of Features for Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/54\_Paper.pdf},
googlescholar = {2846180344359494752},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)},
year = 2004
}
Och et al. (2004) and by
Koehn, Philipp and Knight, Kevin (2003):
Feature-Rich Statistical Translation of Noun Phrases, Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics
mentioned in Translating Chunks And Clauses and Syntactic Reranking@inproceedings{Koehn:2003np,
author = {Koehn, Philipp and Knight, Kevin},
title = {Feature-Rich Statistical Translation of Noun Phrases},
booktitle = {Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics},
editor = {Erhard Hinrichs and Dan Roth},
url = {
http://www.aclweb.org/anthology/P03-1040.pdf},
pages = {311--318},
year = 2003
}
Koehn and Knight (2003) for noun phrases. Sequence models trained on part-of-speech tags
Bonneau Maynard, Hélène and Allauzen, Alexandre and Déchelotte, Daniel and Schwenk, Holger (2007):
Combining Morphosyntactic Enriched Representation with n-best Reranking in Statistical Translation, Proceedings of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation
@InProceedings{bonneaumaynard-EtAl:2007:SSST,
author = {Bonneau Maynard, H\'{e}l\`{e}ne and Allauzen, Alexandre and D\'{e}chelotte, Daniel and Schwenk, Holger},
title = {Combining Morphosyntactic Enriched Representation with n-best Reranking in Statistical Translation},
booktitle = {Proceedings of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation},
month = {April},
address = {Rochester, New York},
publisher = {Association for Computational Linguistics},
pages = {65--71},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0409},
year = 2007
}
(Maynard et al., 2007) or CCG supertags
Hassan, Hany and Sima'an, Khalil and Way, Andy (2007):
Supertagged Phrase-Based Statistical Machine Translation, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics
mentioned in String To Tree and Syntactic Reranking@InProceedings{hassan-simaan-way:2007:ACLMain,
author = {Hassan, Hany and Sima'an, Khalil and Way, Andy},
title = {Supertagged Phrase-Based Statistical Machine Translation},
booktitle = {Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {288--295},
url = {
http://www.aclweb.org/anthology/P/P07/P07-1037},
year = 2007
}
(Hassan et al., 2007) may be also used in re-ranking. To check if the dependency structure is preserved during translation, the number of preserved dependency links or paths may be a useful feature
Nikoulina, Vassilina and Dymetman, Marc (2008):
Experiments in Discriminating Phrase-Based Translations on the Basis of Syntactic Coupling Features, Proceedings of the ACL-08: HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)
@InProceedings{nikoulina-dymetman:2008:SSST,
author = {Nikoulina, Vassilina and Dymetman, Marc},
title = {Experiments in Discriminating Phrase-Based Translations on the Basis of Syntactic Coupling Features},
booktitle = {Proceedings of the ACL-08:~HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {55--60},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0407},
year = 2008
}
(Nikoulina and Dymetman, 2008).
Benchmarks
Discussion
Related Topics
New Publications
Miceli Barone, Antonio Valerio (2015):
Translation reranking using source phrase dependency features, Proceedings of the Ninth Workshop on Syntax, Semantics and Structure in Statistical Translation
@InProceedings{micelibarone:2015:SSST-9,
author = {Miceli Barone, Antonio Valerio},
title = {Translation reranking using source phrase dependency features},
booktitle = {Proceedings of the Ninth Workshop on Syntax, Semantics and Structure in Statistical Translation},
month = {June},
address = {Denver, Colorado, USA},
publisher = {Association for Computational Linguistics},
pages = {57--60},
url = {
http://www.aclweb.org/anthology/W15-1009},
year = 2015
}
Barone (2015)
Katsuhito Sudoh and Xianchao Wu and Kevin Duh and Hajime Tsukada and Masaaki Nagata (2011):
Post-ordering in Statistical Machine Translation, Proceedings of the 13th Machine Translation Summit (MT Summit XIII)
@inproceedings{MTS-2011-Sudoh,
author = {Katsuhito Sudoh and Xianchao Wu and Kevin Duh and Hajime Tsukada and Masaaki Nagata},
title = {Post-ordering in Statistical Machine Translation},
url = {
http://www.mt-archive.info/MTS-2011-Sudoh.pdf},
pages = {316-324},
booktitle = {Proceedings of the 13th Machine Translation Summit (MT Summit XIII)},
publisher = {International Association for Machine Translation},
location = {Xiamen, China},
year = 2011
}
Sudoh et al. (2011)
Wu, Dekai and Fung, Pascale (2009):
Semantic Roles for SMT: A Hybrid Two-Pass Model, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
@InProceedings{wu-fung:2009:NAACLHLT09-Short,
author = {Wu, Dekai and Fung, Pascale},
title = {Semantic Roles for SMT: A Hybrid Two-Pass Model},
booktitle = {Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers},
month = {June},
address = {Boulder, Colorado},
publisher = {Association for Computational Linguistics},
pages = {13--16},
url = {
http://www.aclweb.org/anthology/N/N09/N09-2004},
year = 2009
}
Wu and Fung (2009)
Rosa, Rudolf and Mareček, David and Dušek, Ondřej (2012):
DEPFIX: A System for Automatic Correction of Czech MT Outputs, Proceedings of the Seventh Workshop on Statistical Machine Translation
@InProceedings{rosa-marevcek-duvsek:2012:WMT,
author = {Rosa, Rudolf and Mare\v{c}ek, David and Du\v{s}ek, Ond\v{r}ej},
title = {DEPFIX: A System for Automatic Correction of Czech {MT} Outputs},
booktitle = {Proceedings of the Seventh Workshop on Statistical Machine Translation},
month = {June},
address = {Montreal, Canada},
publisher = {Association for Computational Linguistics},
pages = {408--414},
url = {
http://www.aclweb.org/anthology/W12-3151},
year = 2012
}
Rosa et al. (2012)