Symmetrization
The birth defect of the IBM Models is their restriction to one-to-many alignments. Training models in both directions and merging the resulting alignments overcomes this.
Symmetrization is the main subject of 12 publications. 10 are discussed here.
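As a minimal sketch of what merging the two directional alignments can look like, the Python snippet below computes the intersection and the union of two alignments. The set-of-index-pairs representation, the function name symmetrize, and the toy alignments are assumptions made for illustration; they are not taken from any of the publications listed here.

```python
def symmetrize(src2tgt, tgt2src):
    """Merge two directional word alignments.

    src2tgt: set of (source_index, target_index) pairs.
    tgt2src: set of (target_index, source_index) pairs (assumed orientation).
    Returns (intersection, union) as sets of (source_index, target_index).
    """
    # Flip the second alignment so both use (source, target) orientation.
    flipped = {(s, t) for (t, s) in tgt2src}
    intersection = src2tgt & flipped  # points both directions agree on
    union = src2tgt | flipped         # points proposed by either direction
    return intersection, union


# Toy example: source word 1 links to two target words in one direction,
# which the other direction cannot express for that source word.
src2tgt = {(0, 0), (1, 1), (1, 2)}
tgt2src = {(0, 0), (1, 1), (2, 2)}
intersection, union = symmetrize(src2tgt, tgt2src)
```

The intersection tends to keep only points that both models support, while the union keeps every proposed point; methods in this section operate in the space between these two extremes.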
Publications
Symmetrizing IBM model alignments was first proposed by
Franz Josef Och and Hermann Ney (2003):
A Systematic Comparison of Various Statistical Alignment Models, Computational Linguistics
@Article{Och:2003j,
author = {Franz Josef Och and Hermann Ney},
title = {A Systematic Comparison of Various Statistical Alignment Models},
url = {
http://acl.ldc.upenn.edu/J/J03/J03-1002.pdf?q=modles},
googlescholar = {7906670690027479083},
journal = {Computational Linguistics},
volume = {29},
number = {1},
year = 2003
}
Och and Ney (2003), and may be improved by symmetrizing already during IBM Model training
Matusov, Evgeny and Zens, Richard and Ney, Hermann (2004):
Symmetric Word Alignments for Statistical Machine Translation, Proceedings of Coling 2004
@inproceedings{Matusov:2004,
author = {Matusov, Evgeny and Zens, Richard and Ney, Hermann},
title = {Symmetric Word Alignments for Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/C/C04/C04-1032.pdf},
booktitle = {Proceedings of Coling 2004},
month = {Aug 23--Aug 27},
address = {Geneva, Switzerland},
publisher = {COLING},
pages = {219--225},
year = 2004
}
(Matusov et al., 2004), or by explicitly modeling the agreement between the two alignments and optimizing it during EM training
Liang, Percy and Taskar, Ben and Klein, Dan (2006):
Alignment by Agreement, Proceedings of the Human Language Technology Conference of the NAACL, Main Conference
@InProceedings{liang-taskar-klein:2006:HLT-NAACL06-Main,
author = {Liang, Percy and Taskar, Ben and Klein, Dan},
title = {Alignment by Agreement},
booktitle = {Proceedings of the Human Language Technology Conference of the NAACL, Main Conference},
month = {June},
address = {New York City, USA},
publisher = {Association for Computational Linguistics},
pages = {104--111},
url = {
http://www.aclweb.org/anthology/N/N06/N06-1014},
year = 2006
}
(Liang et al., 2006). Different word alignments obtained with various IBM Models and symmetrization methods may also be combined using a maximum entropy approach
Ayan, Necip Fazil and Dorr, Bonnie J. (2006):
A Maximum Entropy Approach to Combining Word Alignments, Proceedings of the Human Language Technology Conference of the NAACL, Main Conference
@InProceedings{ayan-dorr:2006:HLT-NAACL06-Main,
author = {Ayan, Necip Fazil and Dorr, Bonnie J.},
title = {A Maximum Entropy Approach to Combining Word Alignments},
booktitle = {Proceedings of the Human Language Technology Conference of the NAACL, Main Conference},
month = {June},
address = {New York City, USA},
publisher = {Association for Computational Linguistics},
pages = {96--103},
url = {
http://www.aclweb.org/anthology/N/N06/N06-1013},
year = 2006
}
(Ayan and Dorr, 2006;
Ganchev, Kuzman and Graça, João V. and Taskar, Ben (2008):
Better Alignments = Better Translations?, Proceedings of ACL-08: HLT
@InProceedings{ganchev-gracca-taskar:2008:ACLMain,
author = {Ganchev, Kuzman and Gra\c{c}a, Jo\~{a}o V. and Taskar, Ben},
title = {Better Alignments = Better Translations?},
booktitle = {Proceedings of ACL-08: HLT},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {986--993},
url = {
http://www.aclweb.org/anthology/P/P08/P08-1112},
year = 2008
}
Ganchev et al., 2008).
Crego, Josep M. and Habash, Nizar (2008):
Using Shallow Syntax Information to Improve Word Alignment and Reordering for SMT, Proceedings of the Third Workshop on Statistical Machine Translation
mentioned in Symmetrization and POS Chunk Prereordering
@InProceedings{crego-habash:2008:WMT,
author = {Crego, Josep M. and Habash, Nizar},
title = {Using Shallow Syntax Information to Improve Word Alignment and Reordering for {SMT}},
booktitle = {Proceedings of the Third Workshop on Statistical Machine Translation},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {53--61},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0307},
year = 2008
}
Crego and Habash (2008) use constraints over syntactic chunks to guide symmetrization.
Starting from a word alignment resulting from IBM models, additional features may be defined to assess each alignment point.
Fraser, Alexander and Marcu, Daniel (2006):
Semi-Supervised Training for Statistical Word Alignment, Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics
@InProceedings{fraser-marcu:2006:COLACL,
author = {Fraser, Alexander and Marcu, Daniel},
title = {Semi-Supervised Training for Statistical Word Alignment},
booktitle = {Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {769--776},
url = {
http://www.aclweb.org/anthology/P/P06/P06-1097},
year = 2006
}
Fraser and Marcu (2006) use additional features during symmetrization, either to be used in re-ranking or integrated into the search.
Such features may form the basis for a classifier that adds one alignment point at a time
Dengjun Ren and Hua Wu and Haifeng Wang (2007):
Improving Statistical Word Alignment with Various Clues, Proceedings of the MT Summit XI
@inproceedings{Ren:2007:MTSummit,
author = {Dengjun Ren and Hua Wu and Haifeng Wang},
title = {Improving Statistical Word Alignment with Various Clues},
url = {
http://www.mt-archive.info/MTS-2007-Ren.pdf},
googlescholar = {10845914611432201731},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
(Ren et al., 2007), possibly based on a skeleton of highly likely alignment points
Ma, Yanjun and Ozdowska, Sylwia and Sun, Yanli and Way, Andy (2008):
Improving Word Alignment Using Syntactic Dependencies, Proceedings of the ACL-08: HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)
@InProceedings{ma-EtAl:2008:SSST,
author = {Ma, Yanjun and Ozdowska, Sylwia and Sun, Yanli and Way, Andy},
title = {Improving Word Alignment Using Syntactic Dependencies},
booktitle = {Proceedings of the ACL-08:~HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {69--77},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0409},
year = 2008
}
(Ma et al., 2008), or deletes alignment points one at a time from the symmetrized union alignment
Fossum, Victoria Li and Knight, Kevin and Abney, Steven (2008):
Using Syntax to Improve Word Alignment Precision for Syntax-Based Machine Translation, Proceedings of the Third Workshop on Statistical Machine Translation
@InProceedings{fossum-knight-abney:2008:WMT,
author = {Fossum, Victoria Li and Knight, Kevin and Abney, Steven},
title = {Using Syntax to Improve Word Alignment Precision for Syntax-Based Machine Translation},
booktitle = {Proceedings of the Third Workshop on Statistical Machine Translation},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {44--52},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0306},
year = 2008
}
(Fossum et al., 2008).
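The two point-by-point strategies described above (adding points to a high-confidence skeleton, or deleting points from the union) can be sketched as simple greedy loops. This is only an illustration under stated assumptions: the cited work uses trained classifiers over richer features, whereas the adjacency_score function here is a hypothetical stand-in, and the names grow_alignment and prune_alignment and the threshold of 0.5 are likewise invented for the example.

```python
def adjacency_score(point, alignment):
    """Hypothetical stand-in for a learned classifier score:
    1.0 if the point neighbors (incl. diagonally) a point in alignment, else 0.0."""
    s, t = point
    near = any(abs(s - s2) <= 1 and abs(t - t2) <= 1 for (s2, t2) in alignment)
    return 1.0 if near else 0.0


def grow_alignment(skeleton, union, score, threshold=0.5):
    """Start from a skeleton (e.g. the intersection) and add one point at a time."""
    alignment = set(skeleton)
    candidates = set(union) - alignment
    while candidates:
        # sorted() makes tie-breaking deterministic
        best = max(sorted(candidates), key=lambda p: score(p, alignment))
        if score(best, alignment) < threshold:
            break  # no remaining candidate is good enough
        alignment.add(best)
        candidates.remove(best)
    return alignment


def prune_alignment(union, score, threshold=0.5):
    """Start from the union and delete one point at a time."""
    alignment = set(union)
    while alignment:
        # score each point against the rest of the alignment
        worst = min(sorted(alignment), key=lambda p: score(p, alignment - {p}))
        if score(worst, alignment - {worst}) >= threshold:
            break  # every remaining point is good enough
        alignment.remove(worst)
    return alignment


skeleton = {(0, 0), (1, 1)}
union_points = {(0, 0), (1, 1), (1, 2), (4, 0)}
grown = grow_alignment(skeleton, union_points, adjacency_score)
pruned = prune_alignment(union_points, adjacency_score)
```

In this toy case both loops keep the point (1, 2), which neighbors (1, 1), and leave out the isolated point (4, 0). With a different scoring function the two directions of search need not reach the same result.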
Benchmarks
Discussion
Related Topics
New Publications
Liu, Chunyang and Liu, Yang and Sun, Maosong and Luan, Huanbo and Yu, Heng (2015):
Generalized Agreement for Bidirectional Word Alignment, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
@InProceedings{liu-EtAl:2015:EMNLP1,
author = {Liu, Chunyang and Liu, Yang and Sun, Maosong and Luan, Huanbo and Yu, Heng},
title = {Generalized Agreement for Bidirectional Word Alignment},
booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {1828--1836},
url = {
http://aclweb.org/anthology/D15-1210},
year = 2015
}
Liu et al. (2015)
Brown, Ralf D. and Kim, Jae Dong and Jansen, Peter J. and Carbonell, Jaime G. (2005):
Symmetric Probabilistic Alignment, Proceedings of the ACL Workshop on Building and Using Parallel Texts
@InProceedings{brown-EtAl:2005:WPT,
author = {Brown, Ralf D. and Kim, Jae Dong and Jansen, Peter J. and Carbonell, Jaime G.},
title = {Symmetric Probabilistic Alignment},
booktitle = {Proceedings of the ACL Workshop on Building and Using Parallel Texts},
month = {June},
address = {Ann Arbor, Michigan},
publisher = {Association for Computational Linguistics},
pages = {87--90},
url = {
http://www.aclweb.org/anthology/W/W05/W05-0813},
year = 2005
}
Brown et al. (2005)