Discriminative Word Alignment

Viewed from a machine learning perspective, word alignment is an interesting structured prediction problem, with the added twist that only small amounts of labeled data but large amounts of unlabeled data are available.

Discriminative Word Alignment is the main subject of 22 publications. 17 are discussed here.

Publications

Statistical machine translation systems achieve better quality with manually labeled word alignments (Callison-Burch et al., 2004), but such data does not exist in large quantities. Discriminative word alignment methods typically collect statistics over a large unlabeled corpus, which may first have been aligned with a baseline method such as the IBM models. These statistics form the basis for features whose weights are optimized by machine learning over a much smaller labeled corpus. Fraser and Marcu (2007) extend their generative model, which allows many-to-many alignments, with a discriminative optimization step that uses small amounts of labeled data.
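As a rough illustration of this setup, the following Python sketch derives one association statistic (the Dice coefficient) from an unlabeled sentence-aligned corpus and uses it as a feature in a linear link score. The function names, the distortion feature, and the choice of Dice are assumptions made for illustration; they are not taken from any of the cited systems, which typically rely on statistics from baseline alignments instead.

```python
from collections import Counter
from itertools import product


def dice_scores(bitext):
    """Dice coefficient for every co-occurring (source, target) word pair.

    bitext: iterable of (source_tokens, target_tokens) sentence pairs.
    Counts are per sentence pair, so repeated words in a sentence count once.
    """
    src_count, tgt_count, pair_count = Counter(), Counter(), Counter()
    for src, tgt in bitext:
        src_types, tgt_types = set(src), set(tgt)
        src_count.update(src_types)
        tgt_count.update(tgt_types)
        pair_count.update(product(src_types, tgt_types))
    return {
        (s, t): 2.0 * c / (src_count[s] + tgt_count[t])
        for (s, t), c in pair_count.items()
    }


def link_features(src, tgt, i, j, dice):
    """Hypothetical feature vector for linking src[i] to tgt[j]."""
    return {
        # Association strength estimated on the large unlabeled corpus.
        "dice": dice.get((src[i], tgt[j]), 0.0),
        # Penalty for links far from the diagonal (relative positions).
        "distortion": -abs(i / len(src) - j / len(tgt)),
    }


def link_score(weights, feats):
    """Linear model score: dot product of weights and feature values."""
    return sum(weights.get(name, 0.0) * value for name, value in feats.items())
```

In this sketch, the entries of `weights` are the only parameters a discriminative learner would tune on the small labeled corpus; the feature values themselves come from the unlabeled data.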

Discriminative approaches may use the perceptron algorithm (Moore, 2005; Moore et al., 2006), maximum entropy models (Ittycheriah and Roukos, 2005), neural networks (Ayan et al., 2005), max-margin methods (Taskar et al., 2005), boosting (Wu and Wang, 2005; Wu et al., 2006), support vector machines (Cherry and Lin, 2006), conditional random fields (Blunsom and Cohn, 2006; Niehues and Vogel, 2008) or MIRA (Venkatapathy and Joshi, 2007).
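To make the simplest of these learners concrete, here is a minimal perceptron sketch in Python. It assumes that features decompose over individual links and that each link is decided independently by the sign of its score; both are simplifying assumptions for illustration, and the feature names are hypothetical. The cited perceptron-based aligners use richer features and search procedures.

```python
from collections import defaultdict


def features(src, tgt, i, j):
    """Hypothetical sparse features for linking src[i] to tgt[j]."""
    return {
        f"pair={src[i]}|{tgt[j]}": 1.0,
        "same_position": 1.0 if i == j else 0.0,
        "bias": 1.0,
    }


def predict(weights, src, tgt):
    """Return all links with a positive score (links decided independently)."""
    links = set()
    for i in range(len(src)):
        for j in range(len(tgt)):
            feats = features(src, tgt, i, j)
            score = sum(weights[name] * value for name, value in feats.items())
            if score > 0:
                links.add((i, j))
    return links


def perceptron_train(labeled, epochs=5):
    """labeled: list of (source_tokens, target_tokens, gold_link_set)."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for src, tgt, gold in labeled:
            guess = predict(weights, src, tgt)
            # Missed gold links: move weights toward their features.
            for i, j in gold - guess:
                for name, value in features(src, tgt, i, j).items():
                    weights[name] += value
            # Spurious predicted links: move weights away from their features.
            for i, j in guess - gold:
                for name, value in features(src, tgt, i, j).items():
                    weights[name] -= value
    return weights
```

Because the features decompose over links, adding the features of missed links and subtracting those of spurious links is the same as the usual structured perceptron update of gold features minus predicted features.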

Such methods allow the integration of features such as a more flexible fertility model and interactions between consecutive words (Lacoste-Julien et al., 2006). Smaller parallel corpora in particular benefit from more attention to less frequent words (Zhang et al., 2005). Discriminative models also make it straightforward to add further features, such as an ITG constraint (Chao and Li, 2007).

Related to the discriminative approach, posterior methods use agreement in the n-best alignments to adjust alignment points (Kumar and Byrne, 2002).
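One simple way to turn n-best agreement into alignment decisions is to compute, for each link, the normalized score mass of the n-best alignments that contain it, and to keep links above a threshold. The Python sketch below illustrates only this general idea; the input format, the function names, and the 0.5 threshold are assumptions, and it is not a reconstruction of the cited method.

```python
import math
from collections import defaultdict


def link_posteriors(nbest):
    """Estimate link posteriors from an n-best list.

    nbest: list of (log_score, set_of_links) pairs, where each link is an
    (i, j) tuple. Scores are renormalized over the list only.
    """
    max_log = max(log_score for log_score, _ in nbest)
    # Subtract the maximum before exponentiating for numerical stability.
    masses = [math.exp(log_score - max_log) for log_score, _ in nbest]
    total = sum(masses)
    posterior = defaultdict(float)
    for mass, (_, links) in zip(masses, nbest):
        for link in links:
            posterior[link] += mass / total
    return dict(posterior)


def threshold_links(posterior, threshold=0.5):
    """Keep links whose estimated posterior reaches the threshold."""
    return {link for link, p in posterior.items() if p >= threshold}
```

Lowering the threshold admits more links and raising it admits fewer, which gives a direct handle on the balance between recall and precision.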

Benchmarks

Discussion

New Publications

Nadi Tomeh and Alexandre Allauzen and François Yvon and Guillaume Wisniewski (2010): Refining Word Alignment with Discriminative Training, Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas
@inproceedings{AMTA-2010-Tomeh,
author = {Nadi Tomeh and Alexandre Allauzen and Fran{\c{c}}ois Yvon and Guillaume Wisniewski},
title = {Refining Word Alignment with Discriminative Training},
url = {http://www.mt-archive.info/AMTA-2010-Tomeh.pdf},
booktitle = {Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas},
location = {Denver, Colorado},
year = 2010
}
Tomeh et al. (2010)
Yang Liu and Qun Liu and Shouxun Lin (2010): Discriminative Word Alignment by Linear Modeling, Computational Linguistics
@Article{CL:2010-3002,
author = {Yang Liu and Qun Liu and Shouxun Lin},
title = {Discriminative Word Alignment by Linear Modeling},
journal = {Computational Linguistics},
volume = {36},
number = {3},
url = {http://aclweb.org/anthology-new/J/J10/J10-3002.pdf},
year = 2010
}
Liu et al. (2010)
Liu, Yang and Xia, Tian and Xiao, Xinyan and Liu, Qun (2009): Weighted Alignment Matrices for Statistical Machine Translation, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
@InProceedings{liu-EtAl:2009:EMNLP3,
author = {Liu, Yang and Xia, Tian and Xiao, Xinyan and Liu, Qun},
title = {Weighted Alignment Matrices for Statistical Machine Translation},
booktitle = {Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing},
month = {August},
address = {Singapore},
publisher = {Association for Computational Linguistics},
pages = {1017--1026},
url = {http://www.aclweb.org/anthology/D/D09/D09-1106},
year = 2009
}
Liu et al. (2009)
Setiawan, Hendra and Dyer, Chris and Resnik, Philip (2010): Discriminative Word Alignment with a Function Word Reordering Model, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
@InProceedings{setiawan-dyer-resnik:2010:EMNLP,
author = {Setiawan, Hendra and Dyer, Chris and Resnik, Philip},
title = {Discriminative Word Alignment with a Function Word Reordering Model},
booktitle = {Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Cambridge, MA},
publisher = {Association for Computational Linguistics},
pages = {534--544},
url = {http://www.aclweb.org/anthology/D/D10/D10-1052},
year = 2010
}
Setiawan et al. (2010)
Dyer, Chris and Clark, Jonathan H. and Lavie, Alon and Smith, Noah A. (2011): Unsupervised Word Alignment with Arbitrary Features, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{dyer-EtAl:2011:ACL-HLT2011,
author = {Dyer, Chris and Clark, Jonathan H. and Lavie, Alon and Smith, Noah A.},
title = {Unsupervised Word Alignment with Arbitrary Features},
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {409--419},
url = {http://www.aclweb.org/anthology/P11-1042},
year = 2011
}
Dyer et al. (2011)

MT Research Survey Wiki

A Comprehensive Survey of Neural and Statistical Machine Translation Research Publications
