IBM Models
The IBM Models are a sequence of models of increasing complexity, starting with lexical translation probabilities and then adding models for reordering and word duplication.
The IBM Models are the main subject of 45 publications, 28 of which are discussed here.
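As an illustration of the simplest model in the sequence, the following is a minimal sketch of EM training for lexical translation probabilities in the style of IBM Model 1. The function name `train_ibm_model1`, the `NULL` token string, and the toy corpus format are assumptions made for this example; they are not taken from any of the toolkits cited on this page.

```python
from collections import defaultdict


def train_ibm_model1(corpus, iterations=10):
    """Estimate t(f | e) with EM, IBM Model 1 style.

    corpus: list of (foreign_tokens, english_tokens) sentence pairs.
    Returns a dict-like mapping (f, e) -> probability.
    """
    # Assumption: a NULL token on the English side lets foreign words
    # remain unaligned to any real English word.
    corpus = [(fs, ["NULL"] + es) for fs, es in corpus]

    # Start from a uniform distribution over the foreign vocabulary.
    f_vocab = {f for fs, _ in corpus for f in fs}
    uniform = 1.0 / len(f_vocab)
    t = defaultdict(lambda: uniform)

    for _ in range(iterations):
        count = defaultdict(float)  # expected counts c(f, e)
        total = defaultdict(float)  # expected counts c(e)

        # E-step: distribute each foreign word over the English words
        # of its sentence in proportion to the current t(f | e).
        for fs, es in corpus:
            for f in fs:
                norm = sum(t[(f, e)] for e in es)
                for e in es:
                    delta = t[(f, e)] / norm
                    count[(f, e)] += delta
                    total[e] += delta

        # M-step: renormalize the expected counts per English word.
        t = defaultdict(
            float, {(f, e): c / total[e] for (f, e), c in count.items()}
        )
    return t


if __name__ == "__main__":
    toy = [
        (["das", "Haus"], ["the", "house"]),
        (["das", "Buch"], ["the", "book"]),
        (["ein", "Buch"], ["a", "book"]),
    ]
    probs = train_ibm_model1(toy)
    print(probs[("das", "the")], probs[("Haus", "the")])
```

On the toy corpus, "das" co-occurs with "the" in two sentences, so its probability given "the" should end up higher than that of "Haus". The higher models add reordering and fertility parameters on top of this lexical component.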
Publications
The IBM models are described in detail by
Peter F. Brown and Stephen A. Della-Pietra and Vincent J. Della-Pietra and Robert L. Mercer (1993):
The Mathematics of Statistical Machine Translation, Computational Linguistics
mentioned in Word Based Models and IBM Models
@Article{Brown:1993,
author = {Peter F. Brown and Stephen A. Della-Pietra and Vincent J. Della-Pietra and Robert L. Mercer},
title = {The Mathematics of Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/J/J93/J93-2003.pdf},
volume = {19},
number = {2},
pages = {263--313},
journal = {Computational Linguistics},
year = 1993
}
Brown et al. (1993), who originally presented the statistical machine translation approach in earlier papers
Peter F. Brown and John Cocke and Stephen A. Della-Pietra and Vincent J. Della-Pietra and Frederick Jelinek and Robert L. Mercer and Paul Rossin (1988):
A Statistical Approach to Language Translation, Proceedings of the International Conference on Computational Linguistics (COLING)
mentioned in Word Based Models and IBM Models
@InProceedings{Brown:1988,
author = {Peter F. Brown and John Cocke and Stephen A. Della-Pietra and Vincent J. Della-Pietra and Frederick Jelinek and Robert L. Mercer and Paul Rossin},
title = {A Statistical Approach to Language Translation},
booktitle = {Proceedings of the International Conference on Computational Linguistics (COLING)},
year = 1988
}
(Brown et al., 1988;
Peter F. Brown and John Cocke and Stephen A. Della-Pietra and Vincent J. Della-Pietra and Frederick Jelinek and John D. Lafferty and Robert L. Mercer and Paul Rossin (1990):
A statistical approach to machine translation, Computational Linguistics
mentioned in Word Based Models and IBM Models
@Article{Brown:1990,
author = {Peter F. Brown and John Cocke and Stephen A. Della-Pietra and Vincent J. Della-Pietra and Frederick Jelinek and John D. Lafferty and Robert L. Mercer and Paul Rossin},
title = {A statistical approach to machine translation},
journal = {Computational Linguistics},
volume = {16},
number = {2},
pages = {76--85},
year = 1990
}
Brown et al., 1990). See also the introductions by
Knight (1997);
Knight (1999).
During a 1999 Johns Hopkins University Workshop, the IBM models were implemented in a toolkit called GIZA
Yaser Al-Onaizan and Jan Cuřín and Michael Jahr and Kevin Knight and John D. Lafferty and I. Dan Melamed and Franz-Josef Och and David Purdy and Noah A. Smith and David Yarowsky (1999):
Statistical Machine Translation
@Techreport{Giza,
author = {Yaser Al-Onaizan and Jan Cu\v{r}\'{i}n and Michael Jahr and Kevin Knight and John D. Lafferty and I. Dan Melamed and Franz-Josef Och and David Purdy and Noah A. Smith and David Yarowsky},
title = {Statistical Machine Translation},
institution = {Johns Hopkins University Summer Workshop {\tt
http://www.clsp.jhu.edu/ws99/projects/mt/}},
year = 1999
}
(Al-Onaizan et al., 1999), later refined into GIZA++ by
Franz Josef Och and Hermann Ney (2000):
Improved Statistical Alignment Models, Proceedings of the 38th Annual Meeting of the Association of Computational Linguistics (ACL)
mentioned in Word Based Models and IBM Models
@InProceedings{Och:2000,
author = {Franz Josef Och and Hermann Ney},
title = {Improved Statistical Alignment Models},
booktitle = {Proceedings of the 38th Annual Meeting of the Association of Computational Linguistics (ACL)},
url = {
http://acl.ldc.upenn.edu/P/P00/P00-1056.pdf},
year = 2000
}
Och and Ney (2000). GIZA++ is open source and widely used. The estimation of the bilingual word classes is described by
Franz Josef Och (1999):
An Efficient Method for Determining Bilingual Word Classes, Proceedings of the 9th Conference of the European Chapter of the Association for Computational Linguistics (EACL)
@inproceedings{Och:WordClasses,
author = {Franz Josef Och},
title = {An Efficient Method for Determining Bilingual Word Classes},
booktitle = {Proceedings of the 9th Conference of the European Chapter of the Association for Computational Linguistics (EACL)},
pages = {71--76},
year = 1999
}
Och (1999).
Instead of hill-climbing to the Viterbi alignment, other search methods such as Estimation of Distribution Algorithms may be employed
Rodríguez, Luis and García-Varea, Ismael and Gámez, Jose A. (2006):
Searching for alignments in SMT. A novel approach based on an Estimation of Distribution Algorithm, Proceedings on the Workshop on Statistical Machine Translation
@InProceedings{rodriguez-garciavarea-gamez:2006:WMT,
author = {Rodr\'{i}guez, Luis and Garc\'{i}a-Varea, Ismael and G\'{a}mez, Jose A.},
title = {Searching for alignments in {SMT}. {A} novel approach based on an Estimation of Distribution Algorithm},
booktitle = {Proceedings on the Workshop on Statistical Machine Translation},
month = {June},
address = {New York City},
publisher = {Association for Computational Linguistics},
pages = {47--54},
url = {
http://www.aclweb.org/anthology/W/W06/W06-3107},
year = 2006
}
(Rodríguez et al., 2006). The stochastic modelling approach for translation is described by
Hermann Ney (2001):
Stochastic Modelling: From Pattern Classification to Language Translation, Workshop on Data-Driven Machine Translation at 39th Annual Meeting of the Association of Computational Linguistics (ACL)
@InProceedings{Ney:2001,
author = {Hermann Ney},
title = {Stochastic Modelling: From Pattern Classification to Language Translation},
url = {
http://acl.ldc.upenn.edu/acl2001/DD-MT/Ney.pdf},
googlescholar = {497375019661278255},
booktitle = {Workshop on Data-Driven Machine Translation at 39th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 2001
}
Ney (2001).
A variation on the IBM models is the HMM model, which uses relative distortion but not fertility
Stephan Vogel and Hermann Ney and Christoph Tillmann (1996):
HMM-Based Word Alignment in Statistical Translation, Proceedings of the 16th International Conference on Computational Linguistics (COLING)
@InProceedings{Vogel:1996,
author = {Stephan Vogel and Hermann Ney and Christoph Tillmann},
title = {HMM-Based Word Alignment in Statistical Translation},
url = {
http://acl.ldc.upenn.edu/C/C96/C96-2141.pdf},
googlescholar = {6742027174667056165},
booktitle = {Proceedings of the 16th International Conference on Computational Linguistics (COLING)},
year = 1996
}
(Vogel et al., 1996). This model was extended by treating jumps to other source words differently from repeated translations of the same source word
Toutanova, Kristina and Ilhan, H. Tolga and Manning, Christopher D. (2002):
Extentions to HMM-based Statistical Word Alignment Models, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)
@inproceedings{Toutanova:2002,
author = {Toutanova, Kristina and Ilhan, H. Tolga and Manning, Christopher D.},
title = {Extentions to {HMM}-based Statistical Word Alignment Models},
url = {
http://acl.ldc.upenn.edu/W/W02/W02-1012.pdf},
googlescholar = {10043266141230029290},
booktitle = {Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {July},
address = {Philadelphia},
publisher = {Association for Computational Linguistics},
pages = {87--94},
year = 2002
}
(Toutanova et al., 2002), and conditioning jumps on the source word
He, Xiaodong (2007):
Using Word-Dependent Transition Models in HMM-Based Word Alignment for Statistical Machine Translation, Proceedings of the Second Workshop on Statistical Machine Translation
@InProceedings{he:2007:WMT,
author = {He, Xiaodong},
title = {Using Word-Dependent Transition Models in {HMM}-Based Word Alignment for Statistical Machine Translation},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {80--87},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0211},
year = 2007
}
(He, 2007).
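To make "relative distortion" concrete: in an HMM alignment model of the kind described by Vogel et al. (1996), the probability of moving to source position i depends only on the jump width from the previously aligned position. The sketch below normalizes a table of jump scores over the positions of one sentence. The function name `hmm_transition_probs` and the example jump scores are hypothetical and chosen only for illustration.

```python
def hmm_transition_probs(jump_scores, prev_pos, src_len):
    """Return p(i | prev_pos, src_len) for i in 0 .. src_len - 1.

    jump_scores maps a jump width (i - prev_pos) to a non-negative score.
    The scores are renormalized over the positions that exist in this
    sentence, so the result is a proper distribution.
    """
    scores = [jump_scores.get(i - prev_pos, 0.0) for i in range(src_len)]
    z = sum(scores)
    if z == 0.0:
        # No scored jump is reachable: fall back to a uniform distribution.
        return [1.0 / src_len] * src_len
    return [s / z for s in scores]


if __name__ == "__main__":
    # Hypothetical jump scores favouring a step to the next source word.
    jumps = {-1: 1.0, 0: 2.0, 1: 5.0, 2: 2.0}
    print(hmm_transition_probs(jumps, prev_pos=1, src_len=4))
```

Because only the jump width is modelled, the same table is shared across all sentence lengths and positions. The extensions cited above change what the jump distribution is conditioned on, for example the source word.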
IBM models have been extended using maximum entropy models
George Foster (2000):
A Maximum Entropy/Minimum Divergence Translation Model, Proceedings of the 38th Annual Meeting of the Association of Computational Linguistics (ACL)
@InProceedings{Foster:2000,
author = {George Foster},
title = {A Maximum Entropy/Minimum Divergence Translation Model},
url = {
http://acl.ldc.upenn.edu/P/P00/P00-1006.pdf},
booktitle = {Proceedings of the 38th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 2000
}
(Foster, 2000) to include position
George Foster (2000):
Incorporating Position Information into a Maximum Entropy/Minimum Divergence Translation Model, Proceedings of the Fourth Conference on Computational Natural Language Learning and of the Second Learning Language in Logic Workshop
@InProceedings{Foster:2000b,
author = {George Foster},
title = {Incorporating Position Information into a Maximum Entropy/Minimum Divergence Translation Model},
booktitle = {Proceedings of the Fourth Conference on Computational Natural Language Learning and of the Second Learning Language in Logic Workshop},
year = 2000
}
(Foster, 2000b), part-of-speech tag information
Seonho Kim and Juntae Yoon and Mansuk Song (2000):
Structural Feature Selection For English-Korean Statistical Machine Translation, Proceedings of the International Conference on Computational Linguistics (COLING)
@InProceedings{Kim:2000,
author = {Seonho Kim and Juntae Yoon and Mansuk Song},
title = {Structural Feature Selection For {English-Korean} Statistical Machine Translation},
url = {
http://pdf.aminer.org/000/107/450/structural\_feature\_selection\_for\_english\_korean\_statistical\_machine\_translation.pdf},
booktitle = {Proceedings of the International Conference on Computational Linguistics (COLING)},
year = 2000
}
(Kim et al., 2000), even in the EM training algorithm
Ismael García-Varea and Franz Josef Och and Hermann Ney and Francisco Casacuberta (2002):
Improving alignment quality in statistical machine translation using context-dependent maximum entropy models, Proceedings of the International Conference on Computational Linguistics (COLING)
@InProceedings{Varea:2002b,
author = {Ismael Garc\'{i}a-Varea and Franz Josef Och and Hermann Ney and Francisco Casacuberta},
title = {Improving alignment quality in statistical machine translation using context-dependent maximum entropy models},
booktitle = {Proceedings of the International Conference on Computational Linguistics (COLING)},
year = 2002
}
(García-Varea et al., 2002;
Ismael García-Varea and Franz Josef Och and Hermann Ney and Francisco Casacuberta (2002):
Efficient Integration of maximum entropy models within a maximum likelihood training scheme of statistical machine translation, Machine Translation: From Research to Real Users, 5th Conference of the Association for Machine Translation in the Americas, AMTA 2002 Tiburon, CA, USA, October 6-12, 2002, Proceedings
@inproceedings{Varea:2002c,
author = {Ismael Garc\'{i}a-Varea and Franz Josef Och and Hermann Ney and Francisco Casacuberta},
title = {Efficient Integration of maximum entropy models within a maximum likelihood training scheme of statistical machine translation},
editor = {Stephen D. Richardson},
booktitle = {Machine Translation: From Research to Real Users, 5th Conference of the Association for Machine Translation in the Americas, AMTA 2002 Tiburon, CA, USA, October 6-12, 2002, Proceedings},
publisher = {Springer},
series = {Lecture Notes in Computer Science},
volume = {2499},
isbn = {3-540-44282-0},
bibsource = {DBLP,
http://dblp.uni-trier.de},
year = 2002
}
García-Varea et al., 2002b). Improvements have also been obtained by adding bilingual dictionaries
Wu, Hua and Wang, Haifeng (2004):
Improving Statistical Word Alignment with a Rule-Based Machine Translation System, Proceedings of Coling 2004
@inproceedings{Wu:2004,
author = {Wu, Hua and Wang, Haifeng},
title = {Improving Statistical Word Alignment with a Rule-Based Machine Translation System },
url = {
http://acl.ldc.upenn.edu/C/C04/C04-1005.pdf},
googlescholar = {15371424355112100237},
booktitle = {Proceedings of Coling 2004 },
editor = {{}},
month = {Aug 23--Aug 27},
address = {Geneva, Switzerland},
publisher = {COLING},
pages = {29--35},
year = 2004
}
(Wu and Wang, 2004) and context vectors estimated from monolingual corpora
Wang, Wei and Zhou, Ming (2004):
Improving Word Alignment Models using Structured Monolingual Corpora, Proceedings of EMNLP 2004
@inproceedings{Wang:2004,
author = {Wang, Wei and Zhou, Ming},
title = {Improving Word Alignment Models using Structured Monolingual Corpora},
booktitle = {Proceedings of EMNLP 2004},
editor = {Dekang Lin and Dekai Wu},
month = {July},
address = {Barcelona, Spain},
publisher = {Association for Computational Linguistics},
pages = {198--205},
year = 2004
}
(Wang and Zhou, 2004), lemmatizing words
Herve Dejean and Eric Gaussier and Cyril Goutte and Kenji Yamada (2003):
Reducing Parameter Space for Word Alignment, HLT-NAACL 2003 Workshop: Building and Using Parallel Texts: Data Driven Machine Translation and Beyond
@inproceedings{Dejean:2003,
author = {Herve Dejean and Eric Gaussier and Cyril Goutte and Kenji Yamada },
title = {Reducing Parameter Space for Word Alignment},
url = {
http://acl.ldc.upenn.edu/W/W03/W03-0305.pdf},
booktitle = {HLT-NAACL 2003 Workshop: Building and Using Parallel Texts: Data Driven Machine Translation and Beyond},
editor = {Rada Mihalcea and Ted Pedersen},
month = {May 31},
address = {Edmonton, Alberta, Canada},
publisher = {Association for Computational Linguistics},
year = 2003
}
(Dejean et al., 2003;
Popovic, Maja and Ney, Hermann (2004):
Improving Word Alignment Quality using Morpho-syntactic Information, Proceedings of Coling 2004
@inproceedings{Popovic:2004,
author = {Popovic, Maja and Ney, Hermann},
title = {Improving Word Alignment Quality using Morpho-syntactic Information },
url = {
http://acl.ldc.upenn.edu/C/C04/C04-1045.pdf},
googlescholar = {252551793077182696},
booktitle = {Proceedings of Coling 2004 },
editor = {{}},
month = {Aug 23--Aug 27},
address = {Geneva, Switzerland},
publisher = {COLING},
pages = {310--314},
year = 2004
}
Popovic and Ney, 2004;
Pianta, Emanuele and Bentivogli, Luisa (2004):
Knowledge Intensive Word Alignment with KNOWA, Proceedings of Coling 2004
@inproceedings{Pianta:2004,
author = {Pianta, Emanuele and Bentivogli, Luisa},
title = {Knowledge Intensive Word Alignment with {KNOWA} },
url = {
http://acl.ldc.upenn.edu/coling2004/MAIN/pdf/156-851.pdf},
googlescholar = {9458498707488230785},
booktitle = {Proceedings of Coling 2004 },
month = {Aug 23--Aug 27},
address = {Geneva, Switzerland},
publisher = {COLING},
pages = {1086--1092},
year = 2004
}
Pianta and Bentivogli, 2004), interpolating lemma and word alignment models
Zhang, Ruiqiang and Sumita, Eiichiro (2007):
Boosting Statistical Machine Translation by Lemmatization and Linear Interpolation, Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions
@InProceedings{zhang-sumita:2007:PosterDemo,
author = {Zhang, Ruiqiang and Sumita, Eiichiro},
title = {Boosting Statistical Machine Translation by Lemmatization and Linear Interpolation},
booktitle = {Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {181--184},
url = {
http://www.aclweb.org/anthology/P/P07/P07-2046},
year = 2007
}
(Zhang and Sumita, 2007), as well as smoothing
Moore, Robert C. (2004):
Improving IBM Word Alignment Model 1, Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume
@inproceedings{Moore:2004,
author = {Moore, Robert C.},
title = {Improving IBM Word Alignment Model 1},
url = {
http://acl.ldc.upenn.edu/acl2004/main/pdf/352\_pdf\_2-col.pdf},
googlescholar = {15225310021912992123},
booktitle = {Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume},
month = {July},
address = {Barcelona, Spain},
pages = {518--525},
year = 2004
}
(Moore, 2004). Mixture models for word translation probabilities have been explored to automatically learn topic-dependent translation models
Zhao, Bing and Xing, Eric P. (2006):
BiTAM: Bilingual Topic AdMixture Models for Word Alignment, Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
@InProceedings{zhao-xing:2006:POS,
author = {Zhao, Bing and Xing, Eric P.},
title = {Bi{TAM}: Bilingual Topic {A}d{M}ixture Models for Word Alignment},
booktitle = {Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {969--976},
url = {
http://www.aclweb.org/anthology/P/P06/P06-2124},
year = 2006
}
(Zhao and Xing, 2006;
Jorge Civera and Alfons Juan (2006):
Mixtures of IBM Model 2, Proceedings of the 11th Conference of the European Association for Machine Translation (EAMT)
@InProceedings{Civera:2006b:EAMT,
author = {Jorge Civera and Alfons Juan},
title = {Mixtures of {IBM} Model 2},
url = {
http://www.dsic.upv.es/~ajuan/research/2006/Juan06\_06a.pdf},
googlescholar = {11022975119241749772},
booktitle = {Proceedings of the 11th Conference of the European Association for Machine Translation (EAMT)},
month = {June},
address = {Oslo, Norway},
year = 2006
}
Civera and Juan, 2006). Packing words that typically occur in many-to-one alignments into a single token may improve alignment quality
Ma, Yanjun and Stroppa, Nicolas and Way, Andy (2007):
Bootstrapping Word Alignment via Word Packing, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics
@InProceedings{ma-stroppa-way:2007:ACLMain,
author = {Ma, Yanjun and Stroppa, Nicolas and Way, Andy},
title = {Bootstrapping Word Alignment via Word Packing},
booktitle = {Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {304--311},
url = {
http://www.aclweb.org/anthology/P/P07/P07-1039},
year = 2007
}
(Ma et al., 2007).
Benchmarks
Discussion
Related Topics
New Publications
Eyigöz, Elif and Gildea, Daniel and Oflazer, Kemal (2013):
Multi-Rate HMMs for Word Alignment, Proceedings of the Eighth Workshop on Statistical Machine Translation
@InProceedings{eyigoz-gildea-oflazer:2013:WMT,
author = {Eyig\"{o}z, Elif and Gildea, Daniel and Oflazer, Kemal},
title = {{Multi-Rate} {HMMs} for Word Alignment},
booktitle = {Proceedings of the Eighth Workshop on Statistical Machine Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {494--502},
url = {
http://www.aclweb.org/anthology/W13-2262},
year = 2013
}
Eyigöz et al. (2013)
Schulz, Philip and Aziz, Wilker (2016):
Fast Collocation-Based Bayesian HMM Word Alignment, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
@InProceedings{schulz-aziz:2016:COLING,
author = {Schulz, Philip and Aziz, Wilker},
title = {Fast Collocation-Based Bayesian HMM Word Alignment},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {3146--3155},
url = {
http://aclweb.org/anthology/C16-1296},
year = 2016
}
Schulz and Aziz (2016)
Dyer, Chris and Chahuneau, Victor and Smith, Noah A. (2013):
A Simple, Fast, and Effective Reparameterization of IBM Model 2, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{dyer-chahuneau-smith:2013:NAACL-HLT,
author = {Dyer, Chris and Chahuneau, Victor and Smith, Noah A.},
title = {A Simple, Fast, and Effective Reparameterization of IBM Model 2},
booktitle = {Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Atlanta, Georgia},
publisher = {Association for Computational Linguistics},
pages = {644--648},
url = {
http://www.aclweb.org/anthology/N13-1073},
year = 2013
}
Dyer et al. (2013)
Gal, Yarin and Blunsom, Phil (2013):
A Systematic Bayesian Treatment of the IBM Alignment Models, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{gal-blunsom:2013:NAACL-HLT,
author = {Gal, Yarin and Blunsom, Phil},
title = {A Systematic Bayesian Treatment of the IBM Alignment Models},
booktitle = {Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Atlanta, Georgia},
publisher = {Association for Computational Linguistics},
pages = {969--977},
url = {
http://www.aclweb.org/anthology/N13-1117},
year = 2013
}
Gal and Blunsom (2013)
Simion, Andrei and Collins, Michael and Stein, Cliff (2013):
A Convex Alternative to IBM Model 2, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
@InProceedings{simion-collins-stein:2013:EMNLP,
author = {Simion, Andrei and Collins, Michael and Stein, Cliff},
title = {A Convex Alternative to {IBM} Model 2},
booktitle = {Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Seattle, Washington, USA},
publisher = {Association for Computational Linguistics},
pages = {1574--1583},
url = {
http://www.aclweb.org/anthology/D13-1164},
year = 2013
}
Simion et al. (2013)
Simion, Andrei and Collins, Michael and Stein, Cliff (2014):
Some Experiments with a Convex IBM Model 2, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers
@InProceedings{simion-collins-stein:2014:EACL2014-SP,
author = {Simion, Andrei and Collins, Michael and Stein, Cliff},
title = {Some Experiments with a Convex IBM Model 2},
booktitle = {Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {180--184},
url = {
http://www.aclweb.org/anthology/E14-4035},
year = 2014
}
Simion et al. (2014)
Schoenemann, Thomas (2013):
Training Nondeficient Variants of IBM-3 and IBM-4 for Word Alignment, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{schoenemann:2013:ACL2013,
author = {Schoenemann, Thomas},
title = {Training Nondeficient Variants of IBM-3 and IBM-4 for Word Alignment},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {22--31},
url = {
http://www.aclweb.org/anthology/P13-1003},
year = 2013
}
Schoenemann (2013)
Gelling, Douwe and Cohn, Trevor (2014):
Simple extensions and POS Tags for a reparameterised IBM Model 2, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{gelling-cohn:2014:P14-2,
author = {Gelling, Douwe and Cohn, Trevor},
title = {Simple extensions and POS Tags for a reparameterised IBM Model 2},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {150--154},
url = {
http://www.aclweb.org/anthology/P14-2025},
year = 2014
}
Gelling and Cohn (2014)
Vaswani, Ashish and Huang, Liang and Chiang, David (2012):
Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{vaswani-huang-chiang:2012:ACL2012,
author = {Vaswani, Ashish and Huang, Liang and Chiang, David},
title = {Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm},
booktitle = {Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {July},
address = {Jeju Island, Korea},
publisher = {Association for Computational Linguistics},
pages = {311--319},
url = {
http://www.aclweb.org/anthology/P12-1033},
year = 2012
}
Vaswani et al. (2012)
Riley, Darcey and Gildea, Daniel (2012):
Improving the IBM Alignment Models Using Variational Bayes, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{riley-gildea:2012:ACL2012short,
author = {Riley, Darcey and Gildea, Daniel},
title = {Improving the IBM Alignment Models Using Variational Bayes},
booktitle = {Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {July},
address = {Jeju Island, Korea},
publisher = {Association for Computational Linguistics},
pages = {306--310},
url = {
http://www.aclweb.org/anthology/P12-2060},
year = 2012
}
Riley and Gildea (2012)
Sujith Ravi and Kevin Knight (2010):
Squibs: Does GIZA++ Make Search Errors?, Computational Linguistics
@Article{CL:2010-3001,
author = {Sujith Ravi and Kevin Knight},
title = {Squibs: Does {GIZA++} Make Search Errors?},
journal = {Computational Linguistics},
volume = {36},
number = {3},
url = {
http://aclweb.org/anthology-new/J/J10/J10-3001.pdf},
year = 2010
}
Ravi and Knight (2010)
Christer Samuelsson (2012):
HAL: Challenging Three Key Aspects of IBM-style Statistical Machine Translation, Proceedings of the Tenth Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2012-Samuelsson,
author = {Christer Samuelsson},
title = {{HAL}: Challenging Three Key Aspects of {IBM}-style Statistical Machine Translation},
url = {
http://www.mt-archive.info/AMTA-2012-Samuelsson.pdf},
booktitle = {Proceedings of the Tenth Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {San Diego, California},
year = 2012
}
Samuelsson (2012)
Brunning, Jamie and de Gispert, Adrià and Byrne, William (2009):
Context-Dependent Alignment Models for Statistical Machine Translation, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
@InProceedings{brunning-degispert-byrne:2009:NAACLHLT09,
author = {Brunning, Jamie and de Gispert, Adri\`{a} and Byrne, William},
title = {Context-Dependent Alignment Models for Statistical Machine Translation},
booktitle = {Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
month = {June},
address = {Boulder, Colorado},
publisher = {Association for Computational Linguistics},
pages = {110--118},
url = {
http://www.aclweb.org/anthology/N/N09/N09-1013},
year = 2009
}
Brunning et al. (2009)
Gao, Qin and Bach, Nguyen and Vogel, Stephan (2010):
A Semi-Supervised Word Alignment Algorithm with Partial Manual Alignments, Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
@InProceedings{gao-bach-vogel:2010:WMT,
author = {Gao, Qin and Bach, Nguyen and Vogel, Stephan},
title = {A Semi-Supervised Word Alignment Algorithm with Partial Manual Alignments},
booktitle = {Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {1--10},
url = {
http://www.aclweb.org/anthology/W10-1701},
year = 2010
}
Gao et al. (2010)
Schoenemann, Thomas (2010):
Computing Optimal Alignments for the IBM-3 Translation Model, Proceedings of the Fourteenth Conference on Computational Natural Language Learning
@InProceedings{schoenemann:2010:CONLL,
author = {Schoenemann, Thomas},
title = {Computing Optimal Alignments for the {IBM}-3 Translation Model},
booktitle = {Proceedings of the Fourteenth Conference on Computational Natural Language Learning},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {98--106},
url = {
http://www.aclweb.org/anthology/W10-2913},
year = 2010
}
Schoenemann (2010)
Toutanova, Kristina and Galley, Michel (2011):
Why Initialization Matters for IBM Model 1: Multiple Optima and Non-Strict Convexity, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{toutanova-galley:2011:ACL-HLT2011,
author = {Toutanova, Kristina and Galley, Michel},
title = {Why Initialization Matters for IBM Model 1: Multiple Optima and Non-Strict Convexity},
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {461--466},
url = {
http://www.aclweb.org/anthology/P11-2081},
year = 2011
}
Toutanova and Galley (2011)
Lopez, Adam and Resnik, Philip (2005):
Improved HMM Alignment Models for Languages with Scarce Resources, Proceedings of the ACL Workshop on Building and Using Parallel Texts
@InProceedings{lopez-resnik:2005:WPT,
author = {Lopez, Adam and Resnik, Philip},
title = {Improved {HMM} Alignment Models for Languages with Scarce Resources},
booktitle = {Proceedings of the ACL Workshop on Building and Using Parallel Texts},
month = {June},
address = {Ann Arbor, Michigan},
publisher = {Association for Computational Linguistics},
pages = {83--86},
url = {
http://www.aclweb.org/anthology/W/W05/W05-0812},
year = 2005
}
Lopez and Resnik (2005)