Word-Based Models

Word-based models were the first models for statistical machine translation. They are built around the translation of individual words.

The topic Word-Based Models and its 13 sub-topics are the main subject of 395 publications.

Topics in Word-Based Models

Publications

The initial approach to statistical machine translation led to the development of the IBM Models (Brown et al., 1988; Brown et al., 1990; Brown et al., 1993). A popular implementation of the training of these models is GIZA++ (Och and Ney, 2000), which is still used for word alignment as an initial training step of more complex models.

Peter F. Brown, John Cocke, Stephen A. Della-Pietra, Vincent J. Della-Pietra, Frederick Jelinek, Robert L. Mercer and Paul Rossin (1988): A Statistical Approach to Language Translation, Proceedings of the International Conference on Computational Linguistics (COLING)

Peter F. Brown, John Cocke, Stephen A. Della-Pietra, Vincent J. Della-Pietra, Frederick Jelinek, John D. Lafferty, Robert L. Mercer and Paul Rossin (1990): A Statistical Approach to Machine Translation, Computational Linguistics

Peter F. Brown, Stephen A. Della-Pietra, Vincent J. Della-Pietra and Robert L. Mercer (1993): The Mathematics of Statistical Machine Translation, Computational Linguistics

Franz Josef Och and Hermann Ney (2000): Improved Statistical Alignment Models, Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL)

Benchmarks

Discussion

None of the currently competitive machine translation systems is a word-based model, but the underlying principles, such as generative modelling and the use of the expectation maximization algorithm, remain core methods today. Moreover, word alignment based on word-based models is usually the first step in training more complex models.
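To make the role of expectation maximization in word alignment concrete, here is a minimal sketch of EM training for a lexical translation table in the style of IBM Model 1. It is an illustration under stated assumptions, not the GIZA++ implementation: the function names `train_model1` and `align`, the `NULL` token string, and the fixed iteration count are all hypothetical choices made for this example.

```python
from collections import defaultdict

NULL = "<NULL>"  # hypothetical name for the empty source word


def train_model1(corpus, iterations=10):
    """Estimate t(target_word | source_word) with EM.

    corpus: list of (source_tokens, target_tokens) sentence pairs.
    Returns a dict-like table keyed by (target_word, source_word).
    """
    # Every source sentence gets a NULL word that unaligned target words can use.
    pairs = [([NULL] + list(src), list(tgt)) for src, tgt in corpus]
    tgt_vocab = {w for _, tgt in pairs for w in tgt}
    uniform = 1.0 / len(tgt_vocab)
    t = defaultdict(lambda: uniform)  # uniform initialisation

    for _ in range(iterations):
        count = defaultdict(float)  # expected counts for (target, source) pairs
        total = defaultdict(float)  # expected counts per source word
        # E-step: distribute each target word over the source words
        # in proportion to the current translation probabilities.
        for src, tgt in pairs:
            for w_t in tgt:
                norm = sum(t[(w_t, w_s)] for w_s in src)
                for w_s in src:
                    frac = t[(w_t, w_s)] / norm
                    count[(w_t, w_s)] += frac
                    total[w_s] += frac
        # M-step: renormalise the expected counts into probabilities.
        t = defaultdict(
            float,
            {(w_t, w_s): c / total[w_s] for (w_t, w_s), c in count.items()},
        )
    return t


def align(src, tgt, t):
    """For each target word, return the index of its most likely source word.

    Index 0 refers to NULL; index i >= 1 refers to src[i - 1].
    """
    src = [NULL] + list(src)
    return [max(range(len(src)), key=lambda i: t[(w_t, src[i])]) for w_t in tgt]
```

Calling `train_model1` on a small list of tokenized sentence pairs yields a table whose entries for each source word sum to one, and `align` then reads off the highest-probability link for each target word. The later IBM Models add distortion and fertility parameters on top of this lexical table, which this sketch does not attempt to cover.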

MT Research Survey Wiki

A Comprehensive Survey of Neural and Statistical Machine Translation Research Publications
