General Introduction
The field of statistical machine translation concerns itself with methods to automatically learn how to translate from translated texts (so-called parallel corpora).
Introduction and its 7 sub-topics are the main subject of 981 publications.
Publications
W. John Hutchins (2007):
Machine translation: a concise history, Computer aided translation: theory and practice
@inproceedings{Hutchins:2007,
author = {W. John Hutchins},
title = {Machine translation: a concise history},
booktitle = {Computer aided translation: theory and practice},
url = {
http://www.hutchinsweb.me.uk/CUHK-2006.pdf},
year = 2007
}
Hutchins (2007) gives a concise overview of the history of machine translation.
Frederick Jelinek (2009):
ACL Lifetime Achievement Award: The Dawn of Statistical ASR and MT, Computational Linguistics
@Article{CL:J09-4004,
author = {Frederick Jelinek},
title = {{ACL} Lifetime Achievement Award: The Dawn of Statistical {ASR} and {MT}},
journal = {Computational Linguistics},
volume = {35},
pages = {483--494},
number = {4},
url = {
http://aclweb.org/anthology-new/J/J09/J09-4004.pdf},
year = 2009
}
Jelinek (2009) recalls the birth of statistical machine translation, and previously statistical speech recognition, at IBM. See also the famous ALPAC report
John R. Pierce and John B. Carroll (1966):
Languages and Machines --- Computers in Translation and Linguistics @techreport{ALPAC,
author = {John R. Pierce and John B. Carroll},
title = {Languages and Machines --- Computers in Translation and Linguistics},
institution = {Automatic Language Processing Advisory Committe (ALPAC), National Academy of Sciences},
location = {Washington, DC},
url = {
http://www.mt-archive.info/ALPAC-1966.pdf},
year = 1966
}
(Pierce and Carroll, 1966).
Federico Gaspari and W. John Hutchins (2007):
Online and Free! Ten Years of Online Machine Translation: Origins, Developments, Current Use and Future Prospects, Proceedings of the MT Summit XI
@inproceedings{Gaspari:2007:MTSummit,
author = {Federico Gaspari and W. John Hutchins},
title = {Online and Free! {T}en Years of Online Machine Translation: Origins, Developments, Current Use and Future Prospects},
url = {
http://hutchinsweb.me.uk/MTS-2007.pdf},
googlescholar = {17061656689508028453},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
Gaspari and Hutchins (2007) reports on the recent rise of online machine translation services and usage patterns.
Recently, a textbook about the field was published
(Koehn, 2010). A survey of work in statistical machine translation is presented by
Adam Lopez (2008):
Statistical Machine Translation, ACM Computing Surveys
@article{lopez-survey,
author = {Adam Lopez},
title = {Statistical Machine Translation},
url = {
http://www.cs.jhu.edu/~alopez/papers/survey.pdf},
googlescholar = {13327711981648149476},
journal = {ACM Computing Surveys},
volume = {40},
number = {3},
year = 2008
}
Lopez (2008). For non-statistical methods to machine translation, refer to the books by
Arnold et al. (1994) and by
Hutchins and Somers (1992).
A good introduction into probability theory and information is given by
Cover and Thomas (1991). For an application of probabilistic methods to the related field of speech recognition, see the book by
Jelinek (1998).
There are several textbooks on natural language processing that may serve as background to the material presented here. Good general introductions are given by
Manning and Schütze (1999) as well as
Jurafsky and Martin (2008).
Benchmarks
Each year, a few evaluation campaigns are staged whose aim is to assess the validity of novel methods in competitive systems.
Discussion
New Publications
Sara Morrissey and Andy Way (2013):
Manual labour: tackling machine translation for sign languages, Machine Translation
mentioned in Introduction and Sign Language@article{mtj13-Morrissey,
author = {Sara Morrissey and Andy Way},
title = {Manual labour: tackling machine translation for sign languages},
pages = {25--64},
journal = {Machine Translation},
volume = {27},
number = {1},
month = {March},
year = 2013
}
Morrissey and Way (2013)
Adam Lopez and Matt Post and Chris Callison-Burch and Jonathan Weese and Juri Ganitkevitch and Narges Ahmidi and Olivia Buzek and Leah Hanson and Beaniesh Jamil and Matthias Lee and Ya-Ting Lin and Henry Pao and Fatima Rivera and Leili Shahriyari and Debu Sinha and Adam Teichert and Stephen Wampler and Michael Weinberger and Daguang Xu and Lin Yang and Shang Zhao (2013):
Learning to translate with products of novices: a suite of open-ended challenge problems for teaching MT, Transactions of the Association for Computational Linguistics (TACL)
@inproceedings{tacl13-lopez,
author = {Adam Lopez and Matt Post and Chris Callison-Burch and Jonathan Weese and Juri Ganitkevitch and Narges Ahmidi and Olivia Buzek and Leah Hanson and Beaniesh Jamil and Matthias Lee and Ya-Ting Lin and Henry Pao and Fatima Rivera and Leili Shahriyari and Debu Sinha and Adam Teichert and Stephen Wampler and Michael Weinberger and Daguang Xu and Lin Yang and Shang Zhao},
title = {Learning to translate with products of novices: a suite of open-ended challenge problems for teaching MT},
number = {1},
month = {May},
pages = {165--178},
url = {
http://www.transacl.org/wp-content/uploads/2013/05/paper165.pdf},
booktitle = {Transactions of the Association for Computational Linguistics (TACL)},
year = 2013
}
Lopez et al. (2013)
Adam Lopez and Matt Post and Chris Callison-Burch and Jonathan Weese and Juri Ganitkevitch and Narges Ahmidi and Olivia Buzek and Leah Hanson and Beaniesh Jamil and Matthias Lee and Ya-Ting Lin and Henry Pao and Fatima Rivera and Leili Shahriyari and Debu Sinha and Adam Teichert and Stephen Wampler and Michael Weinberger and Daguang Xu and Lin Yang and Shang Zhao (2013):
Learning to translate with products of novices: a suite of open-ended challenge problems for teaching MT, Transactions of the Association for Computational Linguistics (TACL)
@inproceedings{tacl13-lopez,
author = {Adam Lopez and Matt Post and Chris Callison-Burch and Jonathan Weese and Juri Ganitkevitch and Narges Ahmidi and Olivia Buzek and Leah Hanson and Beaniesh Jamil and Matthias Lee and Ya-Ting Lin and Henry Pao and Fatima Rivera and Leili Shahriyari and Debu Sinha and Adam Teichert and Stephen Wampler and Michael Weinberger and Daguang Xu and Lin Yang and Shang Zhao},
title = {Learning to translate with products of novices: a suite of open-ended challenge problems for teaching MT},
number = {1},
month = {May},
pages = {165--178},
url = {
http://www.transacl.org/wp-content/uploads/2013/05/paper165.pdf},
booktitle = {Transactions of the Association for Computational Linguistics (TACL)},
year = 2013
}
Lopez et al. (2013)
Cancedda, Nicola (2012):
Private Access to Phrase Tables for Statistical Machine Translation, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{cancedda:2012:ACL2012short,
author = {Cancedda, Nicola},
title = {Private Access to Phrase Tables for Statistical Machine Translation},
booktitle = {Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {July},
address = {Jeju Island, Korea},
publisher = {Association for Computational Linguistics},
pages = {23--27},
url = {
http://www.aclweb.org/anthology/P12-2005},
year = 2012
}
Cancedda (2012)
Harold L. Somers (1992):
Current research in Machine Translation, Machine Translation
@article{MTJ:1992:Somers,
author = {Harold L. Somers},
title = {Current research in Machine Translation},
pages = {231--246},
journal = {Machine Translation},
volume = {7},
number = {4},
month = {December},
year = 1992
}
Somers (1992)
Kenneth W. Church and Eduard H. Hovy (1993):
Good applications for crummy machine translation, Machine Translation
@article{MTJ:1993:Church,
author = {Kenneth W. Church and Eduard H. Hovy},
title = {Good applications for crummy machine translation},
url = {
http://www.isi.edu/natural-language/people/hovy/papers/93churchhovy.pdf},
googlescholar = {5948636947149730470},
pages = {239--258},
journal = {Machine Translation},
volume = {8},
number = {4},
month = {December},
year = 1993
}
Church and Hovy (1993)