Morphology
Languages with rich morphology which increases vocabulary size and, on the target size, requires the enforcement of agreement constraints.
Morphology is the main subject of 51 publications. 17 are discussed here.
Publications
Statistical models for morphology as part of a statistical machine translation system have been developed for inflected languages
Sonja Nießen and Hermann Ney (2001):
Toward hierarchical models for statistical machine translation of inflected languages, Workshop on Data-Driven Machine Translation at 39th Annual Meeting of the Association of Computational Linguistics (ACL)
@InProceedings{Niessen:2001,
author = {Sonja Nie{\ss}en and Hermann Ney},
title = {Toward hierarchical models for statistical machine translation of inflected languages},
url = {
http://acl.ldc.upenn.edu/W/W01/W01-1407.pdf},
googlescholar = {7768974274543154986},
booktitle = {Workshop on Data-Driven Machine Translation at 39th Annual Meeting of the Association of Computational Linguistics (ACL)},
pages = {47--54},
year = 2001
}
(Nießen and Ney, 2001;
Sonja Nießen and Hermann Ney (2004):
Statistical Machine Translation with Scarce Resources Using Morpho-syntactic Information, Computational Linguistics
@Article{Niessen:CL:2004,
author = {Sonja Nie{\ss}en and Hermann Ney},
title = {Statistical Machine Translation with Scarce Resources Using Morpho-syntactic Information},
url = {
http://acl.ldc.upenn.edu/J/J04/J04-2003.pdf},
googlescholar = {9752569724083448031},
journal = {Computational Linguistics},
volume = {30},
number = {2},
year = 2004
}
Nießen and Ney, 2004). Morphological features may be replaced by pseudo-words
Goldwater, Sharon and McClosky, David (2005):
Improving Statistical MT through Morphological Analysis, Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
@InProceedings{goldwater-mcclosky:2005:HLTEMNLP,
author = {Goldwater, Sharon and Mc{C}losky, David},
title = {Improving Statistical {MT} through Morphological Analysis},
booktitle = {Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Vancouver, British Columbia, Canada},
publisher = {Association for Computational Linguistics},
pages = {676--683},
url = {
http://www.aclweb.org/anthology/H/H05/H05-1085},
year = 2005
}
(Goldwater and McClosky, 2005). Morphological annotation is especially useful for small training corpora
Popović, Maja and Vilar, David and Ney, Hermann and Jovičić, Slobodan and Šarić, Zoran (2005):
Augmenting a Small Parallel Text with Morpho-Syntactic Language, Proceedings of the ACL Workshop on Building and Using Parallel Texts
@InProceedings{popovic-EtAl:2005:WPT,
author = {Popovi\'{c}, Maja and Vilar, David and Ney, Hermann and Jovi\v{c}i\'{c}, Slobodan and \v{S}ari\'{c}, Zoran},
title = {Augmenting a Small Parallel Text with Morpho-Syntactic Language},
booktitle = {Proceedings of the ACL Workshop on Building and Using Parallel Texts},
month = {June},
address = {Ann Arbor, Michigan},
publisher = {Association for Computational Linguistics},
pages = {41--48},
url = {
http://www.aclweb.org/anthology/W/W05/W05-0806},
year = 2005
}
(Popović et al., 2005). For highly agglutinative languages such as Arabic, the main focus is splitting off affixes with various schemes
Habash, Nizar and Sadat, Fatiha (2006):
Arabic Preprocessing Schemes for Statistical Machine Translation, Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
@InProceedings{habash-sadat:2006:HLT-NAACL06-Short,
author = {Habash, Nizar and Sadat, Fatiha},
title = {Arabic Preprocessing Schemes for Statistical Machine Translation},
booktitle = {Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers},
month = {June},
address = {New York City, USA},
publisher = {Association for Computational Linguistics},
pages = {49--52},
url = {
http://www.aclweb.org/anthology/N/N06/N06-2013},
year = 2006
}
(Habash and Sadat, 2006), or by combination of such schemes
Sadat, Fatiha and Habash, Nizar (2006):
Combination of Arabic Preprocessing Schemes for Statistical Machine Translation, Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics
@InProceedings{sadat-habash:2006:COLACL,
author = {Sadat, Fatiha and Habash, Nizar},
title = {Combination of {A}rabic Preprocessing Schemes for Statistical Machine Translation},
booktitle = {Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {1--8},
url = {
http://www.aclweb.org/anthology/P/P06/P06-1001},
year = 2006
}
(Sadat and Habash, 2006). A straight-forward application of the morpheme splitting idea may not always lead to performance gains
Sami Virpioja and Jaakko J. Väyrynen and Mathias Creutz and Markus Sadeniemi (2007):
Morphology-Aware Statistical Machine Translation Based on Morphs Induced in an Unsupervised Manner, Proceedings of the MT Summit XI
@inproceedings{Virpioja:2007:MTSummit,
author = {Sami Virpioja and Jaakko J. Väyrynen and Mathias Creutz and Markus Sadeniemi},
title = {Morphology-Aware Statistical Machine Translation Based on Morphs Induced in an Unsupervised Manner},
url = {
http://users.ics.aalto.fi/jjvayryn/online-papers/Virpioja07MTSummitXI.pdf},
googlescholar = {14073059966140459108},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
(Virpioja et al., 2007).
Mei Yang and Katrin Kirchhoff (2006):
Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages, Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics
@InProceedings{Yang:2006:EACL,
author = {Mei Yang and Katrin Kirchhoff},
title = {Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages},
url = {
http://acl.ldc.upenn.edu/eacl2006/main/papers/02\_3\_yangkirchhoff\_273.pdf},
googlescholar = {10348611392153477702},
booktitle = {Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Trento, Italy},
year = 2006
}
Yang and Kirchhoff (2006) present a back-off method that resolves unknown words by increasingly aggressive morphological stemming and compound splitting.
Etienne Denoual (2007):
Analogical translation of unknown words in a statistical machine translation framework, Proceedings of the MT Summit XI
@inproceedings{Denoual:2007:MTSummit,
author = {Etienne Denoual},
title = {Analogical translation of unknown words in a statistical machine translation framework},
url = {
http://www.mt-archive.info/MTS-2007-Denoual.pdf},
googlescholar = {17474921433138152083},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
Denoual (2007) uses spelling similarity to find the translation of unknown words by analogy.
Talbot, David and Osborne, Miles (2006):
Modelling Lexical Redundancy for Machine Translation, Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics
@InProceedings{talbot-osborne:2006:COLACL,
author = {Talbot, David and Osborne, Miles},
title = {Modelling Lexical Redundancy for Machine Translation},
booktitle = {Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {969--976},
url = {
http://www.aclweb.org/anthology/P/P06/P06-1122},
year = 2006
}
Talbot and Osborne (2006) motivate similar work by reducing redundancy in the input language, again mostly by morphological stemming. Lemmatizing words may improve word alignment performance
Simon Corston-Oliver and Michael Gamon (2004):
Normalizing German and English inflectional morphology to improve statistical word alignment, Proceedings of the 6th Conference of the Association for Machine Translation in the Americas (AMTA 2004)
@inproceedings{corston-oliver:2004:AMTA,
author = {Simon Corston-Oliver and Michael Gamon},
title = {Normalizing {German} and {English} inflectional morphology to improve statistical word alignment},
url = {
http://research.microsoft.com/pubs/68977/morphanalysis.amta.pdf},
googlescholar = {9409928250709207073},
booktitle = {Proceedings of the 6th Conference of the Association for Machine Translation in the Americas (AMTA 2004)},
pages = {48--57},
year = 2004
}
(Corston-Oliver and Gamon, 2004). Using the frequency of stems in the corpus in a finite state approach may guide when to split
El Isbihani, Anas and Khadivi, Shahram and Bender, Oliver and Ney, Hermann (2006):
Morpho-syntactic Arabic Preprocessing for Arabic to English Statistical Machine Translation, Proceedings on the Workshop on Statistical Machine Translation
@InProceedings{elisbihani-EtAl:2006:WMT,
author = {El Isbihani, Anas and Khadivi, Shahram and Bender, Oliver and Ney, Hermann},
title = {Morpho-syntactic {Arabic} Preprocessing for {Arabic} to {English} Statistical Machine Translation},
booktitle = {Proceedings on the Workshop on Statistical Machine Translation},
month = {June},
address = {New York City},
publisher = {Association for Computational Linguistics},
pages = {15--22},
url = {
http://www.aclweb.org/anthology/W/W06/W06-3103},
year = 2006
}
(Isbihani et al., 2006), potentially guided by a small lexicon
Jason Riesa and David Yarowsky (2006):
Minimally Supervised Morphological Segmentation with Applications to Machine Translation, 5th Conference of the Association for Machine Translation in the Americas (AMTA)
@InProceedings{Riesa:2006:AMTA,
author = {Jason Riesa and David Yarowsky},
title = {Minimally Supervised Morphological Segmentation with Applications to Machine Translation},
url = {
http://people.csail.mit.edu/imcgraw/links/research/pubs/ama2006/papers/021.pdf},
googlescholar = {9522991093728557953},
booktitle = {5th Conference of the Association for Machine Translation in the Americas (AMTA)},
month = {August},
address = {Boston, Massachusetts},
year = 2006
}
(Riesa and Yarowsky, 2006). Splitting off affixes may also be a viable strategy when translating into morphologically rich languages
Durgar El-Kahlout, ilknur and Oflazer, Kemal (2006):
Initial Explorations in English to Turkish Statistical Machine Translation, Proceedings on the Workshop on Statistical Machine Translation
@InProceedings{durgarelkahlout-oflazer:2006:WMT,
author = {Durgar El-Kahlout, ilknur and Oflazer, Kemal},
title = {Initial Explorations in {English} to {Turkish} Statistical Machine Translation},
booktitle = {Proceedings on the Workshop on Statistical Machine Translation},
month = {June},
address = {New York City},
publisher = {Association for Computational Linguistics},
pages = {7--14},
url = {
http://www.aclweb.org/anthology/W/W06/W06-3102},
year = 2006
}
(El-Kahlout and Oflazer, 2006).
Different morphological analyses may be encoded in a confusion network as input to the translation system to avoid hard choices
Dyer, Christopher J. (2007):
The "Noisier Channel": Translation from Morphologically Complex Languages, Proceedings of the Second Workshop on Statistical Machine Translation
mentioned in Research Groups and Morphology@InProceedings{dyer:2007:WMT,
author = {Dyer, Christopher J.},
title = {The "Noisier Channel": Translation from Morphologically Complex Languages},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {207--211},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0229},
year = 2007
}
(Dyer, 2007;
Christopher J. Dyer (2007):
The University of Maryland Translation System for IWSLT 2007, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
mentioned in Research Groups and Morphology@inproceedings{Dyer:2007:IWSLT,
author = {Christopher J. Dyer},
title = {The {U}niversity of {M}aryland Translation System for {IWSLT} 2007},
url = {
http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_07/papers/slt7\_180.pdf},
googlescholar = {14689354752478914583},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2007
}
Dyer, 2007b). This approach may be extended to lattice decoding and tree-based models
Dyer, Christopher J. and Muresan, Smaranda and Resnik, Philip (2008):
Generalizing Word Lattice Translation, Proceedings of ACL-08: HLT
@InProceedings{dyer-muresan-resnik:2008:ACLMain,
author = {Dyer, Christopher J. and Muresan, Smaranda and Resnik, Philip},
title = {Generalizing Word Lattice Translation},
booktitle = {Proceedings of ACL-08: HLT},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {1012--1020},
url = {
http://www.aclweb.org/anthology/P/P08/P08-1115},
year = 2008
}
(Dyer et al., 2008).
Benchmarks
Discussion
Related Topics
New Publications
Salameh, Mohammad and Cherry, Colin and Kondrak, Grzegorz (2014):
Lattice Desegmentation for Statistical Machine Translation, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{salameh-cherry-kondrak:2014:P14-1,
author = {Salameh, Mohammad and Cherry, Colin and Kondrak, Grzegorz},
title = {Lattice Desegmentation for Statistical Machine Translation},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {100--110},
url = {
http://www.aclweb.org/anthology/P14-1010},
year = 2014
}
Salameh et al. (2014)
Joachim Daiber and Khalil Sima'an (2015):
Machine translation with source-predicted target morphology, Machine Translation Summit XV
@inproceedings{MTS2015-Daiber,
author = {Joachim Daiber and Khalil Sima'an},
title = {Machine translation with source-predicted target morphology},
url = {
http://www.mt-archive.info/15/MTS-2015-Daiber.pdf},
pages = {283--296},
booktitle = {Machine Translation Summit XV},
year = 2015
}
Daiber and Sima'an (2015)
Grönroos, Stig-Arne and Virpioja, Sami and Kurimo, Mikko (2015):
Tuning Phrase-Based Segmented Translation for a Morphologically Complex Target Language, Proceedings of the Tenth Workshop on Statistical Machine Translation
@InProceedings{gronroos-virpioja-kurimo:2015:WMT,
author = {Gr\"{o}nroos, Stig-Arne and Virpioja, Sami and Kurimo, Mikko},
title = {Tuning Phrase-Based Segmented Translation for a Morphologically Complex Target Language},
booktitle = {Proceedings of the Tenth Workshop on Statistical Machine Translation},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {105--111},
url = {
http://aclweb.org/anthology/W15-3010},
year = 2015
}
Grönroos et al. (2015)
Grönroos, Stig-Arne and Virpioja, Sami and Kurimo, Mikko (2016):
Hybrid Morphological Segmentation for Phrase-Based Machine Translation, Proceedings of the First Conference on Machine Translation
@InProceedings{gronroos-virpioja-kurimo:2016:WMT,
author = {Gr\"{o}nroos, Stig-Arne and Virpioja, Sami and Kurimo, Mikko},
title = {Hybrid Morphological Segmentation for Phrase-Based Machine Translation},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {289--295},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2312},
year = 2016
}
Grönroos et al. (2016)
Ramm, Anita and Fraser, Alexander (2016):
Modeling verbal inflection for English to German SMT, Proceedings of the First Conference on Machine Translation
@InProceedings{ramm-fraser:2016:WMT,
author = {Ramm, Anita and Fraser, Alexander},
title = {Modeling verbal inflection for {English} to {German} SMT},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {21--31},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2203},
year = 2016
}
Ramm and Fraser (2016)
Kunchukuttan, Anoop and Bhattacharyya, Pushpak (2016):
Orthographic Syllable as basic unit for SMT between Related Languages, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
@InProceedings{kunchukuttan-bhattacharyya:2016:EMNLP2016,
author = {Kunchukuttan, Anoop and Bhattacharyya, Pushpak},
title = {Orthographic Syllable as basic unit for {SMT} between Related Languages},
booktitle = {Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing},
month = {November},
address = {Austin, Texas},
publisher = {Association for Computational Linguistics},
pages = {1912--1917},
url = {
https://aclweb.org/anthology/D16-1196},
year = 2016
}
Kunchukuttan and Bhattacharyya (2016)
Huck, Matthias and Tamchyna, Aleš and Bojar, Ondřej and Fraser, Alexander (2017):
Producing Unseen Morphological Variants in Statistical Machine Translation, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
@InProceedings{huck-EtAl:2017:EACLshort,
author = {Huck, Matthias and Tamchyna, Ale\v{s} and Bojar, Ond\v{r}ej and Fraser, Alexander},
title = {Producing Unseen Morphological Variants in Statistical Machine Translation},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {369--375},
url = {
http://www.aclweb.org/anthology/E17-2059},
year = 2017
}
Huck et al. (2017)
Salameh, Mohammad and Cherry, Colin and Kondrak, Grzegorz (2015):
What Matters Most in Morphologically Segmented SMT Models?, Proceedings of the Ninth Workshop on Syntax, Semantics and Structure in Statistical Translation
@InProceedings{salameh-cherry-kondrak:2015:SSST-9,
author = {Salameh, Mohammad and Cherry, Colin and Kondrak, Grzegorz},
title = {What Matters Most in Morphologically Segmented {SMT} Models?},
booktitle = {Proceedings of the Ninth Workshop on Syntax, Semantics and Structure in Statistical Translation},
month = {June},
address = {Denver, Colorado, USA},
publisher = {Association for Computational Linguistics},
pages = {65--73},
url = {
http://www.aclweb.org/anthology/W15-1011},
year = 2015
}
Salameh et al. (2015)
Eyigöz, Elif and Gildea, Daniel and Oflazer, Kemal (2013):
Simultaneous Word-Morpheme Alignment for Statistical Machine Translation, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{eyigoz-gildea-oflazer:2013:NAACL-HLT,
author = {Eyig\"{o}z, Elif and Gildea, Daniel and Oflazer, Kemal},
title = {Simultaneous Word-Morpheme Alignment for Statistical Machine Translation},
booktitle = {Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Atlanta, Georgia},
publisher = {Association for Computational Linguistics},
pages = {32--40},
url = {
http://www.aclweb.org/anthology/N13-1004},
year = 2013
}
Eyigöz et al. (2013)
Cholakov, Kostadin and Kordoni, Valia (2014):
Better Statistical Machine Translation through Linguistic Treatment of Phrasal Verbs, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
@InProceedings{cholakov-kordoni:2014:EMNLP2014,
author = {Cholakov, Kostadin and Kordoni, Valia},
title = {Better Statistical Machine Translation through Linguistic Treatment of Phrasal Verbs},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {196--201},
url = {
http://www.aclweb.org/anthology/D14-1024},
year = 2014
}
Cholakov and Kordoni (2014)
Williams, Philip and Koehn, Philipp (2014):
Using Feature Structures to Improve Verb Translation in English-to-German Statistical MT, Proceedings of the 3rd Workshop on Hybrid Approaches to Machine Translation (HyTra)
@InProceedings{williams-koehn:2014:HyTra,
author = {Williams, Philip and Koehn, Philipp},
title = {Using Feature Structures to Improve Verb Translation in English-to-German Statistical MT},
booktitle = {Proceedings of the 3rd Workshop on Hybrid Approaches to Machine Translation (HyTra)},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {21--29},
url = {
http://www.aclweb.org/anthology/W14-1005},
year = 2014
}
Williams and Koehn (2014)
Weller, Marion and Kisselew, Max and Smekalova, Svetlana and Fraser, Alexander and Schmid, Helmut and Durrani, Nadir and Sajjad, Hassan and Farkas, Richárd (2013):
Munich-Edinburgh-Stuttgart Submissions at WMT13: Morphological and Syntactic Processing for SMT, Proceedings of the Eighth Workshop on Statistical Machine Translation
@InProceedings{weller-EtAl:2013:WMT,
author = {Weller, Marion and Kisselew, Max and Smekalova, Svetlana and Fraser, Alexander and Schmid, Helmut and Durrani, Nadir and Sajjad, Hassan and Farkas, Rich\'{a}rd},
title = {{Munich-Edinburgh-Stuttgart} Submissions at {WMT}13: Morphological and Syntactic Processing for {SMT}},
booktitle = {Proceedings of the Eighth Workshop on Statistical Machine Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {232--239},
url = {
http://www.aclweb.org/anthology/W13-2230},
year = 2013
}
Weller et al. (2013)
Ahmed Hamdi and Rahma Boujelbane and Nizar Habash and Alexis Nasr (2013):
The Effects of Factorizing Root and Pattern Mapping in Bidirectional Tunisian--Standard-Arabic Machine Translation, Machine Translation Summit XIV
@inproceedings{MTS2013-Hamdi,
author = {Ahmed Hamdi and Rahma Boujelbane and Nizar Habash and Alexis Nasr},
title = {The Effects of Factorizing Root and Pattern Mapping in Bidirectional {Tunisian}--Standard-{A}rabic Machine Translation},
url = {
http://www.mt-archive.info/10/MTS-2013-Hamdi.pdf},
pages = {125--134},
booktitle = {Machine Translation Summit XIV},
year = 2013
}
Hamdi et al. (2013)
Carlos Ramish and Laurent Besacier and Alexander Kobzar (2013):
How hard is it to automatically translate phrasal verbs from English to French?, Workshop on Multi-word Units in Machine Translation and Translation Technologies
@inproceedings{MTS2013W4-Ramish,
author = {Carlos Ramish and Laurent Besacier and Alexander Kobzar},
title = {How hard is it to automatically translate phrasal verbs from {English} to French?},
url = {
http://www.mt-archive.info/10/MTS-2013-W4-Ramish.pdf},
pages = {53--61},
booktitle = {Workshop on Multi-word Units in Machine Translation and Translation Technologies},
year = 2013
}
Ramish et al. (2013)
Carlos Ramish and Laurent Besacier and Alexander Kobzar (2013):
How hard is it to automatically translate phrasal verbs from English to French?, Workshop on Multi-word Units in Machine Translation and Translation Technologies
@inproceedings{MTS2013W4-Ramish,
author = {Carlos Ramish and Laurent Besacier and Alexander Kobzar},
title = {How hard is it to automatically translate phrasal verbs from {English} to French?},
url = {
http://www.mt-archive.info/10/MTS-2013-W4-Ramish.pdf},
pages = {53--61},
booktitle = {Workshop on Multi-word Units in Machine Translation and Translation Technologies},
year = 2013
}
Ramish et al. (2013)
Hassan Al-Haj and Alon Lavie (2012):
The impact of Arabic morphological segmentation on broad-coverage English-to-Arabic statistical machine translation, Machine Translation
@article{MTJ:2012:AlHaj,
author = {Hassan Al-Haj and Alon Lavie},
title = {The impact of {Arabic} morphological segmentation on broad-coverage {English}-to-{Arabic} statistical machine translation},
pages = {3-24},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Al-Haj and Lavie (2012)
Saša Hasan and Saab Mansour and Hermann Ney (2012):
A comparison of segmentation methods and extended lexicon models for Arabic statistical machine translation, Machine Translation
@article{MTJ:2012:Hasan,
author = {Sa\v{s}a Hasan and Saab Mansour and Hermann Ney},
title = {A comparison of segmentation methods and extended lexicon models for {Arabic} statistical machine translation},
pages = {47-65},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Hasan et al. (2012)
Ahmed El Kholy and Nizar Habash (2012):
Orthographic and morphological processing for English-Arabic statistical machine translation, Machine Translation
@article{MTJ:2012:Kholy,
author = {Ahmed El Kholy and Nizar Habash},
title = {Orthographic and morphological processing for {English}-{Arabic} statistical machine translation},
url = {
http://www.mt-archive.info/TALN-2010-ElKholy.pdf},
googlescholar = {17108904403760687858},
pages = {25-45},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Kholy and Habash (2012)
Hassan Al-Haj and Alon Lavie (2012):
The impact of Arabic morphological segmentation on broad-coverage English-to-Arabic statistical machine translation, Machine Translation
@article{MTJ:2012:AlHaj,
author = {Hassan Al-Haj and Alon Lavie},
title = {The impact of {Arabic} morphological segmentation on broad-coverage {English}-to-{Arabic} statistical machine translation},
pages = {3-24},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Al-Haj and Lavie (2012)
Saša Hasan and Saab Mansour and Hermann Ney (2012):
A comparison of segmentation methods and extended lexicon models for Arabic statistical machine translation, Machine Translation
@article{MTJ:2012:Hasan,
author = {Sa\v{s}a Hasan and Saab Mansour and Hermann Ney},
title = {A comparison of segmentation methods and extended lexicon models for {Arabic} statistical machine translation},
pages = {47-65},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Hasan et al. (2012)
Ahmed El Kholy and Nizar Habash (2012):
Orthographic and morphological processing for English-Arabic statistical machine translation, Machine Translation
@article{MTJ:2012:Kholy,
author = {Ahmed El Kholy and Nizar Habash},
title = {Orthographic and morphological processing for {English}-{Arabic} statistical machine translation},
url = {
http://www.mt-archive.info/TALN-2010-ElKholy.pdf},
googlescholar = {17108904403760687858},
pages = {25-45},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Kholy and Habash (2012)
Zhiyang Wang and Yajuan Lü and Qun Liu (2011):
Multi-granularity Word Alignment and Decoding for Agglutinative Language Translation, Proceedings of the 13th Machine Translation Summit (MT Summit XIII)
@inproceedings{MTS-2011-Wang,
author = {Zhiyang Wang and Yajuan L{\"u} and Qun Liu},
title = {Multi-granularity Word Alignment and Decoding for Agglutinative Language Translation},
url = {
http://www.mt-archive.info/MTS-2011-Wang.pdf},
pages = {360-367},
booktitle = {Proceedings of the 13th Machine Translation Summit (MT Summit XIII)},
publisher = {International Association for Machine Translation},
location = {Xiamen, China},
year = 2011
}
Wang et al. (2011)
A. El Kholy and Nizar Habash (2012):
Translate, Predict or Generate: Modeling Rich Morphology in Statistical Machine Translation, Proceedings of th 16th International Conference of the European Association for Machine Translation (EAMT)
@inproceedings{EAMT-2012-ElKholy,
author = {A. El Kholy and Nizar Habash},
title = {Translate, Predict or Generate: Modeling Rich Morphology in Statistical Machine Translation},
url = {
http://www.mt-archive.info/EAMT-2012-ElKholy},
pages = {27-34},
booktitle = {Proceedings of th 16th International Conference of the European Association for Machine Translation (EAMT)},
location = {Trento, Italy},
editor = {Mauro Cettolo and Marcello Federico and Lucia Specia and Andy Way},
year = 2012
}
Kholy and Habash (2012)
N. Singh and Nizar Habash (2012):
Hebrew Morphological Preprocessing for Statistical Machine Translation, Proceedings of th 16th International Conference of the European Association for Machine Translation (EAMT)
@inproceedings{EAMT-2012-Singh,
author = {N. Singh and Nizar Habash},
title = {Hebrew Morphological Preprocessing for Statistical Machine Translation},
url = {
http://www.mt-archive.info/EAMT-2012-Singh},
pages = {43-50},
booktitle = {Proceedings of th 16th International Conference of the European Association for Machine Translation (EAMT)},
location = {Trento, Italy},
editor = {Mauro Cettolo and Marcello Federico and Lucia Specia and Andy Way},
year = 2012
}
Singh and Habash (2012)
Stallard, David and Devlin, Jacob and Kayser, Michael and Lee, Yoong Keok and Barzilay, Regina (2012):
Unsupervised Morphology Rivals Supervised Morphology for Arabic MT, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{stallard-EtAl:2012:ACL2012short,
author = {Stallard, David and Devlin, Jacob and Kayser, Michael and Lee, Yoong Keok and Barzilay, Regina},
title = {Unsupervised Morphology Rivals Supervised Morphology for Arabic MT},
booktitle = {Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {July},
address = {Jeju Island, Korea},
publisher = {Association for Computational Linguistics},
pages = {322--327},
url = {
http://www.aclweb.org/anthology/P12-2063},
year = 2012
}
Stallard et al. (2012)
Carpuat, Marine (2009):
Toward Using Morphology in French-English Phrase-Based SMT, Proceedings of the Fourth Workshop on Statistical Machine Translation
@InProceedings{carpuat:2009:WMT-09,
author = {Carpuat, Marine},
title = {Toward Using Morphology in {F}rench-{E}nglish Phrase-Based {SMT}},
booktitle = {Proceedings of the Fourth Workshop on Statistical Machine Translation},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {150--154},
url = {
http://www.aclweb.org/anthology/W/W09/W09-0427},
year = 2009
}
Carpuat (2009)
Fraser, Alexander (2009):
Experiments in Morphosyntactic Processing for Translating to and from German, Proceedings of the Fourth Workshop on Statistical Machine Translation
@InProceedings{fraser:2009:WMT-09,
author = {Fraser, Alexander},
title = {Experiments in Morphosyntactic Processing for Translating to and from {G}erman},
booktitle = {Proceedings of the Fourth Workshop on Statistical Machine Translation},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {115--119},
url = {
http://www.aclweb.org/anthology/W/W09/W09-0420},
year = 2009
}
Fraser (2009)
Arianna Bisazza and Marcello Federico (2009):
Morphological pre-processing for Turkish to English statistical machine translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{IWSLT:2009:Bisazza,
author = {Arianna Bisazza and Marcello Federico},
title = {Morphological pre-processing for {T}urkish to {E}nglish statistical machine translation},
url = {
http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_09/papers/slt9\_129.pdf},
googlescholar = {11760787701945451587},
pages = {129--135},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Tokyo, Japan},
month = {December},
year = 2009
}
Bisazza and Federico (2009)
Luong, Minh-Thang and Nakov, Preslav and Kan, Min-Yen (2010):
A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
@InProceedings{luong-nakov-kan:2010:EMNLP,
author = {Luong, Minh-Thang and Nakov, Preslav and Kan, Min-Yen},
title = {A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages},
booktitle = {Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Cambridge, MA},
publisher = {Association for Computational Linguistics},
pages = {148--157},
url = {
http://www.aclweb.org/anthology/D/D10/D10-1015},
year = 2010
}
Luong et al. (2010)
Hong, Gumwon and Lee, Seung-Wook and Rim, Hae-Chang (2009):
Bridging Morpho-Syntactic Gap between Source and Target Sentences for English-Korean Statistical Machine Translation, Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
@InProceedings{hong-lee-rim:2009:Short,
author = {Hong, Gumwon and Lee, Seung-Wook and Rim, Hae-Chang},
title = {Bridging Morpho-Syntactic Gap between Source and Target Sentences for English-Korean Statistical Machine Translation},
booktitle = {Proceedings of the ACL-IJCNLP 2009 Conference Short Papers},
month = {August},
address = {Suntec, Singapore},
publisher = {Association for Computational Linguistics},
pages = {233--236},
url = {
http://www.aclweb.org/anthology/P/P09/P09-2059},
year = 2009
}
Hong et al. (2009)
Ilknur Durgar El-Kahlout and François Yvon (2010):
The pay-offs of preprocessing for German-English Statistical Machine Translation, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt10:TP:el-kahlout,
author = {Ilknur {Durgar El-Kahlout} and Fran\c{c}ois Yvon},
title = {{The pay-offs of preprocessing for German-English Statistical Machine Translation}},
url = {
http://www.mt-archive.info/IWSLT-2010-El-Kahlout.pdf},
googlescholar = {17458117287538406206},
editor = {Marcello Federico and Ian Lane and Michael Paul and Fran\c{c}ois Yvon},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
pages = {251--258},
location = {Paris, France},
year = 2010
}
Durgar El-Kahlout and Yvon (2010)
Saab Mansour (2010):
MorphTagger: HMM-Based Arabic Segmentation for Statistical Machine Translation, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt10:TP:mansour,
author = {Saab Mansour},
title = {{MorphTagger: HMM-Based Arabic Segmentation for Statistical Machine Translation}},
url = {
http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_10/papers/slta\_321.pdf},
googlescholar = {11892858770769554082},
editor = {Marcello Federico and Ian Lane and Michael Paul and Fran\c{c}ois Yvon},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
pages = {321--327},
location = {Paris, France},
year = 2010
}
Mansour (2010)
Stymne, Sara and Holmqvist, Maria and Ahrenberg, Lars (2010):
Vs and OOVs: Two Problems for Translation between German and English, Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
@InProceedings{stymne-holmqvist-ahrenberg:2010:WMT,
author = {Stymne, Sara and Holmqvist, Maria and Ahrenberg, Lars},
title = {Vs and OOVs: Two Problems for Translation between {German} and English},
booktitle = {Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {189--194},
url = {
http://www.aclweb.org/anthology/W10-1728},
year = 2010
}
Stymne et al. (2010)
Virpioja, Sami and Väyrynen, Jaakko and Mansikkaniemi, Andre and Kurimo, Mikko (2010):
Applying Morphological Decompositions to Statistical Machine Translation, Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
@InProceedings{virpioja-EtAl:2010:WMT,
author = {Virpioja, Sami and V\"{a}yrynen, Jaakko and Mansikkaniemi, Andre and Kurimo, Mikko},
title = {Applying Morphological Decompositions to Statistical Machine Translation},
booktitle = {Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {201--206},
url = {
http://www.aclweb.org/anthology/W10-1730},
year = 2010
}
Virpioja et al. (2010)
Cartoni, Bruno (2009):
Lexical Morphology in Machine Translation: A Feasibility Study, Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)
@InProceedings{cartoni:2009:EACL,
author = {Cartoni, Bruno},
title = {Lexical Morphology in Machine Translation: A Feasibility Study},
booktitle = {Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {130--138},
url = {
http://www.aclweb.org/anthology/E09-1016},
year = 2009
}
Cartoni (2009)
Igor Mel'čuk and Leo Wanner (2008):
Morphological mismatches in machine translation, Machine Translation
@article{MTJ:2008:Melcuk,
author = {Igor Mel'\v{c}uk and Leo Wanner},
title = {Morphological mismatches in machine translation},
url = {
http://xa.yimg.com/kq/groups/23172933/261791657/name/Melcuk-Wanner\_Online.pdf},
googlescholar = {10416098283572959314},
pages = {101--152},
journal = {Machine Translation},
volume = {22},
number = {3},
month = {September},
year = 2008
}
Mel'čuk and Wanner (2008)
R. Mahesh K. Sinha (2007):
Using Rich Morphology in Resolving certain Hindi-English Machine Translation Divergence, Proceedings of the MT Summit XI
@inproceedings{Sinha:2007:MTSummit,
author = {R. Mahesh K. Sinha},
title = {Using Rich Morphology in Resolving certain {H}indi-{E}nglish Machine Translation Divergence},
url = {
http://www.mt-archive.info/MTS-2007-Sinha.pdf},
googlescholar = {9009762213208278530},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
Sinha (2007)
Young-Suk Lee (2004):
Morphological Analysis for Statistical Machine Translation, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)
@Inproceedings{Lee:2004,
author = {Young-Suk Lee},
title = {Morphological Analysis for Statistical Machine Translation},
url = {
http://acl.ldc.upenn.edu/N/N04/N04-4015.pdf},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)},
year = 2004
}
Lee (2004)