Context Features
Standard phrase-based models condition the generation of an output phrase solely on its aligned input phrase. However, a wider source context may give additional guidance.
Context Features is the main subject of 26 publications. 13 are discussed here.
Publications
Phrase translation may be informed by additional context features, for instance by applying methods used in word sense disambiguation
Marine Carpuat and Yihai Shen and Xiaofeng Yu and Dekai Wu (2006):
Toward integrating word sense and entity disambiguation into statistical machine translation, Proc. of the International Workshop on Spoken Language Translation
@inproceedings{Carpuat:2006:IWSLT,
author = {Marine Carpuat and Yihai Shen and Xiaofeng Yu and Dekai Wu},
title = {Toward integrating word sense and entity disambiguation into statistical machine translation},
url = {http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_06/papers/slt6\_037.pdf},
googlescholar = {11373792371528455811},
month = {November},
booktitle = {Proc. of the International Workshop on Spoken Language Translation},
address = {Kyoto, Japan},
year = 2006
}
(Carpuat et al., 2006). Such features may be integrated using a maximum entropy model
Srinivas Bangalore and Stephan Kanthak and Patrick Haffner (2006):
Finite-state transducer-based statistical machine translation using joint probabilities, Proc. of the International Workshop on Spoken Language Translation
@inproceedings{Bangalore:2006:IWSLT,
author = {Srinivas Bangalore and Stephan Kanthak and Patrick Haffner},
title = {Finite-state transducer-based statistical machine translation using joint probabilities},
url = {http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_06/papers/slt6\_016.pdf},
googlescholar = {18287607466151575450},
month = {November},
booktitle = {Proc. of the International Workshop on Spoken Language Translation},
address = {Kyoto, Japan},
year = 2006
}
(Bangalore et al., 2006), support vector machines
Giménez, Jesús and Màrquez, Lluís (2007):
Context-aware Discriminative Phrase Selection for Statistical Machine Translation, Proceedings of the Second Workshop on Statistical Machine Translation
@InProceedings{gimenez-marquez:2007:WMT1,
author = {Gim\'{e}nez, Jes\'{u}s and M\`{a}rquez, Llu\'{i}s},
title = {Context-aware Discriminative Phrase Selection for Statistical Machine Translation},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {159--166},
url = {http://www.aclweb.org/anthology/W/W07/W07-0219},
year = 2007
}
(Giménez and Màrquez, 2007), or by directly integrating more complex word sense disambiguation components, such as an ensemble of different machine learning methods
Carpuat, Marine and Wu, Dekai (2007):
Improving Statistical Machine Translation Using Word Sense Disambiguation, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)
@InProceedings{carpuat-wu:2007:EMNLP-CoNLL2007,
author = {Carpuat, Marine and Wu, Dekai},
title = {Improving Statistical Machine Translation Using Word Sense Disambiguation},
booktitle = {Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)},
pages = {61--72},
url = {http://www.aclweb.org/anthology/D/D07/D07-1007},
year = 2007
}
(Carpuat and Wu, 2007a;
Marine Carpuat and Dekai Wu (2007):
Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation, Proceedings of the MT Summit XI
@inproceedings{Carpuat:2007:MTSummit,
author = {Marine Carpuat and Dekai Wu},
title = {Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation},
url = {http://www2.cse.ust.hk/~dekai/library/WU\_Dekai/CarpuatWu\_MTSummit2007.pdf},
googlescholar = {17911931138967811487},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
Carpuat and Wu, 2007b).
Ittycheriah, Abraham and Roukos, Salim (2007):
Direct Translation Model 2, Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference
@InProceedings{ittycheriah-roukos:2007:main,
author = {Ittycheriah, Abraham and Roukos, Salim},
title = {Direct Translation Model 2},
booktitle = {Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference},
month = {April},
address = {Rochester, New York},
publisher = {Association for Computational Linguistics},
pages = {57--64},
url = {http://www.aclweb.org/anthology/N/N07/N07-1008},
year = 2007
}
Ittycheriah and Roukos (2007) propose a maximum entropy model for phrase translation.
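As a rough illustration of how a maximum entropy model can make phrase translation depend on source context, the sketch below scores each candidate translation by summing the weights of its active features and normalises with a softmax. It is not the model of any paper cited here: the function name, the feature templates, and the hand-set weights are all assumptions made for the example, and real weights would be trained on data.

```python
import math

def maxent_phrase_probs(candidates, context, weights):
    """Return p(target | source phrase, context) as a softmax over summed feature weights."""
    scores = {}
    for target in candidates:
        # One bias feature per candidate, plus one feature per (context word, candidate) pair.
        feats = [("bias", target)] + [("ctx", word, target) for word in context]
        scores[target] = sum(weights.get(f, 0.0) for f in feats)
    # Subtract the maximum score before exponentiating for numerical stability.
    top = max(scores.values())
    z = sum(math.exp(s - top) for s in scores.values())
    return {t: math.exp(s - top) / z for t, s in scores.items()}

# Hypothetical weights for translating English "bank" into German.
WEIGHTS = {
    ("bias", "Bank"): 0.2,
    ("ctx", "river", "Ufer"): 1.5,
    ("ctx", "money", "Bank"): 1.5,
}

probs_river = maxent_phrase_probs(["Bank", "Ufer"], ["the", "river"], WEIGHTS)
probs_money = maxent_phrase_probs(["Bank", "Ufer"], ["money", "in"], WEIGHTS)
```

With these toy weights, the context word "river" shifts probability towards "Ufer", while "money" shifts it towards "Bank".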
Syntactic context dependencies may be added to phrase translations in the phrase-based approach, for instance verb-argument relationships
Hwang, Young-Sook and Sasaki, Yutaka (2005):
Context-Dependent SMT Model using Bilingual Verb-Noun Collocation, Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05)
@InProceedings{hwang-sasaki:2005:ACL,
author = {Hwang, Young-Sook and Sasaki, Yutaka},
title = {Context-Dependent {SMT} Model using Bilingual Verb-Noun Collocation},
booktitle = {Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05)},
month = {June},
address = {Ann Arbor, Michigan},
publisher = {Association for Computational Linguistics},
pages = {549--556},
url = {http://www.aclweb.org/anthology/P/P05/P05-1068},
year = 2005
}
(Hwang and Sasaki, 2005), or the syntactic structure underlying each phrase translation
Sun, Jiadong and Zhao, Tiejun and Liang, Huashen (2007):
Meta-Structure Transformation Model for Statistical Machine Translation, Proceedings of the Second Workshop on Statistical Machine Translation
@InProceedings{sun-zhao-liang:2007:WMT,
author = {Sun, Jiadong and Zhao, Tiejun and Liang, Huashen},
title = {Meta-Structure Transformation Model for Statistical Machine Translation},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {64--71},
url = {http://www.aclweb.org/anthology/W/W07/W07-0209},
year = 2007
}
(Sun et al., 2007).
Gimpel, Kevin and Smith, Noah A. (2008):
Rich Source-Side Context for Statistical Machine Translation, Proceedings of the Third Workshop on Statistical Machine Translation
@InProceedings{gimpel-smith:2008:WMT,
author = {Gimpel, Kevin and Smith, Noah A.},
title = {Rich Source-Side Context for Statistical Machine Translation},
booktitle = {Proceedings of the Third Workshop on Statistical Machine Translation},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {9--17},
url = {http://www.aclweb.org/anthology/W/W08/W08-0302},
year = 2008
}
Gimpel and Smith (2008) add features around the context of a source phrase into a probabilistic back-off model.
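The general idea of backing off from a context-specific estimate to a context-free one can be sketched as follows. This is a deliberately simple hard back-off with a count threshold, not the formulation of Gimpel and Smith (2008); the function name, the `min_count` parameter, and the toy counts are assumptions made for the example.

```python
def backoff_prob(target, source, context, ctx_counts, base_counts, min_count=2):
    """Estimate p(target | source, context), backing off to p(target | source)."""
    ctx = ctx_counts.get((source, context), {})
    ctx_total = sum(ctx.values())
    if ctx_total >= min_count:
        # Enough evidence for this context: use the context-specific relative frequency.
        return ctx.get(target, 0) / ctx_total
    # Otherwise fall back to the context-free relative frequency.
    base = base_counts.get(source, {})
    base_total = sum(base.values())
    return base.get(target, 0) / base_total if base_total else 0.0

# Hypothetical counts of how often English "bank" was translated as each German word.
CTX_COUNTS = {("bank", "river"): {"Ufer": 3, "Bank": 1}}
BASE_COUNTS = {"bank": {"Bank": 8, "Ufer": 2}}

seen = backoff_prob("Ufer", "bank", "river", CTX_COUNTS, BASE_COUNTS)
unseen = backoff_prob("Ufer", "bank", "loan", CTX_COUNTS, BASE_COUNTS)
```

Here `seen` uses the counts observed with the context word "river", while `unseen` falls back to the context-free counts because "loan" was never observed with this source phrase.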
Chan, Yee Seng and Ng, Hwee Tou and Chiang, David (2007):
Word Sense Disambiguation Improves Statistical Machine Translation, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics
@InProceedings{chan-ng-chiang:2007:ACLMain,
author = {Chan, Yee Seng and Ng, Hwee Tou and Chiang, David},
title = {Word Sense Disambiguation Improves Statistical Machine Translation},
booktitle = {Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {33--40},
url = {http://www.aclweb.org/anthology/P/P07/P07-1005},
year = 2007
}
Chan et al. (2007) use a support vector machine to include context features for rules in a hierarchical translation model. Maximum entropy models may be used for the same purpose
Xiong, Deyi and Zhang, Min and Aw, Aiti and Li, Haizhou (2008):
Linguistically Annotated BTG for Statistical Machine Translation, Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)
mentioned in Context Features and Reordering Models
@InProceedings{xiong-EtAl:2008:PAPERS,
author = {Xiong, Deyi and Zhang, Min and Aw, Aiti and Li, Haizhou},
title = {Linguistically Annotated {BTG} for Statistical Machine Translation},
booktitle = {Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)},
month = {August},
address = {Manchester, UK},
publisher = {Coling 2008 Organizing Committee},
pages = {1009--1016},
url = {http://www.aclweb.org/anthology/C08-1127},
year = 2008
}
(Xiong et al., 2008;
He, Zhongjun and Liu, Qun and Lin, Shouxun (2008):
Improving Statistical Machine Translation using Lexicalized Rule Selection, Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)
@InProceedings{he-liu-lin:2008:PAPERS,
author = {He, Zhongjun and Liu, Qun and Lin, Shouxun},
title = {Improving Statistical Machine Translation using Lexicalized Rule Selection},
booktitle = {Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)},
month = {August},
address = {Manchester, UK},
publisher = {Coling 2008 Organizing Committee},
pages = {321--328},
url = {http://www.aclweb.org/anthology/C08-1041},
year = 2008
}
He et al., 2008).
Liu, Zhanyi and Wang, Haifeng and Wu, Hua and Li, Sheng (2010):
Improving Statistical Machine Translation with Monolingual Collocation, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
mentioned in Word Alignment Based On Coocurrence and Context Features
@InProceedings{liu-EtAl:2010:ACL,
author = {Liu, Zhanyi and Wang, Haifeng and Wu, Hua and Li, Sheng},
title = {Improving Statistical Machine Translation with Monolingual Collocation},
booktitle = {Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {825--833},
url = {http://www.aclweb.org/anthology/P10-1085},
year = 2010
}
Liu et al. (2010) use collocation information as a feature for each phrase pair, which suggests that collocated words make better phrases.
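One simple way to turn collocation strength into a phrase-level feature is to average the pointwise mutual information (PMI) of the word pairs inside a phrase, estimated from sentence-level co-occurrence in a monolingual corpus. The sketch below uses PMI only as a stand-in; it is not the collocation model of Liu et al. (2010), and the function names and the toy corpus are assumptions made for the example.

```python
import math
from collections import Counter
from itertools import combinations

def sentence_counts(corpus):
    """Count in how many sentences each word, and each unordered word pair, occurs."""
    word_df, pair_df = Counter(), Counter()
    for sent in corpus:
        types = sorted(set(sent))
        word_df.update(types)
        pair_df.update(combinations(types, 2))
    return word_df, pair_df

def collocation_score(phrase, corpus):
    """Average PMI over co-occurring word pairs in the phrase (0.0 if none co-occur)."""
    word_df, pair_df = sentence_counts(corpus)
    n = len(corpus)
    pmis = []
    for a, b in combinations(sorted(set(phrase)), 2):
        joint = pair_df[(a, b)]
        if joint == 0:
            continue  # skip pairs never seen together
        # PMI = log( p(a, b) / (p(a) * p(b)) ) with sentence-level relative frequencies.
        pmis.append(math.log(joint * n / (word_df[a] * word_df[b])))
    return sum(pmis) / len(pmis) if pmis else 0.0

# Toy monolingual corpus (hypothetical).
CORPUS = [
    ["strong", "tea"],
    ["strong", "tea"],
    ["powerful", "computer"],
    ["powerful", "tea"],
    ["strong", "computer"],
]

strong_tea = collocation_score(["strong", "tea"], CORPUS)
powerful_tea = collocation_score(["powerful", "tea"], CORPUS)
```

In this toy corpus "strong tea" receives a positive score and "powerful tea" a negative one, so a feature of this kind would favour phrase pairs whose words tend to occur together.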
Benchmarks
Discussion
Related Topics
New Publications
Servan, Christophe and Dymetman, Marc (2015):
Adaptation par enrichissement terminologique en traduction automatique statistique fondée sur la génération et le filtrage de bi-segments virtuels, 22ème Conférence sur le Traitement Automatique des Langues Naturelles
mentioned in Domain Adaptation, Context Features and Unknown Words
@inproceedings{servan:hal-01157850,
author = {Servan, Christophe and Dymetman, Marc},
title = {{Adaptation par enrichissement terminologique en traduction automatique statistique fond{\'e}e sur la g{\'e}n{\'e}ration et le filtrage de bi-segments virtuels}},
url = {https://hal.archives-ouvertes.fr/hal-01157850},
booktitle = {{22{\`e}me Conf{\'e}rence sur le Traitement Automatique des Langues Naturelles}},
address = {Caen, France},
organization = {{ATALA}},
month = {Jun},
keywords = {Traduction Automatique Statistique ; G{\'e}n{\'e}ration Automatique de Texte ; contexte phrastique ; terminologie bilingue},
pdf = {https://hal.archives-ouvertes.fr/hal-01157850/file/terminology-final-version.pdf},
hal_id = {hal-01157850},
hal_version = {v1},
year = 2015
}
Servan and Dymetman (2015)
Darlene Stewart and Roland Kuhn and Eric Joanis and George Foster (2014):
Coarse split and lump bilingual language models for richer source information in SMT, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Stewart,
author = {Darlene Stewart and Roland Kuhn and Eric Joanis and George Foster},
title = {Coarse split and lump bilingual language models for richer source information in SMT},
pages = {28--41},
url = {http://www.mt-archive.info/10/AMTA-2014-Stewart.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Stewart et al. (2014)
Thanh-Le Ha and Jan Niehues and Alex Waibel (2014):
Lexical Translation Model Using A Deep Neural Network Architecture, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
mentioned in Lexical Choice, Context Features and Embeddings
@inproceedings{Ha:iwslt:2014,
author = {Thanh-Le Ha and Jan Niehues and Alex Waibel},
title = {Lexical Translation Model Using A Deep Neural Network Architecture},
pages = {223--229},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2014
}
Ha et al. (2014)
Rejwanul Haque and Sudip Kumar Naskar and Antal van den Bosch and Andy Way (2011):
Integrating source-language context into phrase-based statistical machine translation, Machine Translation
@article{MTJ:2011:Haque,
author = {Rejwanul Haque and Sudip Kumar Naskar and Antal van den Bosch and Andy Way},
title = {Integrating source-language context into phrase-based statistical machine translation},
pages = {239--285},
journal = {Machine Translation},
volume = {25},
number = {3},
month = {September},
year = 2011
}
Haque et al. (2011)
Rejwanul Haque and Sudip Naskar and Antal van den Bosch and Andy Way (2010):
Supertags as Source Language Context in Hierarchical Phrase-Based SMT, Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas
@inproceedings{AMTA-2010-Haque,
author = {Rejwanul Haque and Sudip Naskar and Antal van den Bosch and Andy Way},
title = {Supertags as Source Language Context in Hierarchical Phrase-Based {SMT}},
url = {http://www.mt-archive.info/AMTA-2010-Haque.pdf},
booktitle = {Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas},
location = {Denver, Colorado},
year = 2010
}
Haque et al. (2010)
Liu, Qun and He, Zhongjun and Liu, Yang and Lin, Shouxun (2008):
Maximum Entropy based Rule Selection Model for Syntax-based Statistical Machine Translation, Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing
@InProceedings{liu-EtAl:2008:EMNLP1,
author = {Liu, Qun and He, Zhongjun and Liu, Yang and Lin, Shouxun},
title = {Maximum Entropy based Rule Selection Model for Syntax-based Statistical Machine Translation},
booktitle = {Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Honolulu, Hawaii},
publisher = {Association for Computational Linguistics},
pages = {89--97},
url = {http://www.aclweb.org/anthology/D08-1010},
year = 2008
}
Liu et al. (2008)
Ralf Brown (2008):
Exploiting Document-Level Context for Data-Driven Machine Translation, Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{amta08:Brown,
author = {Ralf Brown},
title = {Exploiting Document-Level Context for Data-Driven Machine Translation},
url = {http://www.mt-archive.info/AMTA-2008-Brown.pdf},
googlescholar = {12669379868044810415},
pages = {46--55},
booktitle = {Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Waikiki, Hawaii},
year = 2008
}
Brown (2008)
Shen, Libin and Xu, Jinxi and Zhang, Bing and Matsoukas, Spyros and Weischedel, Ralph (2009):
Effective Use of Linguistic and Contextual Information for Statistical Machine Translation, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
@InProceedings{shen-EtAl:2009:EMNLP,
author = {Shen, Libin and Xu, Jinxi and Zhang, Bing and Matsoukas, Spyros and Weischedel, Ralph},
title = {Effective Use of Linguistic and Contextual Information for Statistical Machine Translation},
booktitle = {Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing},
month = {August},
address = {Singapore},
publisher = {Association for Computational Linguistics},
pages = {72--80},
url = {http://www.aclweb.org/anthology/D/D09/D09-1008},
year = 2009
}
Shen et al. (2009)
Huang, Fei and Xiang, Bing (2010):
Feature-Rich Discriminative Phrase Rescoring for SMT, Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
@InProceedings{huang-xiang:2010:PAPERS,
author = {Huang, Fei and Xiang, Bing},
title = {Feature-Rich Discriminative Phrase Rescoring for SMT},
booktitle = {Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {492--500},
url = {http://www.aclweb.org/anthology/C10-1056},
year = 2010
}
Huang and Xiang (2010)
Andrade, Daniel and Nasukawa, Tetsuya and Tsujii, Junichi (2010):
Robust Measurement and Comparison of Context Similarity for Finding Translation Pairs, Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
@InProceedings{andrade-nasukawa-tsujii:2010:PAPERS,
author = {Andrade, Daniel and Nasukawa, Tetsuya and Tsujii, Junichi},
title = {Robust Measurement and Comparison of Context Similarity for Finding Translation Pairs},
booktitle = {Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {19--27},
url = {http://www.aclweb.org/anthology/C10-1003},
year = 2010
}
Andrade et al. (2010)
Onishi, Takashi and Utiyama, Masao and Sumita, Eiichiro (2011):
Reordering Constraint Based on Document-Level Context, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{onishi-utiyama-sumita:2011:ACL-HLT2011,
author = {Onishi, Takashi and Utiyama, Masao and Sumita, Eiichiro},
title = {Reordering Constraint Based on Document-Level Context},
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {434--438},
url = {http://www.aclweb.org/anthology/P11-2076},
year = 2011
}
Onishi et al. (2011)
Subotin, Michael (2011):
An exponential translation model for target language morphology, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{subotin:2011:ACL-HLT2011,
author = {Subotin, Michael},
title = {An exponential translation model for target language morphology},
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {230--238},
url = {http://www.aclweb.org/anthology/P11-1024},
year = 2011
}
Subotin (2011)
Gong, Zhengxian and Zhang, Min and Zhou, Guodong (2011):
Cache-based Document-level Statistical Machine Translation, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
@InProceedings{gong-zhang-zhou:2011:EMNLP,
author = {Gong, Zhengxian and Zhang, Min and Zhou, Guodong},
title = {Cache-based Document-level Statistical Machine Translation},
booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Edinburgh, Scotland, UK.},
publisher = {Association for Computational Linguistics},
pages = {909--919},
url = {http://www.aclweb.org/anthology/D11-1084},
year = 2011
}
Gong et al. (2011)