Domain Adaptation
Often, there is a mismatch between the domain for which training data are available and the target domain of a machine translation system. Different domains may vary by topic or text style.
Domain Adaptation is the main subject of 143 publications. 28 are discussed here.
Publications
Machine translation systems are often built for very specific domains, such as movie and television subtitles
Martin Volk and Søren Harder (2007):
Evaluating MT with Translations or Translators. What is the Difference?, Proceedings of the MT Summit XI
@inproceedings{Volk:2007:MTSummit,
author = {Martin Volk and S{\o}ren Harder},
title = {Evaluating {MT} with Translations or Translators. {W}hat is the Difference?},
url = {
http://mt-archive.info/MTS-2007-Volk.pdf},
googlescholar = {875595458994321061},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
(Volk and Harder, 2007). A translation model may be trained only from sentences in a parallel corpus that are similar to the input sentence
Almut Silja Hildebrand and Matthias Eck and Stephan Vogel and Alex Waibel (2005):
Adaptation of the translation model for statistical machine translation based on information retrieval, Proceedings of the 10th Conference of the European Association for Machine Translation (EAMT)
@InProceedings{Hildebrand:2005:EAMT,
author = {Almut Silja Hildebrand and Matthias Eck and Stephan Vogel and Alex Waibel},
title = {Adaptation of the translation model for statistical machine translation based on information retrieval},
url = {
http://isl.anthropomatik.kit.edu/cmu-kit/5\_EAMT2005\_Hildebrand(1).pdf},
googlescholar = {13931991959007327601},
booktitle = {Proceedings of the 10th Conference of the European Association for Machine Translation (EAMT)},
month = {May},
address = {Budapest},
year = 2005
}
(Hildebrand et al., 2005), which may be determined by language model perplexity
Keiji Yasuda and Ruiqiang Zhang and Hirofumi Yamamoto and Eiichiro Sumita (2008):
Method of Selecting Training Data to Build a Compact and Efficient Translation Model, Proceedings of the 3rd International Joint Conference on Natural Language Processing (IJCNLP)
@inproceedings{Yasuda:2008:IJCNLP,
author = {Keiji Yasuda and Ruiqiang Zhang and Hirofumi Yamamoto and Eiichiro Sumita},
title = {Method of Selecting Training Data to Build a Compact and Efficient Translation Model},
url = {
http://www.researchgate.net/publication/228846991\_Method\_of\_selecting\_training\_data\_to\_build\_a\_compact\_and\_efficient\_translation\_model/file/e0b495225d73564982.pdf},
googlescholar = {2562951910086143226},
booktitle = {Proceedings of the 3rd International Joint Conference on Natural Language Processing (IJCNLP)},
year = 2008
}
(Yasuda et al., 2008). Also, a domain-specific language model may be obtained by including only sentences that are similar to the ones in the target domain
Sethy, Abhinav and Georgiou, Panayiotis and Narayanan, Shrikanth (2006):
Selecting relevant text subsets from web-data for building topic specific language models, Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
@InProceedings{sethy-georgiou-narayanan:2006:HLT-NAACL06-Short,
author = {Sethy, Abhinav and Georgiou, Panayiotis and Narayanan, Shrikanth},
title = {Selecting relevant text subsets from web-data for building topic specific language models},
booktitle = {Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers},
month = {June},
address = {New York City, USA},
publisher = {Association for Computational Linguistics},
pages = {145--148},
url = {
http://www.aclweb.org/anthology/N/N06/N06-2037},
year = 2006
}
(Sethy et al., 2006). We may detect such sentences using the tf/idf method common in information retrieval and then boost the count of these sentences
Lü, Yajuan and Huang, Jin and Liu, Qun (2007):
Improving Statistical Machine Translation Performance by Training Data Selection and Optimization, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)
@InProceedings{lu-huang-liu:2007:EMNLP-CoNLL2007,
author = {L\"{u}, Yajuan and Huang, Jin and Liu, Qun},
title = {Improving Statistical Machine Translation Performance by Training Data Selection and Optimization},
booktitle = {Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)},
pages = {343--350},
url = {
http://www.aclweb.org/anthology/D/D07/D07-1036},
year = 2007
}
(Lü et al., 2007). Better still is the cross-entropy difference method of
Moore, Robert C. and Lewis, William (2010):
Intelligent Selection of Language Model Training Data, Proceedings of the ACL 2010 Conference Short Papers
@InProceedings{moore-lewis:2010:Short,
author = {Moore, Robert C. and Lewis, William},
title = {Intelligent Selection of Language Model Training Data},
booktitle = {Proceedings of the ACL 2010 Conference Short Papers},
month = {July},
address = {Uppsala, Sweden},
publisher = {Association for Computational Linguistics},
pages = {220--224},
url = {
http://www.aclweb.org/anthology/P10-2041},
year = 2010
}
(Moore and Lewis, 2010) which compares in-doman and out-of-domain language model score.
Axelrod, Amittai and He, Xiaodong and Gao, Jianfeng (2011):
Domain Adaptation via Pseudo In-Domain Data Selection, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
mentioned in Corpus Cleaning and Domain Adaptation@InProceedings{axelrod-he-gao:2011:EMNLP,
author = {Axelrod, Amittai and He, Xiaodong and Gao, Jianfeng},
title = {Domain Adaptation via Pseudo In-Domain Data Selection},
booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Edinburgh, Scotland, UK.},
publisher = {Association for Computational Linguistics},
pages = {355--362},
url = {
http://www.aclweb.org/anthology/D11-1033},
year = 2011
}
Axelrod et al. (2011);
Saab Mansour and Joern Wuebker and Hermann Ney (2011):
Combining translation and language model scoring for domain-specific data filtering, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt11:Mansour,
author = {Saab Mansour and Joern Wuebker and Hermann Ney},
title = {Combining translation and language model scoring for domain-specific data filtering},
url = {
http://www.mt-archive.info/IWSLT-2011-Mansour.pdf},
pages = {222-229},
editor = {Marcello Federico and Mei-Yuh Hwang and Margit R{\"o}dder and Sebastian St{\"u}ker},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
location = {San Francisco, USA},
year = 2011
}
Mansour et al. (2011);
Amittai Axelrod and QingJun Li and William D. Lewis (2012):
Applications of data selection via cross-entropy difference for real-world statistical machine translation, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt12:Axelrod,
author = {Amittai Axelrod and QingJun Li and William D. Lewis},
title = {Applications of data selection via cross-entropy difference for real-world statistical machine translation},
url = {
http://www.mt-archive.info/IWSLT-2012-Axelrod.pdf},
pages = {201-208},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
location = {Hong Kong},
year = 2012
}
Axelrod et al. (2012) apply this method for parallel corpora.
Duh, Kevin and Neubig, Graham and Sudoh, Katsuhito and Tsukada, Hajime (2013):
Adaptation Data Selection using Neural Language Models: Experiments in Machine Translation, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{duh-EtAl:2013:Short,
author = {Duh, Kevin and Neubig, Graham and Sudoh, Katsuhito and Tsukada, Hajime},
title = {Adaptation Data Selection using Neural Language Models: Experiments in Machine Translation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {678--683},
url = {
http://www.aclweb.org/anthology/P13-2119},
year = 2013
}
Duh et al. (2013) use a neural network language model in this approach.
Bicici, Ergun and Yuret, Deniz (2011):
Instance Selection for Machine Translation using Feature Decay Algorithms, Proceedings of the Sixth Workshop on Statistical Machine Translation
@InProceedings{bicici-yuret:2011:WMT1,
author = {Bicici, Ergun and Yuret, Deniz},
title = {Instance Selection for Machine Translation using Feature Decay Algorithms},
booktitle = {Proceedings of the Sixth Workshop on Statistical Machine Translation},
month = {July},
address = {Edinburgh, Scotland},
publisher = {Association for Computational Linguistics},
pages = {272--283},
url = {
http://www.aclweb.org/anthology/W11-2131},
year = 2011
}
Bicici and Yuret (2011) sub-sample sentence pairs whose source has most overlap with the test set. In a more graded approach, sentences may be weighted by how much match the target domain
Bach, Nguyen and Gao, Qin and Vogel, Stephan (2008):
Improving Word Alignment with Language Model Based Confidence Scores, Proceedings of the Third Workshop on Statistical Machine Translation
mentioned in Research Groups and Domain Adaptation@InProceedings{bach-gao-vogel:2008:WMT,
author = {Bach, Nguyen and Gao, Qin and Vogel, Stephan},
title = {Improving Word Alignment with Language Model Based Confidence Scores},
booktitle = {Proceedings of the Third Workshop on Statistical Machine Translation},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {151--154},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0321},
year = 2008
}
(Bach et al., 2008).
Instead of discarding some of the training data, the mixture-model approach weights domain-specific models trained on data from different domains and combines them
Foster, George and Kuhn, Roland (2007):
Mixture-Model Adaptation for SMT, Proceedings of the Second Workshop on Statistical Machine Translation
@InProceedings{foster-kuhn:2007:WMT,
author = {Foster, George and Kuhn, Roland},
title = {Mixture-Model Adaptation for {SMT}},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {128--135},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0217},
year = 2007
}
(Foster and Kuhn, 2007). Mixture models may also used to automatically cluster domains
Civera, Jorge and Juan, Alfons (2007):
Domain Adaptation in Statistical Machine Translation with Mixture Modelling, Proceedings of the Second Workshop on Statistical Machine Translation
mentioned in Research Groups and Domain Adaptation@InProceedings{civera-juan:2007:WMT,
author = {Civera, Jorge and Juan, Alfons},
title = {Domain Adaptation in Statistical Machine Translation with Mixture Modelling},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {177--180},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0222},
year = 2007
}
(Civera and Juan, 2007). Such domain specific models may be combined as components in the standard log-linear model
Koehn, Philipp and Schroeder, Josh (2007):
Experiments in Domain Adaptation for Statistical Machine Translation, Proceedings of the Second Workshop on Statistical Machine Translation
mentioned in Research Groups and Domain Adaptation@InProceedings{koehn-schroeder:2007:WMT,
author = {Koehn, Philipp and Schroeder, Josh},
title = {Experiments in Domain Adaptation for Statistical Machine Translation},
booktitle = {Proceedings of the Second Workshop on Statistical Machine Translation},
month = {June},
address = {Prague, Czech Republic},
publisher = {Association for Computational Linguistics},
pages = {224--227},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0233},
year = 2007
}
(Koehn and Schroeder, 2007) and may be build both for language models and translation models
(Schwenk and Koehn, 2008).
Rico Sennrich (2011):
Combining Multi-Engine Machine Translation and Online Learning through Dynamic Phrase Tables, Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)
@inproceedings{eamt11:Sennrich,
author = {Rico Sennrich},
title = {Combining Multi-Engine Machine Translation and Online Learning through Dynamic Phrase Tables},
url = {
http://www.mt-archive.info/EAMT-2011-Sennrich.pdf},
googlescholar = {13092809721251956818},
pages = {89--96},
booktitle = {Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)},
location = {Leuven, Belgium},
editor = {Mikel L. Forcada and Heidi Depraetere and Vincent Vandeghinste},
year = 2011
}
Sennrich (2011) builds the in-domain translation model by still basing the scoring on statistics over the full corpus. When dealing with multiple domain-specific translation models, input sentences need to be classified to the correct domain, which may done using a language model or information retrieval approach
Jia Xu and Yonggang Deng and Yuqing Gao and Hermann Ney (2007):
Domain Dependent Statistical Machine Translation, Proceedings of the MT Summit XI
@inproceedings{Xu:2007:MTSummit,
author = {Jia Xu and Yonggang Deng and Yuqing Gao and Hermann Ney},
title = {Domain Dependent Statistical Machine Translation},
url = {
http://mt-archive.info/MTS-2007-Xu.pdf},
googlescholar = {12645157097599900393},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
(Xu et al., 2007). If the classifier returns proportions on how well the input falls into some of the classes, this may be used to dynamically weight the domain models
Finch, Andrew and Sumita, Eiichiro (2008):
Dynamic Model Interpolation for Statistical Machine Translation, Proceedings of the Third Workshop on Statistical Machine Translation
@InProceedings{finch-sumita:2008:WMT,
author = {Finch, Andrew and Sumita, Eiichiro},
title = {Dynamic Model Interpolation for Statistical Machine Translation},
booktitle = {Proceedings of the Third Workshop on Statistical Machine Translation},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {208--215},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0334},
year = 2008
}
(Finch and Sumita, 2008).
Wu, Hua and Wang, Haifeng and Zong, Chengqing (2008):
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora, Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)
@InProceedings{wu-wang-zong:2008:PAPERS,
author = {Wu, Hua and Wang, Haifeng and Zong, Chengqing},
title = {Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora},
booktitle = {Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)},
month = {August},
address = {Manchester, UK},
publisher = {Coling 2008 Organizing Committee},
pages = {993--1000},
url = {
http://www.aclweb.org/anthology/C08-1125},
year = 2008
}
Wu et al. (2008) present methods that exploit domain-specific monolingual corpora and dictionaries.
Holger Schwenk (2008):
Investigations on large-scale lightly-supervised training for statistical machine translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{schwenk2:iwslt:2008,
author = {Holger Schwenk},
title = {Investigations on large-scale lightly-supervised training for statistical machine translation},
url = {
http://www-lium.univ-lemans.fr/~schwenk/papers/Schwenk.iwslt2008.pdf},
googlescholar = {9087183346661484930},
pages = {182--189},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2008
}
Schwenk (2008);
Holger Schwenk and Jean Senellart (2009):
Translation Model Adaptation for an Arabic/French News Translation System by Lightly-Supervised Training, Proceedings of the Twelfth Machine Translation Summit (MT Summit XII)
@inproceedings{MTS09:Schwenk,
author = {Holger Schwenk and Jean Senellart},
title = {Translation Model Adaptation for an {Arabic/French} News Translation System by Lightly-Supervised Training},
booktitle = {Proceedings of the Twelfth Machine Translation Summit (MT Summit XII)},
publisher = {International Association for Machine Translation},
location = {Ottawa, Ontario, Canada},
year = 2009
}
Schwenk and Senellart (2009) propose to generate in-domain training data by translating in-domain monolingual data, filtering the created artificial parallel corpus, and adding it to the baseline parallel data.
Matthias Huck and David Vilar and Daniel Stein and Hermann Ney (2011):
Advancements in Arabic-to-English Hierarchical Machine Translation, Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)
mentioned in Domain Adaptation and MT Without Word Alignment@inproceedings{eamt11:Huck,
author = {Matthias Huck and David Vilar and Daniel Stein and Hermann Ney},
title = {Advancements in {A}rabic-to-{E}nglish Hierarchical Machine Translation},
url = {
http://www-i6.informatik.rwth-aachen.de/publications/download/716/HuckMatthiasVilarDavidSteinDanielNeyHermann--AdvancementsinArabic-to-EnglishHierarchicalMachineTranslation--2011.pdf},
googlescholar = {72178570233796985},
pages = {273--289},
booktitle = {Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)},
location = {Leuven, Belgium},
editor = {Mikel L. Forcada and Heidi Depraetere and Vincent Vandeghinste},
year = 2011
}
Huck et al. (2011) confirm the effectiveness of the method. Follow-up work indicates that creating the data in the inverse direction (from monolingual target side data) gives better results
Lambert, Patrik and Schwenk, Holger and Servan, Christophe and Abdul-Rauf, Sadaf (2011):
Investigations on Translation Model Adaptation Using Monolingual Data, Proceedings of the Sixth Workshop on Statistical Machine Translation
@InProceedings{lambert-EtAl:2011:WMT,
author = {Lambert, Patrik and Schwenk, Holger and Servan, Christophe and Abdul-Rauf, Sadaf},
title = {Investigations on Translation Model Adaptation Using Monolingual Data},
booktitle = {Proceedings of the Sixth Workshop on Statistical Machine Translation},
month = {July},
address = {Edinburgh, Scotland},
publisher = {Association for Computational Linguistics},
pages = {284--293},
url = {
http://www.aclweb.org/anthology/W11-2132},
year = 2011
}
(Lambert et al., 2011), especially for morphologically rich languages
Bojar, Ondřej and Tamchyna, Aleš (2011):
Improving Translation Model by Monolingual Data, Proceedings of the Sixth Workshop on Statistical Machine Translation
@InProceedings{bojar-tamchyna:2011:WMT,
author = {Bojar, Ondřej and Tamchyna, Ale\v{s}},
title = {Improving Translation Model by Monolingual Data},
booktitle = {Proceedings of the Sixth Workshop on Statistical Machine Translation},
month = {July},
address = {Edinburgh, Scotland},
publisher = {Association for Computational Linguistics},
pages = {330--336},
url = {
http://www.aclweb.org/anthology/W11-2138},
year = 2011
}
(Bojar and Tamchyna, 2011).
Related to domain adaptation is the problem of adapting machine translation systems to different regional language uses
Cheng, Pu-Jen and Lu, Wen-Hsiang and Teng, Jei-Wen and Chien, Lee-Feng (2004):
Creating Multilingual Translation Lexicons with Regional Variations Using Web Corpora, Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume
@inproceedings{Cheng:2004,
author = {Cheng, Pu-Jen and Lu, Wen-Hsiang and Teng, Jei-Wen and Chien, Lee-Feng},
title = {Creating Multilingual Translation Lexicons with Regional Variations Using Web Corpora},
booktitle = {Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume},
month = {July},
address = {Barcelona, Spain},
pages = {534--541},
year = 2004
}
(Cheng et al., 2004).
Alexandru Ceauşfu and John Tinsley and Jian Zhang and Andy Way (2011):
Experiments on Domain Adaptation for Patent Machine Translation in the PLuTO project, Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)
@inproceedings{eamt11:Ceausfu,
author = {Alexandru Ceau\c{s}fu and John Tinsley and Jian Zhang and Andy Way},
title = {Experiments on Domain Adaptation for Patent Machine Translation in the {PLuTO} project},
pages = {21--28},
booktitle = {Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)},
location = {Leuven, Belgium},
editor = {Mikel L. Forcada and Heidi Depraetere and Vincent Vandeghinste},
year = 2011
}
Ceauşfu et al. (2011) compare different domain adaptation methods to different subjects matters in patent translation and observe small gaines over the baseline.
Benchmarks
Discussion
Related Topics
New Publications
Wang, Longyue and Wong, Derek F. and Chao, Lidia S. and Xing, Junwen and Lu, Yi and Trancoso, Isabel (2013):
Edit Distance: A New Data Selection Criterion for Domain Adaptation in SMT, Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013
@InProceedings{wang-EtAl:2013:RANLP-2013,
author = {Wang, Longyue and Wong, Derek F. and Chao, Lidia S. and Xing, Junwen and Lu, Yi and Trancoso, Isabel},
title = {Edit Distance: A New Data Selection Criterion for Domain Adaptation in SMT},
booktitle = {Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013},
month = {September},
address = {Hissar, Bulgaria},
publisher = {INCOMA Ltd. Shoumen, BULGARIA},
pages = {727--732},
url = {
http://www.aclweb.org/anthology/R13-1094},
year = 2013
}
Wang et al. (2013)
Cuong Hoang and Khalil Sima'an (2014):
Latent Domain Phrase-based Models for Adaptation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL
@inproceedings{DBLP:conf/emnlp/HoangS14,
author = {Cuong Hoang and Khalil Sima'an},
title = {Latent Domain Phrase-based Models for Adaptation},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, {EMNLP} 2014, October 25-29, 2014, Doha, Qatar, {A} meeting of SIGDAT, a Special Interest Group of the {ACL}},
pages = {566--576},
crossref = {DBLP:conf/emnlp/2014},
url = {
http://aclweb.org/anthology/D/D14/D14-1062.pdf},
timestamp = {Sat, 15 Nov 2014 14:45:18 +0100},
biburl = {
http://dblp.uni-trier.de/rec/bib/conf/emnlp/HoangS14},
bibsource = {dblp computer science bibliography,
http://dblp.org},
year = 2014
}
Hoang and Sima'an (2014)
Rosa, Rudolf and Sudarikov, Roman and Novák, Michal and Popel, Martin and Bojar, Ondřej (2016):
Dictionary-based Domain Adaptation of MT Systems without Retraining, Proceedings of the First Conference on Machine Translation
@InProceedings{rosa-EtAl:2016:WMT,
author = {Rosa, Rudolf and Sudarikov, Roman and Nov\'{a}k, Michal and Popel, Martin and Bojar, Ond\v{r}ej},
title = {Dictionary-based Domain Adaptation of {MT} Systems without Retraining},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {449--455},
url = {
http://www.aclweb.org/anthology/W/W16/W16-2334},
year = 2016
}
Rosa et al. (2016)
Nadir Durrani and Hassan Sajjad and Shafiq Joty and Ahmed Abdelali and Stephan Vogel (2015):
Using joint models or domain adaptation in statistical machine translation, Machine Translation Summit XV
@inproceedings{MTS2015-Durrani,
author = {Nadir Durrani and Hassan Sajjad and Shafiq Joty and Ahmed Abdelali and Stephan Vogel},
title = {Using joint models or domain adaptation in statistical machine translation},
url = {
http://www.mt-archive.info/15/MTS-2015-Durrani.pdf},
pages = {117--130},
booktitle = {Machine Translation Summit XV},
year = 2015
}
Durrani et al. (2015)
Gregor Thurmair (2015):
Evaluation of the domain adaptation of MT systems in ACCURAT, Proceedings of the 18th Annual Conference of the European Association for Machine Translation
@InProceedings{W15-4928,
author = {Gregor Thurmair},
title = {Evaluation of the domain adaptation of {MT} systems in ACCURAT},
booktitle = {Proceedings of the 18th Annual Conference of the European Association for Machine Translation},
month = {May},
address = {Antalya, Turkey},
url = {
http://aclweb.org/anthology/W15-4928},
editor = {\^IIknur Durgar Elâ""Kahlout and Mehmed \"Ozkan and Felipe S\'anchezâ""Mart\'inez and Gema Ram\'irezâ""S\'anchez and Fred Hollowood and Andy Way},
pages = {206--209},
year = 2015
}
Thurmair (2015)
Yo Ehara and Masao Utiyama and Eiichiro Sumita (2015):
Risk-aware distribution of SMT outputs for translation of documents targeting many anonymous readers, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{IWSLT-2015-Ehara,
author = {Yo Ehara and Masao Utiyama and Eiichiro Sumita},
title = {Risk-aware distribution of {SMT} outputs for translation of documents targeting many anonymous readers},
pages = {240-247},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Da Nang, Vietnam},
url = {
http://www.mt-archive.info/15/IWSLT-2015-ehara.pdf},
month = {December},
year = 2015
}
Ehara et al. (2015)
Ergun Biçici (2015):
Domain Adaptation for Machine Translation with Instance Selection, The Prague Bulletin of Mathematical Linguistics
@article{pbml-103-Bicici,
author = {Ergun Biçici},
title = {Domain Adaptation for Machine Translation with Instance Selection},
pages = {5--21},
journal = {The Prague Bulletin of Mathematical Linguistics},
url = {
http://ufal.mff.cuni.cz/pbml/103/art-bicici.pdf},
volume = {103},
month = {April},
year = 2015
}
Biçici (2015)
Pratyush Banerjee and Raphael Rubino and Johann Roturier and Josef van Genabith (2015):
Quality estimation-guided supplementary data selection for domain adaptation of statistical machine translation, Machine Translation
@article{MTJ:2015:Banerjee,
author = {Pratyush Banerjee and Raphael Rubino and Johann Roturier and Josef van Genabith},
title = {Quality estimation-guided supplementary data selection for domain adaptation of statistical machine translation},
pages = {77--100},
journal = {Machine Translation},
volume = {29},
number = {2},
month = {June},
year = 2015
}
Banerjee et al. (2015)
Sauleh Eetemadi and William Lewis and Kristina Toutanova and Hayder Radha (2015):
Survey of data-selection methods in statistical machine translation, Machine Translation
@article{MTJ:2015:Eetemadi,
author = {Sauleh Eetemadi and William Lewis and Kristina Toutanova and Hayder Radha},
title = {Survey of data-selection methods in statistical machine translation},
pages = {189--223},
journal = {Machine Translation},
volume = {29},
number = {3--4},
month = {December},
year = 2015
}
Eetemadi et al. (2015)
Amittai Axelrod and Yogarshi Vyas and Marianna Martindale and Marine Carpuat (2015):
Class-based N-gram language difference models for data selection, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{IWSLT-2015-Axelrod-2,
author = {Amittai Axelrod and Yogarshi Vyas and Marianna Martindale and Marine Carpuat},
title = {Class-based N-gram language difference models for data selection},
pages = {180-187},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Da Nang, Vietnam},
url = {
http://www.mt-archive.info/15/IWSLT-2015-axelrod-2.pdf},
month = {December},
year = 2015
}
Axelrod et al. (2015)
William Lewis and Christian Federmann and Ying Xin (2015):
Applying cross-entropy difference for selecting parallel training data from publicly available sources for conversational machine translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{IWSLT-2015-Lewis,
author = {William Lewis and Christian Federmann and Ying Xin},
title = {Applying cross-entropy difference for selecting parallel training data from publicly available sources for conversational machine translation},
pages = {126-134},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
location = {Da Nang, Vietnam},
url = {
http://www.mt-archive.info/15/IWSLT-2015-lewis.pdf},
month = {December},
year = 2015
}
Lewis et al. (2015)
Rabinovich, Ella and Patel, Raj Nath and Mirkin, Shachar and Specia, Lucia and Wintner, Shuly (2017):
Personalized Machine Translation: Preserving Original Author Traits, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
@InProceedings{rabinovich-EtAl:2017:EACLlong,
author = {Rabinovich, Ella and Patel, Raj Nath and Mirkin, Shachar and Specia, Lucia and Wintner, Shuly},
title = {Personalized Machine Translation: Preserving Original Author Traits},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {1074--1084},
url = {
http://www.aclweb.org/anthology/E17-1101},
year = 2017
}
Rabinovich et al. (2017)
Cuong, Hoang and Sima'an, Khalil and Titov, Ivan (2016):
Adapting to All Domains at Once: Rewarding Domain Invariance in SMT, Transactions of the Association for Computational Linguistics
@article{TACL768,
author = {Cuong, Hoang and Sima'an, Khalil and Titov, Ivan },
title = {Adapting to All Domains at Once: Rewarding Domain Invariance in SMT},
journal = {Transactions of the Association for Computational Linguistics},
volume = {4},
issn = {2307-387X},
url = {
https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/768},
pages = {99--112},
year = 2016
}
Cuong et al. (2016)
Ture, Ferhan and Boschee, Elizabeth (2016):
Learning to Translate for Multilingual Question Answering, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
@InProceedings{ture-boschee:2016:EMNLP2016,
author = {Ture, Ferhan and Boschee, Elizabeth},
title = {Learning to Translate for Multilingual Question Answering},
booktitle = {Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing},
month = {November},
address = {Austin, Texas},
publisher = {Association for Computational Linguistics},
pages = {573--584},
url = {
https://aclweb.org/anthology/D16-1055},
year = 2016
}
Ture and Boschee (2016)
Durrani, Nadir and Sajjad, Hassan and Joty, Shafiq and Abdelali, Ahmed (2016):
A Deep Fusion Model for Domain Adaptation in Phrase-based MT, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
@InProceedings{durrani-EtAl:2016:COLING,
author = {Durrani, Nadir and Sajjad, Hassan and Joty, Shafiq and Abdelali, Ahmed},
title = {A Deep Fusion Model for Domain Adaptation in Phrase-based MT},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {3177--3187},
url = {
http://aclweb.org/anthology/C16-1299},
year = 2016
}
Durrani et al. (2016)
Wang, Rui and Zhao, Hai and Lu, Bao-Liang and Utiyama, Masao and Sumita, Eiichiro (2016):
Connecting Phrase based Statistical Machine Translation Adaptation, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
@InProceedings{wang-EtAl:2016:COLING5,
author = {Wang, Rui and Zhao, Hai and Lu, Bao-Liang and Utiyama, Masao and Sumita, Eiichiro},
title = {Connecting Phrase based Statistical Machine Translation Adaptation},
booktitle = {Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers},
month = {December},
address = {Osaka, Japan},
publisher = {The COLING 2016 Organizing Committee},
pages = {3135--3145},
url = {
http://aclweb.org/anthology/C16-1295},
year = 2016
}
Wang et al. (2016)
Servan, Christophe and Dymetman, Marc (2015):
Adaptation par enrichissement terminologique en traduction automatique statistique fondée sur la génération et le filtrage de bi-segments virtuels, 22ème Conférence sur le Traitement Automatique des Langues Naturelles
mentioned in Domain Adaptation, Context Features and Unknown Words@inproceedings{servan:hal-01157850,
author = {Servan, Christophe and Dymetman, Marc},
title = {{Adaptation par enrichissement terminologique en traduction automatique statistique fond{\'e}e sur la g{\'e}n{\'e}ration et le filtrage de bi-segments virtuels}},
url = {
https://hal.archives-ouvertes.fr/hal-01157850},
booktitle = {{22{\`e}me Conf{\'e}rence sur le Traitement Automatique des Langues Naturelles}},
address = {Caen, France},
organization = {{ATALA}},
month = {Jun},
keywords = {Traduction Automatique Statistique ; G{\'e}n{\'e}ration Automatique de Texte ; contexte phrastique ; terminologie bilingue},
pdf = {
https://hal.archives-ouvertes.fr/hal-01157850/file/terminology-final-version.pdf},
hal\_id = {hal-01157850},
hal\_version = {v1},
year = 2015
}
Servan and Dymetman (2015)
Cuong, Hoang and Sima'an, Khalil (2014):
Latent Domain Translation Models in Mix-of-Domains Haystack, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
@InProceedings{cuong-simaan:2014:Coling,
author = {Cuong, Hoang and Sima'an, Khalil},
title = {Latent Domain Translation Models in Mix-of-Domains Haystack},
booktitle = {Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers},
month = {August},
address = {Dublin, Ireland},
publisher = {Dublin City University and Association for Computational Linguistics},
pages = {1928--1939},
url = {
http://www.aclweb.org/anthology/C14-1182},
year = 2014
}
Cuong and Sima'an (2014)
Cuong, Hoang and Sima'an, Khalil (2014):
Latent Domain Phrase-based Models for Adaptation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
mentioned in Phrase Based Models and Domain Adaptation@InProceedings{cuong-simaan:2014:EMNLP2014,
author = {Cuong, Hoang and Sima'an, Khalil},
title = {Latent Domain Phrase-based Models for Adaptation},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {566--576},
url = {
http://www.aclweb.org/anthology/D14-1062},
year = 2014
}
Cuong and Sima'an (2014)
Cuong, Hoang and Sima'an, Khalil (2014):
Latent Domain Translation Models in Mix-of-Domains Haystack, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
@inproceedings{HoangCuongCOLING2014,
author = {Cuong, Hoang and Sima'an, Khalil},
title = {Latent Domain Translation Models in Mix-of-Domains Haystack},
booktitle = {Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers},
month = {August},
address = {Dublin, Ireland},
publisher = {Dublin City University and Association for Computational Linguistics},
pages = {1928--1939},
url = {
http://www.aclweb.org/anthology/C14-1182},
year = 2014
}
Cuong and Sima'an (2014)
Cuong, Hoang and Sima'an, Khalil (2015):
Latent domain word alignment for heterogeneous corpora, Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL), Denver, USA, May
mentioned in Domain Adaptation and Word Alignment@inproceedings{cuong2015latent,
author = {Cuong, Hoang and Sima'an, Khalil},
title = {Latent domain word alignment for heterogeneous corpora},
booktitle = {Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL), Denver, USA, May},
year = 2015
}
Cuong and Sima'an (2015)
Mirkin, Shachar and Meunier, Jean-Luc (2015):
Personalized Machine Translation: Predicting Translational Preferences, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
@InProceedings{mirkin-meunier:2015:EMNLP,
author = {Mirkin, Shachar and Meunier, Jean-Luc},
title = {Personalized Machine Translation: Predicting Translational Preferences},
booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing},
month = {September},
address = {Lisbon, Portugal},
publisher = {Association for Computational Linguistics},
pages = {2019--2025},
url = {
http://aclweb.org/anthology/D15-1238},
year = 2015
}
Mirkin and Meunier (2015)
van der Wees, Marlies and Bisazza, Arianna and Weerkamp, Wouter and Monz, Christof (2015):
Whatâ""s in a Domain? Analyzing Genre and Topic Differences in Statistical Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
@InProceedings{vanderwees-EtAl:2015:ACL-IJCNLP,
author = {van der Wees, Marlies and Bisazza, Arianna and Weerkamp, Wouter and Monz, Christof},
title = {Whatâ""s in a Domain? Analyzing Genre and Topic Differences in Statistical Machine Translation},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {560--566},
url = {
http://www.aclweb.org/anthology/P15-2092},
year = 2015
}
Wees et al. (2015)
Haddow, Barry (2013):
Applying Pairwise Ranked Optimisation to Improve the Interpolation of Translation Models, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
@InProceedings{haddow:2013:NAACL-HLT,
author = {Haddow, Barry},
title = {Applying Pairwise Ranked Optimisation to Improve the Interpolation of Translation Models},
booktitle = {Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
month = {June},
address = {Atlanta, Georgia},
publisher = {Association for Computational Linguistics},
pages = {342--347},
url = {
http://www.aclweb.org/anthology/N13-1035},
year = 2013
}
Haddow (2013)
Hsieh, An-Chang and Huang, Hen-Hsen and Chen, Hsin-Hsi (2013):
Uses of Monolingual In-Domain Corpora for Cross-Domain Adaptation with Hybrid MT Approaches, Proceedings of the Second Workshop on Hybrid Approaches to Translation
@InProceedings{hsieh-huang-chen:2013:HyTra,
author = {Hsieh, An-Chang and Huang, Hen-Hsen and Chen, Hsin-Hsi},
title = {Uses of Monolingual In-Domain Corpora for Cross-Domain Adaptation with Hybrid {MT} Approaches},
booktitle = {Proceedings of the Second Workshop on Hybrid Approaches to Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {117--122},
url = {
http://www.aclweb.org/anthology/W13-2817},
year = 2013
}
Hsieh et al. (2013)
Toral, Antonio (2013):
Hybrid Selection of Language Model Training Data Using Linguistic Information and Perplexity, Proceedings of the Second Workshop on Hybrid Approaches to Translation
@InProceedings{toral:2013:HyTra,
author = {Toral, Antonio},
title = {Hybrid Selection of Language Model Training Data Using Linguistic Information and Perplexity},
booktitle = {Proceedings of the Second Workshop on Hybrid Approaches to Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {8--12},
url = {
http://www.aclweb.org/anthology/W13-2803},
year = 2013
}
Toral (2013)
Bicici, Ergun (2013):
Feature Decay Algorithms for Fast Deployment of Accurate Statistical Machine Translation Systems, Proceedings of the Eighth Workshop on Statistical Machine Translation
@InProceedings{bicici:2013:WMT1,
author = {Bicici, Ergun},
title = {Feature Decay Algorithms for Fast Deployment of Accurate Statistical Machine Translation Systems},
booktitle = {Proceedings of the Eighth Workshop on Statistical Machine Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {78--84},
url = {
http://www.aclweb.org/anthology/W13-2206},
year = 2013
}
Bicici (2013)
Marion Weller and Alexander Fraser and Ulrich Heid (2014):
Combining bilingual terminology mining and morphological modeling for domain adaptation in SMT, Proceedings of 17th Annual conference of the European Association for Machine Translation
@inproceedings{eamt-2014-Weller,
author = {Marion Weller and Alexander Fraser and Ulrich Heid},
title = {Combining bilingual terminology mining and morphological modeling for domain adaptation in SMT},
booktitle = {Proceedings of 17th Annual conference of the European Association for Machine Translation},
pages = {11-18},
url = {
http://www.mt-archive.info/10/EAMT-2014-Weller.pdf},
location = {Dubrovnik, Croatia},
year = 2014
}
Weller et al. (2014)
Nicola Bertoldi and Patrick Simianer and Mauro Cettolo and Katharina Wäschle and Marcello Federico and Stefan Riezler (2014):
Online adaptation to post-edits for phrase-based statistical machine translation, Machine Translation
@article{MTJ:2014:Bertoldi,
author = {Nicola Bertoldi and Patrick Simianer and Mauro Cettolo and Katharina Wäschle and Marcello Federico and Stefan Riezler},
title = {Online adaptation to post-edits for phrase-based statistical machine translation},
pages = {309-339},
journal = {Machine Translation},
volume = {28},
number = {3-4},
month = {December},
year = 2014
}
Bertoldi et al. (2014)
Mauro Cettolo and Nicola Bertoldi and Marcello Federico and Holger Schwenk and Loic Barrault and Chrstophe Servan (2014):
Translation project adaptation for MT-enhanced computer assisted translation, Machine Translation
@article{MTJ:2014:Cettolo,
author = {Mauro Cettolo and Nicola Bertoldi and Marcello Federico and Holger Schwenk and Loic Barrault and Chrstophe Servan},
title = {Translation project adaptation for MT-enhanced computer assisted translation},
pages = {127-150},
journal = {Machine Translation},
volume = {28},
number = {2},
month = {October},
year = 2014
}
Cettolo et al. (2014)
Hoang, Cuong and Sima'an, Khalil (2014):
Latent Domain Translation Models in Mix-of-Domains Haystack, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
@InProceedings{hoang-simaan:2014:Coling,
author = {Hoang, Cuong and Sima'an, Khalil},
title = {Latent Domain Translation Models in Mix-of-Domains Haystack},
booktitle = {Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers},
month = {August},
address = {Dublin, Ireland},
publisher = {Dublin City University and Association for Computational Linguistics},
pages = {1928--1939},
url = {
http://www.aclweb.org/anthology/C14-1182},
year = 2014
}
Hoang and Sima'an (2014)
Mathur, Prashant and Venkatapathy, Sriram and Cancedda, Nicola (2014):
Fast Domain Adaptation of SMT models without in-Domain Parallel Data, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
@InProceedings{mathur-venkatapathy-cancedda:2014:Coling,
author = {Mathur, Prashant and Venkatapathy, Sriram and Cancedda, Nicola},
title = {Fast Domain Adaptation of {SMT} models without in-Domain Parallel Data},
booktitle = {Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers},
month = {August},
address = {Dublin, Ireland},
publisher = {Dublin City University and Association for Computational Linguistics},
pages = {1114--1123},
url = {
http://www.aclweb.org/anthology/C14-1105},
year = 2014
}
Mathur et al. (2014)
Lu, Yi and Wang, Longyue and Wong, Derek F. and Chao, Lidia S. and Wang, Yiming (2014):
Domain Adaptation for Medical Text Translation using Web Resources, Proceedings of the Ninth Workshop on Statistical Machine Translation
@InProceedings{lu-EtAl:2014:W14-33,
author = {Lu, Yi and Wang, Longyue and Wong, Derek F. and Chao, Lidia S. and Wang, Yiming},
title = {Domain Adaptation for Medical Text Translation using Web Resources},
booktitle = {Proceedings of the Ninth Workshop on Statistical Machine Translation},
month = {June},
address = {Baltimore, Maryland, USA},
publisher = {Association for Computational Linguistics},
pages = {233--238},
url = {
http://www.aclweb.org/anthology/W14-3328},
year = 2014
}
Lu et al. (2014)
Mansour, Saab and Ney, Hermann (2014):
Unsupervised Adaptation for Statistical Machine Translation, Proceedings of the Ninth Workshop on Statistical Machine Translation
@InProceedings{mansour-ney:2014:W14-33,
author = {Mansour, Saab and Ney, Hermann},
title = {Unsupervised Adaptation for Statistical Machine Translation},
booktitle = {Proceedings of the Ninth Workshop on Statistical Machine Translation},
month = {June},
address = {Baltimore, Maryland, USA},
publisher = {Association for Computational Linguistics},
pages = {457--465},
url = {
http://www.aclweb.org/anthology/W14-3359},
year = 2014
}
Mansour and Ney (2014)
Hasler, Eva and Haddow, Barry and Koehn, Philipp (2014):
Dynamic Topic Adaptation for SMT using Distributional Profiles, Proceedings of the Ninth Workshop on Statistical Machine Translation
@InProceedings{hasler-haddow-koehn:2014:W14-33,
author = {Hasler, Eva and Haddow, Barry and Koehn, Philipp},
title = {Dynamic Topic Adaptation for {SMT} using Distributional Profiles},
booktitle = {Proceedings of the Ninth Workshop on Statistical Machine Translation},
month = {June},
address = {Baltimore, Maryland, USA},
publisher = {Association for Computational Linguistics},
pages = {445--456},
url = {
http://www.aclweb.org/anthology/W14-3358},
year = 2014
}
Hasler et al. (2014)
Bicici, Ergun and Liu, Qun and Way, Andy (2014):
Parallel FDA5 for Fast Deployment of Accurate Statistical Machine Translation Systems, Proceedings of the Ninth Workshop on Statistical Machine Translation
@InProceedings{bicici-liu-way:2014:W14-33,
author = {Bicici, Ergun and Liu, Qun and Way, Andy},
title = {Parallel FDA5 for Fast Deployment of Accurate Statistical Machine Translation Systems},
booktitle = {Proceedings of the Ninth Workshop on Statistical Machine Translation},
month = {June},
address = {Baltimore, Maryland, USA},
publisher = {Association for Computational Linguistics},
pages = {59--65},
url = {
http://www.aclweb.org/anthology/W14-3303},
year = 2014
}
Bicici et al. (2014)
Carpuat, Marine and Goutte, Cyril and Foster, George (2014):
Linear Mixture Models for Robust Machine Translation, Proceedings of the Ninth Workshop on Statistical Machine Translation
@InProceedings{carpuat-goutte-foster:2014:W14-33,
author = {Carpuat, Marine and Goutte, Cyril and Foster, George},
title = {Linear Mixture Models for Robust Machine Translation},
booktitle = {Proceedings of the Ninth Workshop on Statistical Machine Translation},
month = {June},
address = {Baltimore, Maryland, USA},
publisher = {Association for Computational Linguistics},
pages = {499--509},
url = {
http://www.aclweb.org/anthology/W14-3363},
year = 2014
}
Carpuat et al. (2014)
- Mediani et al. (2014)
Wang, Longyue and Lu, Yi and Wong, Derek F. and Chao, Lidia S. and Wang, Yiming and Oliveira, Francisco (2014):
Combining Domain Adaptation Approaches for Medical Text Translation, Proceedings of the Ninth Workshop on Statistical Machine Translation
@InProceedings{wang-EtAl:2014:W14-33,
author = {Wang, Longyue and Lu, Yi and Wong, Derek F. and Chao, Lidia S. and Wang, Yiming and Oliveira, Francisco},
title = {Combining Domain Adaptation Approaches for Medical Text Translation},
booktitle = {Proceedings of the Ninth Workshop on Statistical Machine Translation},
month = {June},
address = {Baltimore, Maryland, USA},
publisher = {Association for Computational Linguistics},
pages = {254--259},
url = {
http://www.aclweb.org/anthology/W14-3331},
year = 2014
}
Wang et al. (2014)
Cui, Lei and Chen, Xilun and Zhang, Dongdong and Liu, Shujie and Li, Mu and Zhou, Ming (2013):
Multi-Domain Adaptation for SMT Using Multi-Task Learning, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
@InProceedings{cui-EtAl:2013:EMNLP,
author = {Cui, Lei and Chen, Xilun and Zhang, Dongdong and Liu, Shujie and Li, Mu and Zhou, Ming},
title = {Multi-Domain Adaptation for {SMT} Using Multi-Task Learning},
booktitle = {Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Seattle, Washington, USA},
publisher = {Association for Computational Linguistics},
pages = {1055--1065},
url = {
http://www.aclweb.org/anthology/D13-1107},
year = 2013
}
Cui et al. (2013)
Hasler, Eva and Blunsom, Phil and Koehn, Philipp and Haddow, Barry (2014):
Dynamic Topic Adaptation for Phrase-based MT, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
@InProceedings{hasler-EtAl:2014:EACL,
author = {Hasler, Eva and Blunsom, Phil and Koehn, Philipp and Haddow, Barry},
title = {Dynamic Topic Adaptation for Phrase-based MT},
booktitle = {Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {328--337},
url = {
http://www.aclweb.org/anthology/E14-1035},
year = 2014
}
Hasler et al. (2014)
Louis, Annie and Webber, Bonnie (2014):
Structured and Unstructured Cache Models for SMT Domain Adaptation, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
@InProceedings{louis-webber:2014:EACL,
author = {Louis, Annie and Webber, Bonnie},
title = {Structured and Unstructured Cache Models for {SMT} Domain Adaptation},
booktitle = {Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {155--163},
url = {
http://www.aclweb.org/anthology/E14-1017},
year = 2014
}
Louis and Webber (2014)
Hoang, Cuong and Sima'an, Khalil (2014):
Latent Domain Phrase-based Models for Adaptation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
@InProceedings{hoang-simaan:2014:EMNLP2014,
author = {Hoang, Cuong and Sima'an, Khalil},
title = {Latent Domain Phrase-based Models for Adaptation},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {566--576},
url = {
http://www.aclweb.org/anthology/D14-1062},
year = 2014
}
Hoang and Sima'an (2014)
Kirchhoff, Katrin and Bilmes, Jeff (2014):
Submodularity for Data Selection in Machine Translation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
@InProceedings{kirchhoff-bilmes:2014:EMNLP2014,
author = {Kirchhoff, Katrin and Bilmes, Jeff},
title = {Submodularity for Data Selection in Machine Translation},
booktitle = {Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {131--141},
url = {
http://www.aclweb.org/anthology/D14-1014},
year = 2014
}
Kirchhoff and Bilmes (2014)
Carpuat, Marine and Daume III, Hal and Henry, Katharine and Irvine, Ann and Jagarlamudi, Jagadeesh and Rudinger, Rachel (2013):
SenseSpotting: Never let your parallel data tie you to an old domain, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{carpuat-EtAl:2013:ACL2013,
author = {Carpuat, Marine and Daume III, Hal and Henry, Katharine and Irvine, Ann and Jagarlamudi, Jagadeesh and Rudinger, Rachel},
title = {SenseSpotting: Never let your parallel data tie you to an old domain},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {1435--1445},
url = {
http://www.aclweb.org/anthology/P13-1141},
year = 2013
}
Carpuat et al. (2013)
Sennrich, Rico and Schwenk, Holger and Aransa, Walid (2013):
A Multi-Domain Translation Model Framework for Statistical Machine Translation, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{sennrich-schwenk-aransa:2013:ACL2013,
author = {Sennrich, Rico and Schwenk, Holger and Aransa, Walid},
title = {A Multi-Domain Translation Model Framework for Statistical Machine Translation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {832--840},
url = {
http://www.aclweb.org/anthology/P13-1082},
year = 2013
}
Sennrich et al. (2013)
Zhu, Conghui and Watanabe, Taro and Sumita, Eiichiro and Zhao, Tiejun (2013):
Hierarchical Phrase Table Combination for Machine Translation, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{zhu-EtAl:2013:ACL20132,
author = {Zhu, Conghui and Watanabe, Taro and Sumita, Eiichiro and Zhao, Tiejun},
title = {Hierarchical Phrase Table Combination for Machine Translation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {802--810},
url = {
http://www.aclweb.org/anthology/P13-1079},
year = 2013
}
Zhu et al. (2013)
Zhang, Jiajun and Zong, Chengqing (2013):
Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{zhang-zong:2013:ACL2013,
author = {Zhang, Jiajun and Zong, Chengqing},
title = {Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {1425--1434},
url = {
http://www.aclweb.org/anthology/P13-1140},
year = 2013
}
Zhang and Zong (2013)
Saab Mansour and Yaser Al-Onaizan and Graeme Blackwood and Christoph Tillmann (2014):
Automatic dialect classification for statistical machine translation, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Mansour,
author = {Saab Mansour and Yaser Al-Onaizan and Graeme Blackwood and Christoph Tillmann},
title = {Automatic dialect classification for statistical machine translation},
pages = {342-355},
url = {
http://www.mt-archive.info/10/AMTA-2014-Mansour.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Mansour et al. (2014)
Liu, Le and Hong, Yu and Liu, Hao and Wang, Xing and Yao, Jianmin (2014):
Effective Selection of Translation Model Training Data, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
@InProceedings{liu-EtAl:2014:P14-22,
author = {Liu, Le and Hong, Yu and Liu, Hao and Wang, Xing and Yao, Jianmin},
title = {Effective Selection of Translation Model Training Data},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {June},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {569--573},
url = {
http://www.aclweb.org/anthology/P14-2093},
year = 2014
}
Liu et al. (2014)
Jeblee, Serena and Feely, Weston and Bouamor, Houda and Lavie, Alon and Habash, Nizar and Oflazer, Kemal (2014):
Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic, Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP)
@InProceedings{jeblee-EtAl:2014:ANLP20142,
author = {Jeblee, Serena and Feely, Weston and Bouamor, Houda and Lavie, Alon and Habash, Nizar and Oflazer, Kemal},
title = {Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic},
booktitle = {Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP)},
month = {October},
address = {Doha, Qatar},
publisher = {Association for Computational Linguistics},
pages = {196--206},
url = {
http://www.aclweb.org/anthology/W14-3627},
year = 2014
}
Jeblee et al. (2014)
Mauro Cettolo and Nicola Bertoldi and Marcello Federico (2014):
The repetition rate of text as a predictor of the effectiveness of machine translation adaptation, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Cettolo,
author = {Mauro Cettolo and Nicola Bertoldi and Marcello Federico},
title = {The repetition rate of text as a predictor of the effectiveness of machine translation adaptation},
pages = {166-179},
url = {
http://www.mt-archive.info/10/AMTA-2014-Cettolo.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Cettolo et al. (2014)
Boxing Chen and Roland Kuhn and George Foster (2014):
A comparison of mixture and vector space techniques for translation model adaptation, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Chen,
author = {Boxing Chen and Roland Kuhn and George Foster},
title = {A comparison of mixture and vector space techniques for translation model adaptation},
pages = {124-138},
url = {
http://www.mt-archive.info/10/AMTA-2014-Chen.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Chen et al. (2014)
Eva Hasler and Barry Haddow and Philipp Koehn (2014):
Combining domain and topic adaptation for SMT, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Hasler,
author = {Eva Hasler and Barry Haddow and Philipp Koehn},
title = {Combining domain and topic adaptation for SMT},
pages = {139-151},
url = {
http://www.mt-archive.info/10/AMTA-2014-Hasker.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Hasler et al. (2014)
Shachar Mirkin and Laurent Besacier (2014):
Data selection for compact adapted SMT models, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Mirkin,
author = {Shachar Mirkin and Laurent Besacier},
title = {Data selection for compact adapted {SMT} models},
pages = {301-314},
url = {
http://www.mt-archive.info/10/AMTA-2014-Mirkin.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Mirkin and Besacier (2014)
Katsuhito Sudoh and Masaaki Nagata and Shinsuke Mori and Tatsuya Kawahara (2014):
Japanese-to-English patent translation system based on domain-adapted word segmentation and post-ordering, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Sudoh,
author = {Katsuhito Sudoh and Masaaki Nagata and Shinsuke Mori and Tatsuya Kawahara},
title = {Japanese-to-English patent translation system based on domain-adapted word segmentation and post-ordering},
pages = {234-248},
url = {
http://www.mt-archive.info/10/AMTA-2014-Sudoh.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Sudoh et al. (2014)
Joern Wuebker and Hermann Ney and Adrià Martínez-Villaronga and Adrià Giménez and Alfons Juan and Christophe Servan and Marc Dymetman and Shachar Mirkin (2014):
Comparison of data selection techniques for the translation of video lectures, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Wuebker,
author = {Joern Wuebker and Hermann Ney and Adrià Martínez-Villaronga and Adrià Giménez and Alfons Juan and Christophe Servan and Marc Dymetman and Shachar Mirkin},
title = {Comparison of data selection techniques for the translation of video lectures},
pages = {193-207},
url = {
http://www.mt-archive.info/10/AMTA-2014-Wuebker.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Wuebker et al. (2014)
Jian Zhang and Liangyou Li and Andy Way and Qun Liu (2014):
A probabilistic feature-based fill-up for SMT, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Zhang,
author = {Jian Zhang and Liangyou Li and Andy Way and Qun Liu},
title = {A probabilistic feature-based fill-up for SMT},
pages = {96-109},
url = {
http://www.mt-archive.info/10/AMTA-2014-Zhang.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Zhang et al. (2014)