Incremental Updating
New training data may arrive continuously, thus allowing incremental updating of models.
Incremental Updating is the main subject of 28 publications. 4 are discussed here.
Publications
When a user interacts with machine translation, a desirable feature for the machine translation engine is to incrementally learn from each user translation.
Carlos A. Henríquez Q. and José B. Mariño and Rafael E. Banchs (2011):
Deriving translation units using small additional corpora, Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)
@inproceedings{eamt11:Henriquez,
author = {Carlos A. {Henr{\'i}quez~Q.} and Jos{\'e} B. Mari{\~n}o and Rafael E. Banchs},
title = {Deriving translation units using small additional corpora},
pages = {121--128},
booktitle = {Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)},
location = {Leuven, Belgium},
editor = {Mikel L. Forcada and Heidi Depraetere and Vincent Vandeghinste},
year = 2011
}
Henríquez Q. et al. (2011) incrementally update the translation model and language model with blocks of new translations. Word alignment is carried out with TER, extracting only reliable new translation units. A language model trained on the new data is interpolated with the baseline.
Marion Potet and Emmanuelle Esperança-Rodier and Hervé Blanchon and Laurent Besacier (2011):
Preliminary Experiments on Using Users' Post-Editions to Enhance a SMT System, Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)
@inproceedings{eamt11:Potet,
author = {Marion Potet and Emmanuelle Esperan{\,c}a-Rodier and Herv{\'e} Blanchon and Laurent Besacier},
title = {Preliminary Experiments on Using Users' Post-Editions to Enhance a {SMT} System},
pages = {161--168},
booktitle = {Proceedings of the 15th International Conference of the European Association for Machine Translation (EAMT)},
location = {Leuven, Belgium},
editor = {Mikel L. Forcada and Heidi Depraetere and Vincent Vandeghinste},
year = 2011
}
Potet et al. (2011) compare incrementally updated translation models against learning a dedicated post-editing model and using the new incoming data for tuning. A related approach is the incremental updating of the translation model from a large data stream
Levenberg, Abby and Callison-Burch, Chris and Osborne, Miles (2010):
Stream-based Translation Models for Statistical Machine Translation, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
mentioned in Phrase Based Model Training and Incremental Updating@InProceedings{levenberg-callisonburch-osborne:2010:NAACLHLT,
author = {Levenberg, Abby and Callison-Burch, Chris and Osborne, Miles},
title = {Stream-based Translation Models for Statistical Machine Translation},
booktitle = {Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
month = {June},
address = {Los Angeles, California},
publisher = {Association for Computational Linguistics},
pages = {394--402},
url = {
http://www.aclweb.org/anthology/N10-1062},
year = 2010
}
(Levenberg et al., 2010).
Benchmarks
Discussion
Related Topics
New Publications
Daniel Ortiz-Martínez (2016):
Online Learning for Statistical Machine Translation, Computational Linguistics
@Article{J16-1004,
author = {Daniel Ortiz-Mart{\'i}nez},
title = {Online Learning for Statistical Machine Translation},
journal = {Computational Linguistics},
volume = {41},
number = {1},
month = {March},
year = 2016
}
Ortiz-Martínez (2016)
Shachar Mirkin and Nicola Cancedda (2013):
Assessing quick update methods of statistical translation models, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{Mirkin:iwslt:2013,
author = {Shachar Mirkin and Nicola Cancedda},
title = {Assessing quick update methods of statistical translation models},
url = {
http://www.mt-archive.info/10/IWSLT-2013-Mirkin.pdf},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2013
}
Mirkin and Cancedda (2013)
Denkowski, Michael and Dyer, Chris and Lavie, Alon (2014):
Learning from Post-Editing: Online Model Adaptation for Statistical Machine Translation, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
@InProceedings{denkowski-dyer-lavie:2014:EACL,
author = {Denkowski, Michael and Dyer, Chris and Lavie, Alon},
title = {Learning from Post-Editing: Online Model Adaptation for Statistical Machine Translation},
booktitle = {Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {395--404},
url = {
http://www.aclweb.org/anthology/E14-1042},
year = 2014
}
Denkowski et al. (2014)
Dara, Aswarth Abhilash and van Genabith, Josef and Liu, Qun and Judge, John and Toral, Antonio (2014):
Active Learning for Post-Editing Based Incrementally Retrained MT, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers
@InProceedings{dara-EtAl:2014:EACL2014-SP,
author = {Dara, Aswarth Abhilash and van Genabith, Josef and Liu, Qun and Judge, John and Toral, Antonio},
title = {Active Learning for Post-Editing Based Incrementally Retrained MT},
booktitle = {Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {185--189},
url = {
http://www.aclweb.org/anthology/E14-4036},
year = 2014
}
Dara et al. (2014)
Green, Spence and Wang, Sida and Cer, Daniel and Manning, Christopher D. (2013):
Fast and Adaptive Online Training of Feature-Rich Translation Models, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{green-EtAl:2013:ACL2013,
author = {Green, Spence and Wang, Sida and Cer, Daniel and Manning, Christopher D.},
title = {Fast and Adaptive Online Training of Feature-Rich Translation Models},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {311--321},
url = {
http://www.aclweb.org/anthology/P13-1031},
year = 2013
}
Green et al. (2013)
Denkowski, Michael and Lavie, Alon and Lacruz, Isabel and Dyer, Chris (2014):
Real Time Adaptive Machine Translation for Post-Editing with cdec and TransCenter, Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation
@InProceedings{denkowski-EtAl:2014:HaCaT,
author = {Denkowski, Michael and Lavie, Alon and Lacruz, Isabel and Dyer, Chris},
title = {Real Time Adaptive Machine Translation for Post-Editing with cdec and TransCenter},
booktitle = {Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {72--77},
url = {
http://www.aclweb.org/anthology/W14-0311},
year = 2014
}
Denkowski et al. (2014)
Farajian, M. Amin and Bertoldi, Nicola and Federico, Marcello (2014):
Online Word Alignment for Online Adaptive Machine Translation, Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation
@InProceedings{farajian-bertoldi-federico:2014:HaCaT,
author = {Farajian, M. Amin and Bertoldi, Nicola and Federico, Marcello},
title = {Online Word Alignment for Online Adaptive Machine Translation},
booktitle = {Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation},
month = {April},
address = {Gothenburg, Sweden},
publisher = {Association for Computational Linguistics},
pages = {84--92},
url = {
http://www.aclweb.org/anthology/W14-0313},
year = 2014
}
Farajian et al. (2014)
Prashant Mathur and Mauro Cettolo and Marcello Federico and Jos\' G.C. de Souza (2014):
Online multi-user adaptive statistical machine translation, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Mathur,
author = {Prashant Mathur and Mauro Cettolo and Marcello Federico and Jos{\'} G.C. de~Souza},
title = {Online multi-user adaptive statistical machine translation},
pages = {152-165},
url = {
http://www.mt-archive.info/10/AMTA-2014-Mathur.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Mathur et al. (2014)
Ventsislav Zhechev (2014):
Term translation central: up-to-date MT without frequent retraining, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA), Vol. 2: MT Users Track
@inproceedings{AMTA-2014-Zhechev,
author = {Ventsislav Zhechev},
title = {Term translation central: up-to-date {MT} without frequent retraining},
pages = {152-159},
url = {
http://www.mt-archive.info/10/AMTA-2014-Zhechev.pdf},
volume = {2},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA), Vol.~2: {MT} Users Track},
location = {Vancouver, BC, Canada},
year = 2014
}
Zhechev (2014)
Ortiz-Martínez, Daniel and García-Varea, Ismael and Casacuberta, Francisco (2010):
Online Learning for Interactive Statistical Machine Translation, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
mentioned in Phrase Based Model Training and Incremental Updating@InProceedings{ortizmartinez-garciavarea-casacuberta:2010:NAACLHLT,
author = {Ortiz-Mart\'{i}nez, Daniel and Garc\'{i}a-Varea, Ismael and Casacuberta, Francisco},
title = {Online Learning for Interactive Statistical Machine Translation},
booktitle = {Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
month = {June},
address = {Los Angeles, California},
publisher = {Association for Computational Linguistics},
pages = {546--554},
url = {
http://www.aclweb.org/anthology/N10-1079},
year = 2010
}
Ortiz-Martínez et al. (2010)
Daniel Hardt and Jakob Elming (2010):
Incremental Re-training for Post-editing SMT, Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas
@inproceedings{AMTA-2010-Hardt,
author = {Daniel Hardt and Jakob Elming},
title = {Incremental Re-training for Post-editing {SMT}},
url = {
http://www.mt-archive.info/AMTA-2010-Hardt.pdf},
booktitle = {Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas},
location = {Denver, Colorado},
year = 2010
}
Hardt and Elming (2010)
Ortiz-Martínez, Daniel and Leiva, Luis A. and Alabau, Vicent and García-Varea, Ismael and Casacuberta, Francisco (2011):
An Interactive Machine Translation System with Online Learning, Proceedings of the ACL-HLT 2011 System Demonstrations
@InProceedings{ortizmartinez-EtAl:2011:Demos,
author = {Ortiz-Mart\'{i}nez, Daniel and Leiva, Luis A. and Alabau, Vicent and Garc\'{i}a-Varea, Ismael and Casacuberta, Francisco},
title = {An Interactive Machine Translation System with Online Learning},
booktitle = {Proceedings of the ACL-HLT 2011 System Demonstrations},
month = {June},
address = {Portland, Oregon},
publisher = {Association for Computational Linguistics},
pages = {68--73},
url = {
http://www.aclweb.org/anthology/P11-4012},
year = 2011
}
Ortiz-Martínez et al. (2011)
Qin Gao and Will Lewis and Chris Quirk and Mei-Yuh Hwang (2011):
Incremental Training and Intentional Over-fitting of Word Alignment, Proceedings of the 13th Machine Translation Summit (MT Summit XIII)
@inproceedings{MTS-2011-Gao,
author = {Qin Gao and Will Lewis and Chris Quirk and Mei-Yuh Hwang},
title = {Incremental Training and Intentional Over-fitting of Word Alignment},
url = {
http://www.mt-archive.info/MTS-2011-Gao.pdf},
pages = {106-113},
booktitle = {Proceedings of the 13th Machine Translation Summit (MT Summit XIII)},
publisher = {International Association for Machine Translation},
location = {Xiamen, China},
year = 2011
}
Gao et al. (2011)
Frédéric Blain and Jean Senellart and Holger Schwenk and Mirko Plitt and Johann Roturier (2011):
Qualitative Analysis of Post-Editing for High Quality Machine Translation, Proceedings of the 13th Machine Translation Summit (MT Summit XIII)
@inproceedings{MTS-2011-Blain,
author = {Fr{\'e}d{\'e}ric Blain and Jean Senellart and Holger Schwenk and Mirko Plitt and Johann Roturier},
title = {Qualitative Analysis of Post-Editing for High Quality Machine Translation},
url = {
http://www.mt-archive.info/MTS-2011-Blain.pdf},
pages = {164-171},
booktitle = {Proceedings of the 13th Machine Translation Summit (MT Summit XIII)},
publisher = {International Association for Machine Translation},
location = {Xiamen, China},
year = 2011
}
Blain et al. (2011)
Frédéric Blain and Holger Schwenk and Jean Senellart (2012):
Incremental adaptation using translation information and post-editing analysis, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt12:Blain,
author = {Fr{\'e}d{\'e}ric Blain and Holger Schwenk and Jean Senellart},
title = {Incremental adaptation using translation information and post-editing analysis},
url = {
http://www.mt-archive.info/IWSLT-2012-Blain.pdf},
pages = {229-236},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
location = {Hong Kong},
year = 2012
}
Blain et al. (2012)
González-Rubio, Jesús and Ortiz-Martínez, Daniel and Casacuberta, Francisco (2012):
Active learning for interactive machine translation, Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
@InProceedings{gonzalezrubio-ortizmartinez-casacuberta:2012:EACL2012,
author = {Gonz\'{a}lez-Rubio, Jes\'{u}s and Ortiz-Mart\'{i}nez, Daniel and Casacuberta, Francisco},
title = {Active learning for interactive machine translation},
booktitle = {Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics},
month = {April},
address = {Avignon, France},
publisher = {Association for Computational Linguistics},
pages = {245--254},
url = {
http://www.aclweb.org/anthology/E12-1025},
year = 2012
}
González-Rubio et al. (2012)
Banerjee, Pratyush and Naskar, Sudip Kumar and Roturier, Johann and Way, Andy and van Genabith, Josef (2012):
Translation Quality-Based Supplementary Data Selection by Incremental Update of Translation Models, Proceedings of COLING 2012
@InProceedings{banerjee-EtAl:2012:PAPERS,
author = {Banerjee, Pratyush and Naskar, Sudip Kumar and Roturier, Johann and Way, Andy and van Genabith, Josef},
title = {Translation Quality-Based Supplementary Data Selection by Incremental Update of Translation Models},
booktitle = {Proceedings of COLING 2012},
month = {December},
address = {Mumbai, India},
publisher = {The COLING 2012 Organizing Committee},
pages = {149--166},
url = {
http://www.aclweb.org/anthology/C12-1010},
year = 2012
}
Banerjee et al. (2012)
Nicola Bertoldi and Mauro Cettolo and Marcello Federico (2013):
Cache-based Online Adaptation for Machine Translation Enhanced Computer Assisted Translation, Machine Translation Summit XIV
@inproceedings{MTS2013-Bertoldi,
author = {Nicola Bertoldi and Mauro Cettolo and Marcello Federico},
title = {Cache-based Online Adaptation for Machine Translation Enhanced Computer Assisted Translation},
url = {
http://www.mt-archive.info/10/MTS-2013-Bertoldi.pdf},
pages = {35--42},
booktitle = {Machine Translation Summit XIV},
year = 2013
}
Bertoldi et al. (2013)
Martha Alicia Rocha and Joan Andreu Sanchez (2013):
Towards the Supervised Machine Translation: Real Word Alignments and Translations in a Multi-task Active Learning process, Machine Translation Summit XIV
@inproceedings{MTS2013-Rocha,
author = {Martha Alicia Rocha and Joan Andreu Sanchez},
title = {Towards the Supervised Machine Translation: Real Word Alignments and Translations in a Multi-task Active Learning process},
url = {
http://www.mt-archive.info/10/MTS-2013-Rocha.pdf},
pages = {239--248},
booktitle = {Machine Translation Summit XIV},
year = 2013
}
Rocha and Sanchez (2013)
Michel Simard and George Foster (2013):
PEPr: Post-Edit Propagation Using Phrase-based Statistical Machine Translation, Machine Translation Summit XIV
@inproceedings{MTS2013-Simard,
author = {Michel Simard and George Foster},
title = {PEPr: Post-Edit Propagation Using Phrase-based Statistical Machine Translation},
url = {
http://www.mt-archive.info/10/MTS-2013-Simard.pdf},
pages = {191--198},
booktitle = {Machine Translation Summit XIV},
year = 2013
}
Simard and Foster (2013)
Katharina Wäschle and Patrick Simianer and Nicola Bertoldi and Stefan Riezler and Marcello Federico (2013):
Generative and Discriminative Methods for Online Adaptation in SMT, Machine Translation Summit XIV
@inproceedings{MTS2013-Waschle,
author = {Katharina W{\"a}schle and Patrick Simianer and Nicola Bertoldi and Stefan Riezler and Marcello Federico},
title = {Generative and Discriminative Methods for Online Adaptation in {SMT}},
url = {
http://www.mt-archive.info/10/MTS-2013-Waschle.pdf},
pages = {11--18},
booktitle = {Machine Translation Summit XIV},
year = 2013
}
Wäschle et al. (2013)
Mauro Cettolo and Christophe Servan and Nicola Bertoldi and Marcello Federico and Loïc Barrault and Holger Schwenk (2013):
Issues in incremental adaptation of statistical MT from human post-edits, Proceedings of Workshop on Post-editing Technology and Practice
mentioned in Domain Adaptation and Incremental Updating@inproceedings{MTS2013W2-Cettolo,
author = {Mauro Cettolo and Christophe Servan and Nicola Bertoldi and Marcello Federico and Loïc Barrault and Holger Schwenk},
title = {Issues in incremental adaptation of statistical {MT} from human post-edits},
url = {
http://www.mt-archive.info/10/MTS-2013-W2-Cettolo.pdf},
pages = {111--118},
booktitle = {Proceedings of Workshop on Post-editing Technology and Practice},
year = 2013
}
Cettolo et al. (2013)
Prashant Mathur and Mauro Cettolo (2014):
Optimized MT online learning in computer assisted translation, Proceedings of the Workshop on interactive and adaptive machine translation
@inproceedings{AMTA-2014-W1-Mathur,
author = {Prashant Mathur and Mauro Cettolo},
title = {Optimized {MT} online learning in computer assisted translation},
pages = {32-41},
url = {
http://www.mt-archive.info/10/AMTA-2014-W1-Mathur.pdf},
booktitle = {Proceedings of the Workshop on interactive and adaptive machine translation},
location = {Vancouver, BC, Canada},
year = 2014
}
Mathur and Cettolo (2014)
Vicent Alabau and Jesús González-Rubio and Daniel Ortiz-Martínez and Germán Sanchis-Trilles and Francisco Casacuberta and Mercedes García-Martínez and Bartolomé Mesa-Lao and Dan Cheung Petersen and Barbara Dragsted and Michael Carl (2014):
Integrating online and active learning in a computer-assisted translation workbench, Proceedings of the Workshop on interactive and adaptive machine translation
@inproceedings{AMTA-2014-W1-Alabau,
author = {Vicent Alabau and Jes{\'u}s Gonz{\'a}lez-Rubio and Daniel Ortiz-Mart{\'i}nez and Germ{\'a}n Sanchis-Trilles and Francisco Casacuberta and Mercedes Garc{\'i}a-Mart{\'i}nez and Bartolom{\'e} Mesa-Lao and Dan Cheung Petersen and Barbara Dragsted and Michael Carl},
title = {Integrating online and active learning in a computer-assisted translation workbench},
pages = {1-8},
url = {
http://www.mt-archive.info/10/AMTA-2014-W1-Alabau.pdf},
booktitle = {Proceedings of the Workshop on interactive and adaptive machine translation},
location = {Vancouver, BC, Canada},
year = 2014
}
Alabau et al. (2014)
Mathur, Prashant and Mauro, Cettolo and Federico, Marcello (2013):
Online Learning Approaches in Computer Assisted Translation, Proceedings of the Eighth Workshop on Statistical Machine Translation
@InProceedings{mathur-mauro-federico:2013:WMT,
author = {Mathur, Prashant and Mauro, Cettolo and Federico, Marcello},
title = {Online Learning Approaches in Computer Assisted Translation},
booktitle = {Proceedings of the Eighth Workshop on Statistical Machine Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {301--308},
url = {
http://www.aclweb.org/anthology/W13-2237},
year = 2013
}
Mathur et al. (2013)