Finite State Machines
Statistical machine translation models may be implemented as finite state machines. Many powerful finite state toolkits are available, and they provide their own generic decoding algorithms.
Finite State Machines is the main subject of 18 publications. 10 are discussed here.
Publications
Instead of devising a dedicated decoding algorithm for statistical machine translation, finite state tools may be used for word-based
Srinivas Bangalore and Giuseppe Riccardi (2000):
Stochastic Finite-State models for Spoken Language Machine Translation, ANLP-NAACL 2000 Workshop: Embedded Machine Translation Systems
@Inproceedings{Bangalore:2000,
author = {Srinivas Bangalore and Giuseppe Riccardi},
title = {Stochastic Finite-State models for Spoken Language Machine Translation},
url = {http://www.aclweb.org/anthology-new/W/W00/W00-0508.pdf},
googlescholar = {5984868585217966640},
booktitle = {ANLP-NAACL 2000 Workshop: Embedded Machine Translation Systems},
year = 2000
}
(Bangalore and Riccardi, 2000;
Srinivas Bangalore and Giuseppe Riccardi (2001):
A Finite-State Approach to Machine Translation, Proceedings of Annual Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL)
@Inproceedings{Bangalore:2001,
author = {Srinivas Bangalore and Giuseppe Riccardi},
title = {A Finite-State Approach to Machine Translation},
url = {http://acl.ldc.upenn.edu/N/N01/N01-1018.pdf},
booktitle = {Proceedings of Annual Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL)},
year = 2001
}
Bangalore and Riccardi, 2001;
Tsukada, Hajime and Nagata, Masaaki (2004):
Efficient Decoding for Statistical Machine Translation with a Fully Expanded WFST Model, Proceedings of EMNLP 2004
@inproceedings{Tsukada:2004,
author = {Tsukada, Hajime and Nagata, Masaaki},
title = {Efficient Decoding for Statistical Machine Translation with a Fully Expanded WFST Model},
url = {http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Tsukada.pdf},
googlescholar = {11552192498336003432},
booktitle = {Proceedings of EMNLP 2004},
editor = {Dekang Lin and Dekai Wu},
month = {July},
address = {Barcelona, Spain},
publisher = {Association for Computational Linguistics},
pages = {427--433},
year = 2004
}
Tsukada and Nagata, 2004;
Francisco Casacuberta and Enrique Vidal (2004):
Machine translation with inferred stochastic finite-state transducers, Computational Linguistics
@Article{casacuberta:2004,
author = {Francisco Casacuberta and Enrique Vidal},
title = {Machine translation with inferred stochastic finite-state transducers},
url = {http://acl.ldc.upenn.edu/J/J04/J04-2004.pdf},
googlescholar = {10847257596009666709},
journal = {Computational Linguistics},
volume = {30},
number = {2},
pages = {205--225},
year = 2004
}
Casacuberta and Vidal, 2004), alignment template
Kumar, Shankar and Byrne, William (2003):
A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation, HLT-NAACL 2003: Main Proceedings
@inproceedings{Kumar:2003,
author = {Kumar, Shankar and Byrne, William},
title = {A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation},
url = {http://acl.ldc.upenn.edu/N/N03/N03-1019.pdf},
booktitle = {HLT-NAACL 2003: Main Proceedings},
editor = {Marti Hearst and Mari Ostendorf},
month = {May 27 - June 1},
address = {Edmonton, Alberta, Canada},
publisher = {Association for Computational Linguistics},
pages = {142--149},
year = 2003
}
(Kumar and Byrne, 2003) and phrase-based models. The use of finite state toolkits also allows for the training of word-based and phrase-based models. The implementation by
Deng, Yonggang and Byrne, William (2005):
HMM Word and Phrase Alignment for Statistical Machine Translation, Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
@InProceedings{deng-byrne:2005:HLTEMNLP,
author = {Deng, Yonggang and Byrne, William},
title = {{HMM} Word and Phrase Alignment for Statistical Machine Translation},
booktitle = {Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing},
month = {October},
address = {Vancouver, British Columbia, Canada},
publisher = {Association for Computational Linguistics},
pages = {169--176},
url = {http://www.aclweb.org/anthology/H/H05/H05-1022},
year = 2005
}
Deng and Byrne (2005) is available as the MTTK toolkit
Deng, Yonggang and Byrne, William (2006):
MTTK: An Alignment Toolkit for Statistical Machine Translation, Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Demonstrations
@InProceedings{deng-byrne:2006:HLT-NAACL06-Demos,
author = {Deng, Yonggang and Byrne, William},
title = {MTTK: An Alignment Toolkit for Statistical Machine Translation},
booktitle = {Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Demonstrations},
month = {June},
address = {New York City, USA},
publisher = {Association for Computational Linguistics},
pages = {265--268},
url = {http://www.aclweb.org/anthology/N/N06/N06-4004},
year = 2006
}
(Deng and Byrne, 2006). Similarly, the IBM models may be implemented using graphical model toolkits
Filali, Karim and Bilmes, Jeff (2007):
Generalized Graphical Abstractions for Statistical Machine Translation, Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
@InProceedings{filali-bilmes:2007:ShortPapers,
author = {Filali, Karim and Bilmes, Jeff},
title = {Generalized Graphical Abstractions for Statistical Machine Translation},
booktitle = {Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers},
month = {April},
address = {Rochester, New York},
publisher = {Association for Computational Linguistics},
pages = {33--36},
url = {http://www.aclweb.org/anthology/N/N07/N07-2009},
year = 2007
}
(Filali and Bilmes, 2007).
Alicia Pérez and Víctor Guijarrubia and Raquel Justo and M. Inés Torres and Francisco Casacuberta (2007):
A Comparison of Linguistically and Statistically Enhanced Models for Speech-to-Speech Machine Translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{Perez:2007:IWSLT,
author = {Alicia P{\'e}rez and V{\'i}ctor Guijarrubia and Raquel Justo and M. In{\'e}s Torres and Francisco Casacuberta},
title = {A Comparison of Linguistically and Statistically Enhanced Models for Speech-to-Speech Machine Translation},
url = {http://20.210-193-52.unknown.qala.com.sg/archive/iwslt\_07/papers/slt7\_013.pdf},
googlescholar = {16979493087263083812},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2007
}
Pérez et al. (2007) compare finite state implementations of word-based and phrase-based models.
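To make the idea of decoding with generic finite state operations concrete, the following is a minimal sketch, not taken from any of the cited papers or toolkits. It assumes a monotone, word-by-word translation model and a bigram language model, both with made-up costs (negative log probabilities), and finds the cheapest path through their composition, which is built on the fly. The names `LEXICON`, `BIGRAM`, `BACKOFF`, and `decode` are hypothetical.

```python
import heapq

# Hypothetical toy translation model: source word -> [(target word, cost)].
# Costs are illustrative negative log probabilities (tropical semiring).
LEXICON = {
    "das": [("the", 0.4), ("that", 1.1)],
    "haus": [("house", 0.2), ("home", 1.6)],
    "ist": [("is", 0.1)],
    "klein": [("small", 0.3), ("little", 0.9)],
}

# Hypothetical bigram language model costs; unseen bigrams pay BACKOFF.
BIGRAM = {
    ("<s>", "the"): 0.5, ("<s>", "that"): 1.0,
    ("the", "house"): 0.4, ("that", "house"): 1.5,
    ("the", "home"): 1.2,
    ("house", "is"): 0.3, ("home", "is"): 0.6,
    ("is", "small"): 0.5, ("is", "little"): 1.4,
}
BACKOFF = 3.0


def decode(source):
    """Return (target words, cost) of the cheapest path through the
    composition of the input sentence, the lexicon transducer, and the
    bigram acceptor. Assumes monotone word-by-word translation."""
    # A state of the composed machine: (source position, previous target word).
    start = (0, "<s>")
    best = {start: 0.0}
    queue = [(0.0, start, [])]
    while queue:
        cost, (pos, prev), output = heapq.heappop(queue)
        if pos == len(source):
            # All costs are non-negative, so the first final state popped
            # is the end of a cheapest path (Dijkstra's algorithm).
            return output, cost
        if cost > best.get((pos, prev), float("inf")):
            continue  # stale queue entry
        for target, tm_cost in LEXICON.get(source[pos], []):
            new_cost = cost + tm_cost + BIGRAM.get((prev, target), BACKOFF)
            state = (pos + 1, target)
            if new_cost < best.get(state, float("inf")):
                best[state] = new_cost
                heapq.heappush(queue, (new_cost, state, output + [target]))
    return None, float("inf")  # no path, e.g. an unknown source word


if __name__ == "__main__":
    print(decode(["das", "haus", "ist", "klein"]))
```

A finite state toolkit would express the same computation as a composition of weighted transducers followed by a shortest-path search; the sketch hard-codes both steps and ignores reordering, which the cited models handle in different ways.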
Just as word-based and phrase-based models may be implemented with finite state toolkits, a general framework of tree transducers may subsume many of the proposed tree-based models
Jonathan Graehl and Kevin Knight (2004):
Training Tree Transducers, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL)
@Inproceedings{Graehl:2004,
author = {Jonathan Graehl and Kevin Knight},
title = {Training Tree Transducers},
url = {http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/58\_Paper.pdf},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL)},
year = 2004
}
(Graehl and Knight, 2004).
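As a rough illustration of what a tree transducer does, the sketch below implements a deterministic, single-state, top-down transducer that reorders children and relabels leaves. It is a simplification under stated assumptions, not the formalism or training procedure of Graehl and Knight (2004): there are no weights, no multiple states, and no rule learning. The rule tables `REORDER` and `LEAF`, the node label `VPpart`, and the helper names are hypothetical.

```python
# Trees are (label, [children]) for internal nodes and plain strings for leaves.

# Hypothetical reordering rules: node label -> order in which children are emitted.
REORDER = {"VPpart": [1, 0]}  # participle before object becomes object before participle

# Hypothetical leaf translation rules (English to German).
LEAF = {"he": "er", "has": "hat", "read": "gelesen", "the": "das", "book": "Buch"}


def transduce(tree):
    """Apply the rules top-down: reorder children, then recurse into them."""
    if isinstance(tree, str):
        return LEAF.get(tree, tree)  # unknown leaves are copied unchanged
    label, children = tree
    order = REORDER.get(label, range(len(children)))
    return (label, [transduce(children[i]) for i in order])


def leaves(tree):
    """Read off the yield (the leaf sequence) of a tree, left to right."""
    if isinstance(tree, str):
        return [tree]
    _, children = tree
    return [word for child in children for word in leaves(child)]


TREE = ("S", [
    ("NP", ["he"]),
    ("VP", ["has", ("VPpart", ["read", ("NP", ["the", "book"])])]),
])

if __name__ == "__main__":
    print(" ".join(leaves(transduce(TREE))))
```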
Benchmarks
Discussion
Related Topics
New Publications
Argueta, Arturo and Chiang, David (2017):
Decoding with Finite-State Transducers on GPUs, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
@InProceedings{argueta-chiang:2017:EACLlong,
author = {Argueta, Arturo and Chiang, David},
title = {Decoding with Finite-State Transducers on GPUs},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers},
month = {April},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {1044--1052},
url = {http://www.aclweb.org/anthology/E17-1098},
year = 2017
}
Argueta and Chiang (2017)
Iglesias, Gonzalo and de Gispert, Adrià and Banga, Eduardo R. and Byrne, William (2009):
Hierarchical Phrase-Based Translation with Weighted Finite State Transducers, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
@InProceedings{iglesias-EtAl:2009:NAACLHLT09,
author = {Iglesias, Gonzalo and de Gispert, Adri\`{a} and Banga, Eduardo R. and Byrne, William},
title = {Hierarchical Phrase-Based Translation with Weighted Finite State Transducers},
booktitle = {Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
month = {June},
address = {Boulder, Colorado},
publisher = {Association for Computational Linguistics},
pages = {433--441},
url = {http://www.aclweb.org/anthology/N/N09/N09-1049},
year = 2009
}
Iglesias et al. (2009)
González, Jorge and Casacuberta, Francisco (2009):
GREAT: A Finite-State Machine Translation Toolkit Implementing a Grammatical Inference Approach for Transducer Inference (GIATI), Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
@InProceedings{gonzalez-casacuberta:2009:CLAGI,
author = {Gonz\'alez, Jorge and Casacuberta, Francisco},
title = {{GREAT}: A Finite-State Machine Translation Toolkit Implementing a Grammatical Inference Approach for Transducer Inference ({GIATI})},
booktitle = {Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference},
month = {March},
address = {Athens, Greece},
publisher = {Association for Computational Linguistics},
pages = {24--32},
url = {http://www.aclweb.org/anthology/W09-1005},
year = 2009
}
González and Casacuberta (2009)
Malik, M. G. Abbas and Boitet, Christian and Bhattacharyya, Pushpak (2010):
Finite-state Scriptural Translation, Coling 2010: Posters
@InProceedings{malik-boitet-bhattacharyya:2010:POSTERS,
author = {Malik, M. G. Abbas and Boitet, Christian and Bhattacharyya, Pushpak},
title = {Finite-state Scriptural Translation},
booktitle = {Coling 2010: Posters},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {791--800},
url = {http://www.aclweb.org/anthology/C10-2091},
year = 2010
}
Malik et al. (2010)
Iglesias, Gonzalo and Allauzen, Cyril and Byrne, William and de Gispert, Adrià and Riley, Michael (2011):
Hierarchical Phrase-based Translation Representations, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
@InProceedings{iglesias-EtAl:2011:EMNLP,
author = {Iglesias, Gonzalo and Allauzen, Cyril and Byrne, William and de Gispert, Adri\`{a} and Riley, Michael},
title = {Hierarchical Phrase-based Translation Representations},
booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Edinburgh, Scotland, UK},
publisher = {Association for Computational Linguistics},
pages = {1373--1383},
url = {http://www.aclweb.org/anthology/D11-1127},
year = 2011
}
Iglesias et al. (2011)
Hiyan Alshawi and Srinivas Bangalore and Shona Douglas (2002):
Head-Transducer Models for Speech Translation and Their Automatic Acquisition from Bilingual Data, Machine Translation
@article{MTJ:2002:Alshawi,
author = {Hiyan Alshawi and Srinivas Bangalore and Shona Douglas},
title = {Head-Transducer Models for Speech Translation and Their Automatic Acquisition from Bilingual Data},
url = {ftp://ftp.cis.upenn.edu/pub/srini/AlshawiBangaloreDouglasFinal.ps},
googlescholar = {1064727921438651510},
pages = {105--124},
journal = {Machine Translation},
volume = {15},
number = {1--2},
month = {June},
year = 2002
}
Alshawi et al. (2002)
Beck, Daniel Emilio (2011):
Syntax-based Statistical Machine Translation using Tree Automata and Tree Transducers, Proceedings of the ACL 2011 Student Session
@InProceedings{beck:2011:SS,
author = {Beck, Daniel Emilio},
title = {Syntax-based Statistical Machine Translation using Tree Automata and Tree Transducers},
booktitle = {Proceedings of the ACL 2011 Student Session},
month = {June},
address = {Portland, OR, USA},
publisher = {Association for Computational Linguistics},
pages = {36--40},
url = {http://www.aclweb.org/anthology/P11-3007},
year = 2011
}
Beck (2011)
Stephan Vogel and Hermann Ney (2000):
Translation with Cascaded Finite State Transducers, Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL)
@InProceedings{Vogel:2000,
author = {Stephan Vogel and Hermann Ney},
title = {Translation with Cascaded Finite State Transducers},
url = {http://www.aclweb.org/anthology/P00-1004},
googlescholar = {14583653011845237450},
booktitle = {Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 2000
}
Vogel and Ney (2000)