Syntax Based Models without Linguistic Annotation
Publications
Rooted in a finite-state machine approach, head automata have been developed that allow for tree representation during the translation process
Hiyan Alshawi (1996):
Head Automata and Bilingual Tiling: Translation with Minimal Representations (Invited Talk), Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)
@Inproceedings{Alshawi:1996,
author = {Hiyan Alshawi},
title = {Head Automata and Bilingual Tiling: Translation with Minimal Representations (Invited Talk)},
url = {http://acl.ldc.upenn.edu/P/P96/P96-1023.pdf?origin=publicationDetail},
googlescholar = {11846375422406776920},
booktitle = {Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1996
}
(Alshawi, 1996;
Hiyan Alshawi and Adam L. Buchsbaum and Fei Xia (1997):
A Comparison of Head Transducers and Transfer for a Limited Domain Translation Application, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)
@Inproceedings{Alshawi:1997,
author = {Hiyan Alshawi and Adam L. Buchsbaum and Fei Xia},
title = {A Comparison of Head Transducers and Transfer for a Limited Domain Translation Application},
url = {http://acl.ldc.upenn.edu/P/P97/P97-1046.pdf},
googlescholar = {5948054786386064082},
booktitle = {Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1997
}
Alshawi et al., 1997;
Hiyan Alshawi and Srinivas Bangalore and Shona Douglas (1998):
Automatic Acquisition of Hierarchical Transduction Models for Machine Translation, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL)
@Inproceedings{Alshawi:1998,
author = {Hiyan Alshawi and Srinivas Bangalore and Shona Douglas},
title = {Automatic Acquisition of Hierarchical Transduction Models for Machine Translation},
url = {http://acl.ldc.upenn.edu/C/C98/C98-1006.pdf},
googlescholar = {11495305628758623054},
booktitle = {Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1998
}
Alshawi et al., 1998;
Hiyan Alshawi and Srinivas Bangalore and Shona Douglas (2000):
Learning Dependency Translation Models as Collections of Finite-State Head Transducers, Computational Linguistics
@Article{Alshawi:2000,
author = {Hiyan Alshawi and Srinivas Bangalore and Shona Douglas},
title = {Learning Dependency Translation Models as Collections of Finite-State Head Transducers},
url = {http://acl.ldc.upenn.edu/J/J00/J00-1004.pdf},
journal = {Computational Linguistics},
volume = {26},
number = {1},
pages = {45--60},
year = 2000
}
Alshawi et al., 2000).
Ye-Yi Wang and Alex Waibel (1998):
Modeling with Structures in Statistical Machine Translation, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL)
mentioned in Phrase Based Models and Inversion Transduction Grammars
@Inproceedings{Wang:1998,
author = {Ye-Yi Wang and Alex Waibel},
title = {Modeling with Structures in Statistical Machine Translation},
url = {http://acl.ldc.upenn.edu/P/P98/P98-2221.pdf},
googlescholar = {12385920933414037529},
booktitle = {Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1998
}
Wang and Waibel (1998) also induce hierarchical structures in their statistical machine translation model.
The use of synchronous grammars for statistical machine translation was pioneered by
Dekai Wu (1995):
Trainable Coarse Bilingual Grammars for Parallel Text Bracketing, Proceedings of the Third Workshop on Very Large Corpora (VLC)
@Inproceedings{Wu:1995b,
author = {Dekai Wu},
title = {Trainable Coarse Bilingual Grammars for Parallel Text Bracketing},
url = {http://acl.ldc.upenn.edu/W/W95/W95-0106.pdf},
googlescholar = {8753937834967775813},
booktitle = {Proceedings of the Third Workshop on Very Large Corpora (VLC)},
year = 1995
}
Wu (1995);
Dekai Wu (1996):
A Polynomial-Time Algorithm for Statistical Machine Translation, Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)
@Inproceedings{Wu:1996,
author = {Dekai Wu},
title = {A Polynomial-Time Algorithm for Statistical Machine Translation},
url = {http://acl.ldc.upenn.edu/P/P96/P96-1021.pdf},
googlescholar = {3924641975573530982},
booktitle = {Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1996
}
Wu (1996);
Dekai Wu (1997):
Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora, Computational Linguistics
mentioned in Word Alignment Based On Coocurrence and Inversion Transduction Grammars
@Article{Wu:1997,
author = {Dekai Wu},
title = {Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora},
url = {http://acl.ldc.upenn.edu/J/J97/J97-3002.pdf},
googlescholar = {7926725626202301933},
journal = {Computational Linguistics},
volume = {23},
number = {3},
year = 1997
}
Wu (1997);
Dekai Wu and Hongsing Wong (1998):
Machine Translation with a Stochastic Grammatical Channel, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL)
@Inproceedings{Wu:1998,
author = {Dekai Wu and Hongsing Wong},
title = {Machine Translation with a Stochastic Grammatical Channel},
url = {http://acl.ldc.upenn.edu/P/P98/P98-2230.pdf},
googlescholar = {17922869678767927043},
booktitle = {Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1998
}
Wu and Wong (1998) with their work on inversion transduction grammars (ITG) and stochastic bracketing transduction grammars (SBTG). Other formal models are multitext grammars
I. Dan Melamed (2003):
Multitext grammars and synchronous parsers, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL)
@Inproceedings{Melamed:2003,
author = {I. Dan Melamed},
title = {Multitext grammars and synchronous parsers},
url = {http://acl.ldc.upenn.edu/N/N03/N03-1021.pdf},
googlescholar = {11734294015301744662},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL)},
year = 2003
}
(Melamed, 2003;
Melamed, I. Dan (2004):
Statistical Machine Translation by Parsing, Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume
@inproceedings{Melamed:2004,
author = {Melamed, I. Dan},
title = {Statistical Machine Translation by Parsing},
url = {http://acl.ldc.upenn.edu/acl2004/main/pdf/113\_pdf\_2-col.pdf},
googlescholar = {10328834082011647315},
booktitle = {Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume},
month = {July},
address = {Barcelona, Spain},
pages = {653--660},
year = 2004
}
Melamed, 2004;
Melamed, I. Dan and Satta, Giorgio and Wellington, Benjamin (2004):
Generalized Multitext Grammars, Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume
@inproceedings{Melamed:2004b,
author = {Melamed, I. Dan and Satta, Giorgio and Wellington, Benjamin},
title = {Generalized Multitext Grammars},
url = {http://acl.ldc.upenn.edu/acl2004/main/pdf/277\_pdf\_2-col.pdf},
googlescholar = {2355681091376158087},
booktitle = {Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume},
month = {July},
address = {Barcelona, Spain},
pages = {661--668},
year = 2004
}
Melamed et al., 2004) and range concatenation grammars
Søgaard, Anders (2008):
Range Concatenation Grammars for Translation, Coling 2008: Companion volume: Posters and Demonstrations
@InProceedings{sogaard:2008:POSTERS2,
author = {S{\o}gaard, Anders},
title = {Range Concatenation Grammars for Translation},
booktitle = {Coling 2008: Companion volume: Posters and Demonstrations},
month = {August},
address = {Manchester, UK},
publisher = {Coling 2008 Organizing Committee},
pages = {101--104},
url = {http://www.aclweb.org/anthology/C08-3026},
year = 2008
}
(Søgaard, 2008).
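As a rough illustration of what sets ITG apart from unrestricted reordering, the sketch below checks whether a word-order permutation can be built from single words using only the two binary ITG combinations, straight (order kept) and inverted (order swapped). The function name `itg_derivable` and the encoding of a reordering as a tuple of integers are assumptions made for this example; they are not taken from the cited papers.

```python
from itertools import permutations


def itg_derivable(perm):
    """Return True if `perm` (a sequence of distinct integers) can be built
    from single elements using only straight and inverted binary combinations.

    Hypothetical helper for illustration only.
    """
    n = len(perm)
    if n <= 1:
        return True
    # Try every split point into a left and a right block.
    for k in range(1, n):
        left, right = perm[:k], perm[k:]
        straight = max(left) < min(right)   # [A B]: blocks keep their order
        inverted = min(left) > max(right)   # <A B>: blocks swap their order
        if (straight or inverted) and itg_derivable(left) and itg_derivable(right):
            return True
    return False


# Enumerate all reorderings of four words and collect the ones ITG cannot produce.
not_derivable = [p for p in permutations((1, 2, 3, 4)) if not itg_derivable(p)]
print(not_derivable)
```

For four words, this check rejects only (2, 4, 1, 3) and (3, 1, 4, 2), the two "inside-out" reorderings discussed by Wu (1997), and accepts the other 22 of the 24 permutations.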
Zhang, Hao and Gildea, Daniel (2005):
Stochastic Lexicalized Inversion Transduction Grammar for Alignment, Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05)
@InProceedings{zhang-gildea:2005:ACL,
author = {Zhang, Hao and Gildea, Daniel},
title = {Stochastic Lexicalized Inversion Transduction Grammar for Alignment},
booktitle = {Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05)},
month = {June},
address = {Ann Arbor, Michigan},
publisher = {Association for Computational Linguistics},
pages = {475--482},
url = {http://www.aclweb.org/anthology/P/P05/P05-1059},
year = 2005
}
Zhang and Gildea (2005) extend the ITG formalism by lexicalizing grammar rules and apply it to a small-scale word alignment task. Armed with an efficient A* algorithm for this formalism
Zhang, Hao and Gildea, Daniel (2006):
Efficient Search for Inversion Transduction Grammar, Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
@InProceedings{zhang-gildea:2006:EMNLP,
author = {Zhang, Hao and Gildea, Daniel},
title = {Efficient Search for Inversion Transduction Grammar},
booktitle = {Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {224--231},
url = {http://www.aclweb.org/anthology/W/W06/W06-1627},
year = 2006
}
(Zhang and Gildea, 2006), they extend the formalism with rules that are lexicalized in both input and output
Zhang, Hao and Gildea, Daniel (2006):
Inducing Word Alignments with Bilexical Synchronous Trees, Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
@InProceedings{zhang-gildea:2006:POS,
author = {Zhang, Hao and Gildea, Daniel},
title = {Inducing Word Alignments with Bilexical Synchronous Trees},
booktitle = {Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {953--960},
url = {http://www.aclweb.org/anthology/P/P06/P06-2122},
year = 2006
}
(Zhang and Gildea, 2006). Word alignments generated with ITG may also be used to extract phrases for phrase-based models
Joan Andreu Sánchez and José Miguel Benedí (2006):
Obtaining Word Phrases with Stochastic Inversion Translation Grammars for Phrase-based Statistical Machine Translation, Proceedings of the 11th Conference of the European Association for Machine Translation (EAMT)
@InProceedings{Sanchez:2006:EAMT,
author = {Joan Andreu S\'{a}nchez and Jos\'{e} Miguel Bened\'{i}},
title = {Obtaining Word Phrases with Stochastic Inversion Translation Grammars for Phrase-based Statistical Machine Translation},
url = {http://mt-archive.info/EAMT-2006-Sanchez.pdf},
googlescholar = {16689080566077645608},
booktitle = {Proceedings of the 11th Conference of the European Association for Machine Translation (EAMT)},
month = {June},
address = {Oslo, Norway},
year = 2006
}
(Sánchez and Benedí, 2006;
Sánchez, Joan Andreu and Benedí, José Miguel (2006):
Stochastic Inversion Transduction Grammars for Obtaining Word Phrases for Phrase-based Statistical Machine Translation, Proceedings on the Workshop on Statistical Machine Translation
@InProceedings{sanchez-benedi:2006:WMT,
author = {S\'{a}nchez, Joan Andreu and Bened\'{i}, Jos\'{e} Miguel},
title = {Stochastic Inversion Transduction Grammars for Obtaining Word Phrases for Phrase-based Statistical Machine Translation},
booktitle = {Proceedings on the Workshop on Statistical Machine Translation},
month = {June},
address = {New York City},
publisher = {Association for Computational Linguistics},
pages = {130--133},
url = {http://www.aclweb.org/anthology/W/W06/W06-3117},
year = 2006
}
Sánchez and Benedí, 2006b).
Zhang, Hao and Quirk, Chris and Moore, Robert C. and Gildea, Daniel (2008):
Bayesian Learning of Non-Compositional Phrases with Synchronous Parsing, Proceedings of ACL-08: HLT
@InProceedings{zhang-EtAl:2008:ACLMain2,
author = {Zhang, Hao and Quirk, Chris and Moore, Robert C. and Gildea, Daniel},
title = {{Bayesian} Learning of Non-Compositional Phrases with Synchronous Parsing},
booktitle = {Proceedings of ACL-08: HLT},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {97--105},
url = {http://www.aclweb.org/anthology/P/P08/P08-1012},
year = 2008
}
Zhang et al. (2008) present an efficient ITG phrase alignment algorithm that uses Bayesian learning with priors.