Syntax Based Models without Linguistic Annotation

Publications

Rooted in a finite state machine approach, head automata have been developed that allow for tree representation during the translation process

Hiyan Alshawi (1996): Head Automata and Bilingual Tiling: Translation with Minimal Representations (Invited Talk), Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)

add

@Inproceedings{Alshawi:1996,
author = {Hiyan Alshawi},
title = {Head Automata and Bilingual Tiling: Translation with Minimal Representations (Invited Talk)},
url = {http://acl.ldc.upenn.edu/P/P96/P96-1023.pdf?origin=publicationDetail},
googlescholar = {11846375422406776920},
booktitle = {Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1996
}

(Alshawi, 1996;

Hiyan Alshawi and Adam L. Buchsbaum and Fei Xia (1997): A Comparison of Head Transducers and Transfer for a Limited Domain Translation Application, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)

add

@Inproceedings{Alshawi:1997,
author = {Hiyan Alshawi and Adam L. Buchsbaum and Fei Xia},
title = {A Comparison of Head Transducers and Transfer for a Limited Domain Translation Application},
url = {http://acl.ldc.upenn.edu/P/P97/P97-1046.pdf},
googlescholar = {5948054786386064082},
booktitle = {Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1997
}

Alshawi et al., 1997;

Hiyan Alshawi and Srinivas Bangalore and Shona Douglas (1998): Automatic Acquisition of Hierarchical Transduction Models for Machine Translation, Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)

add

@Inproceedings{Alshawi:1998,
author = {Hiyan Alshawi and Srinivas Bangalore and Shona Douglas},
title = {Automatic Acquisition of Hierarchical Transduction Models for Machine Translation},
url = {http://acl.ldc.upenn.edu/C/C98/C98-1006.pdf},
googlescholar = {11495305628758623054},
booktitle = {Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 1998
}

Alshawi et al., 1998;

Hiyan Alshawi and Srinivas Bangalore and Shona Douglas (2000): Learning Dependency Translation Models as Collection of Finite-State Head Transducers, Computational Linguistics

add

@Article{Alshawi:2000,
author = {Hiyan Alshawi and Srinivas Bangalore and Shona Douglas},
title = {Learning Dependency Translation Models as Collection of Finite-State Head Transducers},
url = {http://acl.ldc.upenn.edu/J/J00/J00-1004.pdf},
journal = {Computational Linguistics},
volume = {26},
number = {1},
pages = {45--60},
year = 2000
}

Alshawi et al., 2000).

Ye-Yi Wang and Alex Waibel (1998): Modeling with Structures in Statistical Machine translation, Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)

mentioned in Phrase Based Models and Inversion Transduction Grammars

add

@Inproceedings{Wang:1998,
author = {Ye-Yi Wang and Alex Waibel},
title = {Modeling with Structures in Statistical Machine translation},
url = {http://acl.ldc.upenn.edu/P/P98/P98-2221.pdf},
googlescholar = {12385920933414037529},
booktitle = {Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 1998
}

Wang and Waibel (1998) also induce hierarchical structures in their statistical machine translation model. The use of synchronous grammars for statistical machine translation has been pioneered by

Dekai Wu (1995): Trainable Coarse Bilingual Grammars for Parallel Text Bracketing, Proceedings of the Third Workshop on Very Large Corpora (VLC)

add

@Inproceedings{Wu:1995b,
author = {Dekai Wu},
title = {Trainable Coarse Bilingual Grammars for Parallel Text Bracketing},
url = {http://acl.ldc.upenn.edu/W/W95/W95-0106.pdf},
googlescholar = {8753937834967775813},
booktitle = {Proceedings of the Third Workshop on Very Large Corpora (VLC)},
year = 1995
}

Wu (1995);

Dekai Wu (1996): A Polynomial-Time Algorithm for Statistical Machine Translation, Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)

add

@Inproceedings{Wu:1996,
author = {Dekai Wu},
title = {A Polynomial-Time Algorithm for Statistical Machine Translation},
url = {http://acl.ldc.upenn.edu/P/P96/P96-1021.pdf},
googlescholar = {3924641975573530982},
booktitle = {Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL)},
year = 1996
}

Wu (1996);

Dekai Wu (1997): Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora, Computational Linguistics

mentioned in Word Alignment Based On Coocurrence and Inversion Transduction Grammars

add

@Article{Wu:1997,
author = {Dekai Wu},
title = {Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora},
url = {http://acl.ldc.upenn.edu/J/J97/J97-3002.pdf},
googlescholar = {7926725626202301933},
journal = {Computational Linguistics},
volume = {23},
number = {3},
year = 1997
}

Wu (1997);

Dekai Wu and Hongsing Wong (1998): Machine Translation with a Stochastic Grammatical Channel, Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)

add

@Inproceedings{Wu:1998,
author = {Dekai Wu and Hongsing Wong},
title = {Machine Translation with a Stochastic Grammatical Channel},
url = {http://acl.ldc.upenn.edu/P/P98/P98-2230.pdf},
googlescholar = {17922869678767927043},
booktitle = {Proceedings of the 36th Annual Meeting of the Association of Computational Linguistics (ACL)},
year = 1998
}

Wu and Wong (1998) with their work on inversion transduction grammars (ITG) and stochastic bracketing transduction grammars (SBTG). A different formal model are multitext grammars

I. Dan Melamed (2003): Multitext grammars and synchronous parsers, Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)

add

@Inproceedings{Melamed:2003,
author = {I. Dan Melamed},
title = {Multitext grammars and synchronous parsers},
url = {http://acl.ldc.upenn.edu/N/N03/N03-1021.pdf},
googlescholar = {11734294015301744662},
booktitle = {Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL)},
year = 2003
}

(Melamed, 2003;

Melamed, I. Dan (2004): Statistical Machine Translation by Parsing, Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume

add

@inproceedings{Melamed:2004,
author = {Melamed, I. Dan},
title = {Statistical Machine Translation by Parsing},
url = {http://acl.ldc.upenn.edu/acl2004/main/pdf/113\_pdf\_2-col.pdf},
googlescholar = {10328834082011647315},
booktitle = {Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume},
month = {July},
address = {Barcelona, Spain},
pages = {653--660},
year = 2004
}

Melamed, 2004;

Melamed, I. Dan and Satta, Giorgio and Wellington, Benjamin (2004): Generalized Multitext Grammars, Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume

add

@inproceedings{Melamed:2004b,
author = {Melamed, I. Dan  and  Satta, Giorgio  and  Wellington, Benjamin},
title = {Generalized Multitext Grammars},
url = {http://acl.ldc.upenn.edu/acl2004/main/pdf/277\_pdf\_2-col.pdf},
googlescholar = {2355681091376158087},
booktitle = {Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume},
month = {July},
address = {Barcelona, Spain},
pages = {661--668},
year = 2004
}

Melamed et al., 2004) and range concatenation grammars

Søgaard, Anders (2008): Range Concatenation Grammars for Translation, Coling 2008: Companion volume: Posters and Demonstrations

add

@InProceedings{sogaard:2008:POSTERS2,
author = {S{\o}gaard, Anders},
title = {Range Concatenation Grammars for Translation},
booktitle = {Coling 2008: Companion volume: Posters and Demonstrations},
month = {August},
address = {Manchester, UK},
publisher = {Coling 2008 Organizing Committee},
pages = {101--104},
url = {http://www.aclweb.org/anthology/C08-3026},
year = 2008
}

(Søgaard, 2008).

Zhang, Hao and Gildea, Daniel (2005): Stochastic Lexicalized Inversion Transduction Grammar for Alignment, Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05)

add

@InProceedings{zhang-gildea:2005:ACL,
author = {Zhang, Hao  and  Gildea, Daniel},
title = {Stochastic Lexicalized Inversion Transduction Grammar for Alignment},
booktitle = {Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05)},
month = {June},
address = {Ann Arbor, Michigan},
publisher = {Association for Computational Linguistics},
pages = {475--482},
url = {http://www.aclweb.org/anthology/P/P05/P05-1059},
year = 2005
}

Zhang and Gildea (2005) extend the ITG formalism by lexicalizing grammar rules and apply it to a small-scale word alignment task. Armed with an efficient A* algorithm for this formalism

Zhang, Hao and Gildea, Daniel (2006): Efficient Search for Inversion Transduction Grammar, Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing

add

@InProceedings{zhang-gildea:2006:EMNLP,
author = {Zhang, Hao  and  Gildea, Daniel},
title = {Efficient Search for Inversion Transduction Grammar},
booktitle = {Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {224--231},
url = {http://www.aclweb.org/anthology/W/W06/W06-1627},
year = 2006
}

(Zhang and Gildea, 2006), they extend the formalism with rules that are lexicalized in both input and output

Zhang, Hao and Gildea, Daniel (2006): Inducing Word Alignments with Bilexical Synchronous Trees, Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions

add

@InProceedings{zhang-gildea:2006:POS,
author = {Zhang, Hao  and  Gildea, Daniel},
title = {Inducing Word Alignments with Bilexical Synchronous Trees},
booktitle = {Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions},
month = {July},
address = {Sydney, Australia},
publisher = {Association for Computational Linguistics},
pages = {953--960},
url = {http://www.aclweb.org/anthology/P/P06/P06-2122},
year = 2006
}

(Zhang and Gildea, 2006). Word alignments generated with ITG may also be used to extract phrase for phrase-based models

Joan Andreu Sánchez and José Miguel Benedí (2006): Obtaining Word Phrases with Stochastic Inversion Translation Grammars for Phrase-based Statistical Machine Translation, Proceedings of the 11th Conference of the European Association for Machine Translation (EAMT)

add

@InProceedings{Sanchez:2006:EAMT,
author = {Joan Andreu S\'{a}nchez and Jos\'{e} Miguel Bened\'{i}},
title = {Obtaining Word Phrases with Stochastic Inversion Translation Grammars for Phrase-based Statistical Machine Translation},
url = {http://mt-archive.info/EAMT-2006-Sanchez.pdf},
googlescholar = {16689080566077645608},
booktitle = {Proceedings of the 11th Conference of the European Association for Machine Translation (EAMT)},
month = {June},
address = {Oslo, Norway},
year = 2006
}

(Sánchez and Benedí, 2006;

Sánchez, Joan Andreu and Benedí, José Miguel (2006): Stochastic Inversion Transduction Grammars for Obtaining Word Phrases for Phrase-based Statistical Machine Translation, Proceedings on the Workshop on Statistical Machine Translation

add

@InProceedings{sanchez-benedi:2006:WMT,
author = {S\'{a}nchez, Joan Andreu  and  Bened\'{i}, Jos\'{e} Miguel},
title = {Stochastic Inversion Transduction Grammars for Obtaining Word Phrases for Phrase-based Statistical Machine Translation},
booktitle = {Proceedings on the Workshop on Statistical Machine Translation},
month = {June},
address = {New York City},
publisher = {Association for Computational Linguistics},
pages = {130--133},
url = {http://www.aclweb.org/anthology/W/W06/W06-3117},
year = 2006
}

Sánchez and Benedí, 2006b).

Zhang, Hao and Quirk, Chris and Moore, Robert C. and Gildea, Daniel (2008): Bayesian Learning of Non-Compositional Phrases with Synchronous Parsing, Proceedings of ACL-08: HLT

add

@InProceedings{zhang-EtAl:2008:ACLMain2,
author = {Zhang, Hao  and  Quirk, Chris  and  Moore, Robert C.  and  Gildea, Daniel},
title = {{Bayesian} Learning of Non-Compositional Phrases with Synchronous Parsing},
booktitle = {Proceedings of ACL-08: HLT},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {97--105},
url = {http://www.aclweb.org/anthology/P/P08/P08-1012},
year = 2008
}

Zhang et al. (2008) present an efficient ITG phrase alignment algorithm that uses Bayesian learning with priors.

MT Research Survey Wiki

A Comprehensive Survey of Neural and Statistical Machine Translation Research Publications

Search Descriptions

Syntax Based Models without Linguistic Annotation

Publications

Benchmarks

Discussion

Related Topics

New Publications