Integrated Syntactic Features
The traditional word-based and phrased-based models view sentences as strings of words, and do not incorporate a more linguistic view into the model. However, to enforce more grammatical output, syntactic properties may be encoded as features and used in addition to the traditional model.
Integrating Syntactic Features is the main subject of 21 publications. 1 are discussed here.
Publications
If syntactic properties can be formulated as features that fit into the framework of the beam search decoding algorithm, they may be integrated into model.
Deepa Gupta and Mauro Cettolo and Marcello Federico (2007):
POS-based Reordering Models for Statistical Machine Translation, Proceedings of the MT Summit XI
@inproceedings{Gupta:2007:MTSummit,
author = {Deepa Gupta and Mauro Cettolo and Marcello Federico},
title = {POS-based Reordering Models for Statistical Machine Translation},
url = {
http://www.researchgate.net/publication/228797738\_POS-based\_reordering\_models\_for\_statistical\_machine\_translation/file/9fcfd5094227266fed.pdf},
googlescholar = {339301193388145832},
booktitle = {Proceedings of the {MT} Summit XI},
year = 2007
}
Gupta et al. (2007) propose reordering features for different part-of-speech types, that flag different reordering behavior for nouns, verbs, and adjectives.
Benchmarks
Discussion
Related Topics
New Publications
Turki Khemakhem, Ines and Jamoussi, Salma and Ben Hamadou, Abdelmajid (2013):
Integrating morpho-syntactic features in English-Arabic statistical machine translation, Proceedings of the Second Workshop on Hybrid Approaches to Translation
@InProceedings{turkikhemakhem-jamoussi-benhamadou:2013:HyTra,
author = {Turki Khemakhem, Ines and Jamoussi, Salma and Ben Hamadou, Abdelmajid},
title = {Integrating morpho-syntactic features in English-Arabic statistical machine translation},
booktitle = {Proceedings of the Second Workshop on Hybrid Approaches to Translation},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {74--81},
url = {
http://www.aclweb.org/anthology/W13-2811},
year = 2013
}
Khemakhem et al. (2013)
Yooyung Kim and Jun Huang and Youssef Billawala and Demitrios Master and Farzad Ehsani (2005):
Sehda S2MT: Incorporation of Syntax into Statistical Translation System, Proc. of the International Workshop on Spoken Language Translation
@InProceedings{kim:2005:iwslt,
author = {Yooyung Kim and Jun Huang and Youssef Billawala and Demitrios Master and Farzad Ehsani},
title = {Sehda {S2MT}: Incorporation of Syntax into Statistical Translation System},
booktitle = {Proc. of the International Workshop on Spoken Language Translation},
location = {Pittsburgh, PA, USA},
month = {October},
year = 2005
}
Kim et al. (2005)
Chenchen Ding and Masao Utiyama and Eiichiro Sumita (2014):
Document-level re-ranking with soft lexical and semantic features for statistical machine translation, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Ding,
author = {Chenchen Ding and Masao Utiyama and Eiichiro Sumita},
title = {Document-level re-ranking with soft lexical and semantic features for statistical machine translation},
pages = {110-123},
url = {
http://www.mt-archive.info/10/AMTA-2014-Ding.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Ding et al. (2014)
Chenchen Ding and Masao Utiyama and Eiichiro Sumita (2014):
Document-level re-ranking with soft lexical and semantic features for statistical machine translation, Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)
@inproceedings{AMTA-2014-Ding,
author = {Chenchen Ding and Masao Utiyama and Eiichiro Sumita},
title = {Document-level re-ranking with soft lexical and semantic features for statistical machine translation},
pages = {110-123},
url = {
http://www.mt-archive.info/10/AMTA-2014-Ding.pdf},
volume = {1},
booktitle = {Proceedings of the Eleventh Conference of the Association for Machine Translation in the Americas (AMTA)},
location = {Vancouver, BC, Canada},
year = 2014
}
Ding et al. (2014)
Deyi Xiong and Min Zhang and Aiti Aw and Haizhou Li (2010):
Linguistically Annotated Reordering: Evaluation and Analysis, Computational Linguistics
@Article{CL:J10-3009,
author = {Deyi Xiong and Min Zhang and Aiti Aw and Haizhou Li},
title = {Linguistically Annotated Reordering: Evaluation and Analysis},
journal = {Computational Linguistics},
volume = {36},
number = {3},
url = {
http://aclweb.org/anthology-new/J/J10/J10-3009.pdf},
year = 2010
}
Xiong et al. (2010)
Tim Hunter and Philip Resnik (2010):
Exploiting syntactic relationships in a phrase-based decoder: an exploration, Machine Translation
@article{MTJ:2010:Hunter,
author = {Tim Hunter and Philip Resnik},
title = {Exploiting syntactic relationships in a phrase-based decoder: an exploration},
url = {
http://www.tc.umn.edu/~timh/decoder/mtjournal.pdf},
googlescholar = {10266739974111233292},
pages = {123-140},
journal = {Machine Translation},
volume = {24},
number = {2},
month = {June},
year = 2010
}
Hunter and Resnik (2010)
Yuval Marton and David Chiang and Philip Resnik (2012):
Soft syntactic constraints for Arabic-English hierarchical phrase-based translation, Machine Translation
@article{MTJ:2012:Marton,
author = {Yuval Marton and David Chiang and Philip Resnik},
title = {Soft syntactic constraints for {Arabic}-{English} hierarchical phrase-based translation},
url = {
http://cse.iitk.ac.in/users/cs671/2013/hw3/marton-chiang-12\_soft-constraints-for-arabic-english.pdf},
googlescholar = {6559030275887096567},
pages = {137-157},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Marton et al. (2012)
Yuval Marton and David Chiang and Philip Resnik (2012):
Soft syntactic constraints for Arabic-English hierarchical phrase-based translation, Machine Translation
@article{MTJ:2012:Marton,
author = {Yuval Marton and David Chiang and Philip Resnik},
title = {Soft syntactic constraints for {Arabic}-{English} hierarchical phrase-based translation},
url = {
http://cse.iitk.ac.in/users/cs671/2013/hw3/marton-chiang-12\_soft-constraints-for-arabic-english.pdf},
googlescholar = {6559030275887096567},
pages = {137-157},
journal = {Machine Translation},
volume = {26},
number = {1-2},
month = {March},
year = 2012
}
Marton et al. (2012)
Chenchen Ding and Takashi Inui and Mikio Yamamoto (2011):
Long-distance hierarchical structure transformation rules utilizing function words, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt11:Ding,
author = {Chenchen Ding and Takashi Inui and Mikio Yamamoto},
title = {Long-distance hierarchical structure transformation rules utilizing function words},
url = {
http://www.mt-archive.info/IWSLT-2011-Ding.pdf},
pages = {159-166},
editor = {Marcello Federico and Mei-Yuh Hwang and Margit R{\"o}dder and Sebastian St{\"u}ker},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
location = {San Francisco, USA},
year = 2011
}
Ding et al. (2011)
Jan-Thorsten Peter and Matthias Huck and Hermann Ney and Daniel Stein (2011):
Soft string-to-dependency hierarchical machine translation, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{iwslt11:Peter,
author = {Jan-Thorsten Peter and Matthias Huck and Hermann Ney and Daniel Stein},
title = {Soft string-to-dependency hierarchical machine translation},
url = {
http://www.mt-archive.info/IWSLT-2011-Peter.pdf},
pages = {246-253},
editor = {Marcello Federico and Mei-Yuh Hwang and Margit R{\"o}dder and Sebastian St{\"u}ker},
booktitle = {Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT)},
location = {San Francisco, USA},
year = 2011
}
Peter et al. (2011)
Lemao Liu and Tiejun Zhao and Chao Wang and Hailong Cao (2011):
A Unified and Discriminative Soft Syntactic Constraint Model for Hierarchical Phrase-based Translation, Proceedings of the 13th Machine Translation Summit (MT Summit XIII)
@inproceedings{MTS-2011-Liu-3,
author = {Lemao Liu and Tiejun Zhao and Chao Wang and Hailong Cao},
title = {A Unified and Discriminative Soft Syntactic Constraint Model for Hierarchical Phrase-based Translation},
url = {
http://www.mt-archive.info/MTS-2011-Liu-3.pdf},
pages = {253-260},
booktitle = {Proceedings of the 13th Machine Translation Summit (MT Summit XIII)},
publisher = {International Association for Machine Translation},
location = {Xiamen, China},
year = 2011
}
Liu et al. (2011)
Liu, Ding and Gildea, Daniel (2010):
Semantic Role Features for Machine Translation, Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
@InProceedings{liu-gildea:2010:PAPERS,
author = {Liu, Ding and Gildea, Daniel},
title = {Semantic Role Features for Machine Translation},
booktitle = {Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)},
month = {August},
address = {Beijing, China},
publisher = {Coling 2010 Organizing Committee},
pages = {716--724},
url = {
http://www.aclweb.org/anthology/C10-1081},
year = 2010
}
Liu and Gildea (2010)
Germann, Ulrich (2012):
Syntax-aware Phrase-based Statistical Machine Translation: System Description, Proceedings of the Seventh Workshop on Statistical Machine Translation
@InProceedings{germann:2012:WMT,
author = {Germann, Ulrich},
title = {Syntax-aware Phrase-based Statistical Machine Translation: System Description},
booktitle = {Proceedings of the Seventh Workshop on Statistical Machine Translation},
month = {June},
address = {Montreal, Canada},
publisher = {Association for Computational Linguistics},
pages = {338--343},
url = {
http://www.aclweb.org/anthology/W12-3140},
year = 2012
}
Germann (2012)
Rosa, Rudolf and Dušek, Ondřej and Mareček, David and Popel, Martin (2012):
Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors, Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
@InProceedings{rosa-EtAl:2012:SSST-6,
author = {Rosa, Rudolf and Du\v{s}ek, Ond\v{r}ej and Mare\v{c}ek, David and Popel, Martin},
title = {Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors},
booktitle = {Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation},
month = {July},
address = {Jeju, Republic of Korea},
publisher = {Association for Computational Linguistics},
pages = {39--48},
url = {
http://www.aclweb.org/anthology/W12-4205},
year = 2012
}
Rosa et al. (2012)
David Vilar and Daniel Stein and Hermann Ney (2008):
Analysing soft syntax features and heuristics for hierarchical phrase based machine translation, Proceedings of the International Workshop on Spoken Language Translation (IWSLT)
@inproceedings{vilar2:iwslt:2008,
author = {David Vilar and Daniel Stein and Hermann Ney},
title = {Analysing soft syntax features and heuristics for hierarchical phrase based machine translation},
url = {
http://www.dfki.de/~davi01/papers/vilar08:heuristicSyntax.pdf},
googlescholar = {2418046096523148831},
pages = {190--197},
booktitle = {Proceedings of the International Workshop on Spoken Language Translation (IWSLT)},
year = 2008
}
Vilar et al. (2008)
Xiang, Bing and Ge, Niyu and Ittycheriah, Abraham (2011):
Improving Reordering for Statistical Machine Translation with Smoothed Priors and Syntactic Features, Proceedings of Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation
@InProceedings{xiang-ge-ittycheriah:2011:SSST,
author = {Xiang, Bing and Ge, Niyu and Ittycheriah, Abraham},
title = {Improving Reordering for Statistical Machine Translation with Smoothed Priors and Syntactic Features},
booktitle = {Proceedings of Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {61--69},
url = {
http://www.aclweb.org/anthology/W11-1007},
year = 2011
}
Xiang et al. (2011)
Banchs, Rafael E. and Costa-jussa, Marta R. (2011):
A Semantic Feature for Statistical Machine Translation, Proceedings of Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation
@InProceedings{banchs-costajussa:2011:SSST,
author = {Banchs, Rafael E. and Costa-jussa, Marta R.},
title = {A Semantic Feature for Statistical Machine Translation},
booktitle = {Proceedings of Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {126--134},
url = {
http://www.aclweb.org/anthology/W11-1014},
year = 2011
}
Banchs and Costa-jussa (2011)
Mareček, David and Rosa, Rudolf and Galuščáková, Petra and Bojar, Ondřej (2011):
Two-step translation with grammatical post-processing, Proceedings of the Sixth Workshop on Statistical Machine Translation
@InProceedings{marevcek-EtAl:2011:WMT,
author = {Mare\v{c}ek, David and Rosa, Rudolf and Galu\v{s}\v{c}\'{a}kov\'{a}, Petra and Bojar, Ondřej},
title = {Two-step translation with grammatical post-processing},
booktitle = {Proceedings of the Sixth Workshop on Statistical Machine Translation},
month = {July},
address = {Edinburgh, Scotland},
publisher = {Association for Computational Linguistics},
pages = {426--432},
url = {
http://www.aclweb.org/anthology/W11-2152},
year = 2011
}
Mareček et al. (2011)
Gao, Yang and Koehn, Philipp and Birch, Alexandra (2011):
Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
@InProceedings{gao-koehn-birch:2011:EMNLP,
author = {Gao, Yang and Koehn, Philipp and Birch, Alexandra},
title = {Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation},
booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Edinburgh, Scotland, UK.},
publisher = {Association for Computational Linguistics},
pages = {857--868},
url = {
http://www.aclweb.org/anthology/D11-1079},
year = 2011
}
Gao et al. (2011)
Zhang, Jiajun and Zhai, Feifei and Zong, Chengqing (2011):
Augmenting String-to-Tree Translation Models with Fuzzy Use of Source-side Syntax, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
@InProceedings{zhang-zhai-zong:2011:EMNLP,
author = {Zhang, Jiajun and Zhai, Feifei and Zong, Chengqing},
title = {Augmenting String-to-Tree Translation Models with Fuzzy Use of Source-side Syntax},
booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing},
month = {July},
address = {Edinburgh, Scotland, UK.},
publisher = {Association for Computational Linguistics},
pages = {204--215},
url = {
http://www.aclweb.org/anthology/D11-1019},
year = 2011
}
Zhang et al. (2011)
Clark, Jonathan H. and Frederking, Robert and Levin, Lori (2008):
Inductive Detection of Language Features via Clustering Minimal Pairs: Toward Feature-Rich Grammars in Machine Translation, Proceedings of the ACL-08: HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)
@InProceedings{clark-frederking-levin:2008:SSST,
author = {Clark, Jonathan H. and Frederking, Robert and Levin, Lori},
title = {Inductive Detection of Language Features via Clustering Minimal Pairs: Toward Feature-Rich Grammars in Machine Translation},
booktitle = {Proceedings of the ACL-08:~HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {78--86},
url = {
http://www.aclweb.org/anthology/W/W08/W08-0410},
year = 2008
}
Clark et al. (2008)
- Och et al. (2003)