Binarizing Synchronous Grammars
Complex rules may be too specific to be generally useful and also increase the complexity of parsing. Hence, we may want tp binarize either the parse trees from which rules are extracted or the rules themselves.
Binarizing SCFG is the main subject of 10 publications. 5 are discussed here.
Publications
Binarization may be driven by the source side grammar
Zhang, Hao and Huang, Liang and Gildea, Daniel and Knight, Kevin (2006):
Synchronous Binarization for Machine Translation, Proceedings of the Human Language Technology Conference of the NAACL, Main Conference
@InProceedings{zhang-EtAl:2006:HLT-NAACL06-Main,
author = {Zhang, Hao and Huang, Liang and Gildea, Daniel and Knight, Kevin},
title = {Synchronous Binarization for Machine Translation},
booktitle = {Proceedings of the Human Language Technology Conference of the NAACL, Main Conference},
month = {June},
address = {New York City, USA},
publisher = {Association for Computational Linguistics},
pages = {256--263},
url = {
http://www.aclweb.org/anthology/N/N06/N06-1033},
year = 2006
}
(Zhang et al., 2006;
Wang, Wei and Knight, Kevin and Marcu, Daniel (2007):
Binarizing Syntax Trees to Improve Syntax-Based Machine Translation Accuracy, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)
@InProceedings{wang-knight-marcu:2007:EMNLP-CoNLL2007,
author = {Wang, Wei and Knight, Kevin and Marcu, Daniel},
title = {Binarizing Syntax Trees to Improve Syntax-Based Machine Translation Accuracy},
booktitle = {Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)},
pages = {746--754},
url = {
http://www.aclweb.org/anthology/D/D07/D07-1078},
year = 2007
}
Wang et al., 2007) or the target side grammar
Huang, Liang (2007):
Binarization, Synchronous Binarization, and Target-side Binarization, Proceedings of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation
@InProceedings{huang:2007:SSST,
author = {Huang, Liang},
title = {Binarization, Synchronous Binarization, and Target-side Binarization},
booktitle = {Proceedings of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation},
month = {April},
address = {Rochester, New York},
publisher = {Association for Computational Linguistics},
pages = {33--40},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0405},
year = 2007
}
(Huang, 2007). A k-arization method with linear time complexity is proposed by
Zhang, Hao and Gildea, Daniel (2007):
Factorization of Synchronous Context-Free Grammars in Linear Time, Proceedings of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation
@InProceedings{zhang-gildea:2007:SSST,
author = {Zhang, Hao and Gildea, Daniel},
title = {Factorization of Synchronous Context-Free Grammars in Linear Time},
booktitle = {Proceedings of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation},
month = {April},
address = {Rochester, New York},
publisher = {Association for Computational Linguistics},
pages = {25--32},
url = {
http://www.aclweb.org/anthology/W/W07/W07-0404},
year = 2007
}
Zhang and Gildea (2007).
Nesson, Rebecca and Satta, Giorgio and Shieber, Stuart M. (2008):
Optimal k-arization of Synchronous Tree-Adjoining Grammar, Proceedings of ACL-08: HLT
@InProceedings{nesson-satta-shieber:2008:ACLMain,
author = {Nesson, Rebecca and Satta, Giorgio and Shieber, Stuart M.},
title = {Optimal $k$-arization of Synchronous Tree-Adjoining Grammar},
booktitle = {Proceedings of ACL-08: HLT},
month = {June},
address = {Columbus, Ohio},
publisher = {Association for Computational Linguistics},
pages = {604--612},
url = {
http://www.aclweb.org/anthology/P/P08/P08-1069},
year = 2008
}
Nesson et al. (2008) present a k-arization method for STAG grammars.
Benchmarks
Discussion
Related Topics
New Publications
Büchse, Matthias and Koller, Alexander and Vogler, Heiko (2013):
General binarization for parsing and translation, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
@InProceedings{buchse-koller-vogler:2013:ACL2013,
author = {B\"{u}chse, Matthias and Koller, Alexander and Vogler, Heiko},
title = {General binarization for parsing and translation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {145--154},
url = {
http://www.aclweb.org/anthology/P13-1015},
year = 2013
}
Büchse et al. (2013)
Mark-Jan Nederhof and Giorgio Satta (2011):
Splittability of Bilexical Context-Free Grammars is Undecidable, Computational Linguistics
@Article{CL:2011-4009,
author = {Mark-Jan Nederhof and Giorgio Satta},
title = {Splittability of Bilexical Context-Free Grammars is Undecidable},
journal = {Computational Linguistics},
volume = {37},
number = {4},
url = {
http://aclweb.org/anthology-new/J/J11/J11-4009.pdf},
year = 2011
}
Nederhof and Satta (2011)
Xiao, Tong and Li, Mu and Zhang, Dongdong and Zhu, Jingbo and Zhou, Ming (2009):
Better Synchronous Binarization for Machine Translation, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
@InProceedings{xiao-EtAl:2009:EMNLP,
author = {Xiao, Tong and Li, Mu and Zhang, Dongdong and Zhu, Jingbo and Zhou, Ming},
title = {Better Synchronous Binarization for Machine Translation},
booktitle = {Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing},
month = {August},
address = {Singapore},
publisher = {Association for Computational Linguistics},
pages = {362--370},
url = {
http://www.aclweb.org/anthology/D/D09/D09-1038},
year = 2009
}
Xiao et al. (2009)
DeNero, John and Pauls, Adam and Klein, Dan (2009):
Asynchronous Binarization for Synchronous Grammars, Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
@InProceedings{denero-pauls-klein:2009:Short,
author = {DeNero, John and Pauls, Adam and Klein, Dan},
title = {Asynchronous Binarization for Synchronous Grammars},
booktitle = {Proceedings of the ACL-IJCNLP 2009 Conference Short Papers},
month = {August},
address = {Suntec, Singapore},
publisher = {Association for Computational Linguistics},
pages = {141--144},
url = {
http://www.aclweb.org/anthology/P/P09/P09-2036},
year = 2009
}
DeNero et al. (2009)
Fang, Licheng and Chung, Tagyoung and Gildea, Daniel (2011):
Terminal-Aware Synchronous Binarization, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Techologies
@InProceedings{fang-chung-gildea:2011:ACL-HLT2011,
author = {Fang, Licheng and Chung, Tagyoung and Gildea, Daniel},
title = {Terminal-Aware Synchronous Binarization},
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Techologies},
month = {June},
address = {Portland, Oregon, USA},
publisher = {Association for Computational Linguistics},
pages = {401--406},
url = {
http://www.aclweb.org/anthology/P11-2070},
year = 2011
}
Fang et al. (2011)