Translation Difficulty

When deciding whether to use machine translation, we may want to know in advance how difficult a given translation task is, and hence what quality to expect.

Translation Difficulty is the main subject of 4 publications. 3 are discussed here.


Publications

Katrin Kirchhoff and Owen Rambow and Nizar Habash and Mona Diab (2007): Semi-Automatic Error Analysis for Large-Scale Statistical Machine Translation Systems, Proceedings of the MT Summit XI

The difficulty of translating a text may depend on many factors, such as source, genre, or dialect, some of which may be determined automatically (Kirchhoff et al., 2007).

Birch, Alexandra and Osborne, Miles and Koehn, Philipp (2008): Predicting Success in Machine Translation, Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing

Birch et al. (2008) study the quality of machine translation systems for different language pairs, all within the same domain of European Parliament proceedings. They find that the amount of reordering, the richness of target-side morphology, and language similarity are the main factors that determine how difficult translation is for a given language pair.

Philipp Koehn and Alexandra Birch and Ralf Steinberger (2009): 462 Machine Translation Systems for Europe, Proceedings of the Twelfth Machine Translation Summit (MT Summit XII)


Koehn et al. (2009) extend the work of Birch et al. (2008) to more languages and a different domain, using a more refined method to measure the degree of reordering.
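To make the notion of "amount of reordering" concrete, here is a minimal sketch of one simple way to quantify it: the fraction of source word pairs whose relative order is inverted in the translation (a normalized Kendall tau distance). This is an illustrative assumption, not necessarily the measure used by Birch et al. (2008) or Koehn et al. (2009). The function name `reordering_score` and the input format (one target position per source word, taken from a one-to-one word alignment) are hypothetical.

```python
from itertools import combinations


def reordering_score(target_positions):
    """Return the fraction of source word pairs that are inverted in the target.

    target_positions[i] is the target-side position aligned to source word i
    (hypothetical input format; assumes a one-to-one word alignment).
    0.0 means monotone translation, 1.0 means fully reversed order.
    """
    n = len(target_positions)
    if n < 2:
        return 0.0
    # Count pairs (i, j) with i < j whose target order is swapped.
    inverted = sum(
        1
        for i, j in combinations(range(n), 2)
        if target_positions[i] > target_positions[j]
    )
    total_pairs = n * (n - 1) // 2
    return inverted / total_pairs
```

Averaging such a score over the sentences of a parallel corpus gives a rough per-language-pair indicator that could serve as one explanatory variable alongside morphological richness and language similarity.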

Benchmarks

Discussion

New Publications

van der Wees, Marlies and Bisazza, Arianna and Monz, Christof (2015): Five Shades of Noise: Analyzing Machine Translation Errors in User-Generated Text, Proceedings of the Workshop on Noisy User-generated Text
@InProceedings{vanderwees-bisazza-monz:2015:WNUT,
author = {van der Wees, Marlies and Bisazza, Arianna and Monz, Christof},
title = {Five Shades of Noise: Analyzing Machine Translation Errors in User-Generated Text},
booktitle = {Proceedings of the Workshop on Noisy User-generated Text},
month = {July},
address = {Beijing, China},
publisher = {Association for Computational Linguistics},
pages = {28--37},
url = {http://www.aclweb.org/anthology/W15-4304},
year = 2015
}

MT Research Survey Wiki

A Comprehensive Survey of Neural and Statistical Machine Translation Research Publications
