Preprocessing was done with Monoses (https://github.com/artetxem/monoses).
Description of files:
If you use these splits of the data, please cite:
Marchisio, Kelly and Duh, Kevin and Koehn, Philipp: When Does Unsupervised Machine Translation Work?, Proceedings of the Fifth Conference on Machine Translation (WMT), 2020.
@InProceedings{marchisio-duh-koehn:2020:WMT,
  author    = {Marchisio, Kelly and Duh, Kevin and Koehn, Philipp},
  title     = {When Does Unsupervised Machine Translation Work?},
  booktitle = {Proceedings of the Fifth Conference on Machine Translation},
  month     = {November},
  year      = {2020},
  publisher = {Association for Computational Linguistics},
}