Shared Task: Biomedical Translation Task
Task description
This task aims to evaluate systems on the translation of documents from the biomedical domain.
The test data will consist of biomedical abstracts and it will address the following language pairs:
- English-French and French-English (en/fr, fr/en)
- English-German and German-English (en/de, de/en)
- English-Italian and Italian-English (en/it, it/en)
- English-Portuguese and Portuguese-English (en/pt, pt/en)
- English-Russian and Russian-English (en/ru, ru/en)
- English-Spanish and Spanish-English (en/es, es/en)
Data
ATTENTION!!
We ask the participants to not download the Medline database by themselves in order to retrieve training data.
Submissions that are derived from a model that was trained on the whole PubMed will be not considered in the evaluation.
Participants can rely on training (and development) data from various sources, for instance:
- The Biomedical Translation repository includes links to
parallel corpora of scientific publications (en/pt, en/es, en/fr, en/de, en/zh, en/it, en/ru), among others
- The UFAL Medical Corpus (formerly HimLCorpus) includes
medical text from various sources for many language pairs (en/es, en/de, en/fr, en/ro)
HimL test sets can be used as the development sets for some language pairs (en/es, en/de, en/fr, en/ro)
- The Khresmoi development data can be used for some language pairs (en/es, en/de, en/fr).
- The UNCorpus contains training data for some languages (en/es, en/fr, en/zh)
- The MeSpEn corpus contains many parallel documents of en/es
- The Scielo full text corpus for en, es and pt
- The Brazilian Thesis and Dissertations for en/pt
Participants are also free to use out-of-domain data.
Evaluation
Evaluation will be carried out both automatically and manually.
Automatic evaluation will make use of standard machine translation metrics.
Native speakers of each of the languages will manually check the quality of the translation for a small sample of the submissions.
If necessary, we also expect participants to support us in the manual evaluation (accordingly to the number of submissions).
We plan to release test sets for the following language pairs and sources:
- Scientific abstracts:
- French/English (both directions)
- German/English (both directions)
- Italian/English (both directions)
- Portuguese/English (both directions)
- Russian/English (both directions)
- Spanish/English (both directions)
Scientific abstracts:
For the test set of Medline abstracts, the format will be plain text files.
The format will be the following:
(coming soon)
Submission Requirements
(coming soon)
Important dates
Release of test data | June/July, 2024 |
Results submission deadline | June/July, 2024 |
Paper submission deadline | TBA around 12th August (follows EMNLP) |
Paper notification | TBA (follows EMNLP) |
Camera-ready version due | TBA around 23rd September (follows EMNLP) |
Conference EMNLP | 12-13 November, 2024 |
All deadlines are in AoE (Anywhere on Earth).
Organisers
Rachel Bawden (University of Edinburgh, UK)
Giorgio Maria Di Nunzio (University of Padua, Italy)
Cristian Grozea (Fraunhofer Institute, Germany)
Antonio Jimeno Yepes (University of Melbourne, Australia)
Aurélie Névéol (Université Paris Saclay, CNRS, LISN, France)
Mariana Neves (German Federal Institute for Risk Assessment, Germany)
Roland Roller (DFKI, Germany)
Philippe Thomas (DFKI, Germany)
Federica Vezzani (University of Padua, Italy)
Maika Vicente Navarro, Maika Spanish Translator, Melbourne, Australia
Dina Wiemann (Novartis, Switzerland)
Lana Yeganova (NCBI/NLM/NIH, USA)
Please contact us in the mail wmtbiomedical@gmail.com.
Please join our discussion forum.