EMNLP 2026

ELEVENTH CONFERENCE ON
MACHINE TRANSLATION (WMT26)

November, 2026
Budapest, Hungary
 

ANNOUNCEMENTS

Please register your team to participate: forms.gle/VfADz8BA3sASbNqu5

📢 March 15, 2026 - Website released, and the task is announced!

We are still updating this page, so please check back regularly.

OVERVIEW AND TASK DESCRIPTION

Building upon the resounding success of the “Shared Task: Low-Resource Indic Language Translation” at WMT23, WMT24, and WMT25, which saw enthusiastic participation from around the world, we are excited to announce the “Shared Task: Low-Resource Indic Language Translation” at WMT26.

Recent advances in Machine Translation (MT) have significantly improved translation quality across many languages. Approaches such as multilingual translation and transfer learning have expanded the reach of MT systems beyond well-resourced languages. Yet extending coverage to diverse, low-resource languages remains a challenge because little parallel data is available for training robust systems. The WMT 2026 Indic Machine Translation Shared Task tackles this challenge by focusing on low-resource Indic languages from diverse language families. The focus is on languages such as Assamese (an Indo-Aryan language spoken mainly in the north-eastern Indian state of Assam), Mizo (a Sino-Tibetan language spoken primarily in the Mizoram state of India), Khasi (an Austroasiatic language spoken in Meghalaya, India), Manipuri (also known as Meiteilon, a Sino-Tibetan language and the official language of Manipur, India), Nyishi (a Sino-Tibetan language of Arunachal Pradesh, India), and Kokborok (a Tibeto-Burman language spoken primarily by the Tripuri people).

By state, the North Eastern languages covered include Assamese (State: Assam), Bodo (State: Assam), Mizo (State: Mizoram), Khasi (State: Meghalaya), Manipuri (State: Manipur), Kokborok (State: Tripura), and Nyishi (State: Arunachal Pradesh).

This year’s task features two categories:

Category 1: (Moderate Training Data Available)

  • en-as: English ⇔ Assamese

  • en-lus: English ⇔ Mizo

  • en-kha: English ⇔ Khasi

  • en-mni: English ⇔ Manipuri

  • en-njz: English ⇔ Nyishi

  • en-mni-Mtei: English ⇔ Meitei

Category 2: (Very Limited Training Data)

  • en-bodo: English ⇔ Bodo

  • en-trp: English ⇔ Kokborok

  • en-mjw: English ⇔ Karbi

  • en-mrg: English ⇔ Mishing

  • en-nag: English ⇔ Nagamese

GOAL

The central objective is to develop MT systems that produce high-quality translations despite the constraints of data availability. Participants are encouraged to explore:

  • Monolingual Data Utilization: Leveraging monolingual data effectively for improved translation.

  • Multilingual Approaches: Investigating whether cross-lingual transfer benefits low-resource pairs.

  • Transfer Learning: Adapting models trained on richer language pairs to the target languages.

  • Innovative Techniques: Experimenting with novel methods specifically tailored for low-resource settings.

DEADLINES

March 18, 2026

Website released, and the task is announced!

March 20, 2026

Team Registration Opens

May 5, 2026

Team Registration Closes

May 7, 2026

Training Data Release (registered participants only)

June 30, 2026

Results Declared to Individual Teams

In line with WMT26

System Paper Submission

November 5-9, 2026

Under EMNLP Conference

All deadlines are in AoE (Anywhere on Earth). Dates are specified with respect to EMNLP 2026.

DATA

Category 1: (Moderate Training Data Available)

  • en-as: English ⇔ Assamese

  • en-lus: English ⇔ Mizo

  • en-kha: English ⇔ Khasi

  • en-mni: English ⇔ Manipuri

  • en-njz: English ⇔ Nyishi

Category 2: (Very Limited Training Data)

  • en-bodo: English ⇔ Bodo

  • en-trp: English ⇔ Kokborok

TEST DATA

Great news! 🎉 We’ve released the test datasets. 📥 Access the test data via the DOWNLOAD link.

📝 Test Set Output Submission Instructions

For each language pair (e.g., English → Assamese), you may submit up to:

1 Primary system

  • A constrained system (uses only official data), or

  • A system that uses additional monolingual resources, pretrained models, etc., depending on your setup (these resources should be publicly available).

Up to 2 Contrastive systems (Optional — may use external or additional data beyond the task set.)

🚨 Maximum: 3 submissions per language pair.
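
The limits above can be checked before submitting. The following is a minimal sketch, not an official validator: the function name `check_submission_limits` and the input shape are hypothetical, and it assumes that a "constrained" submission occupies the single primary slot, which is one reading of the rules above.

```python
from collections import defaultdict

# Limits taken from the instructions: 1 primary and up to 2 contrastive
# systems per language pair (3 submissions in total).
LIMITS = {"primary": 1, "contrastive": 2}

# Assumption: a "constrained" system counts against the primary slot.
SLOT_OF = {"primary": "primary", "constrained": "primary", "contrastive": "contrastive"}


def check_submission_limits(submissions):
    """Return a list of human-readable problems (empty if all limits hold).

    `submissions` is a list of (language_pair, submission_type) tuples,
    for example ("en_to_as", "primary").
    """
    counts = defaultdict(lambda: {"primary": 0, "contrastive": 0})
    for pair, kind in submissions:
        if kind not in SLOT_OF:
            raise ValueError(f"unknown submission type: {kind!r}")
        counts[pair][SLOT_OF[kind]] += 1

    problems = []
    for pair in sorted(counts):
        for slot, limit in LIMITS.items():
            n = counts[pair][slot]
            if n > limit:
                problems.append(f"{pair}: {n} {slot} submissions, at most {limit} allowed")
    return problems
```

An empty list means the planned submissions fit within the per-pair limits.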

📄 Submission File Naming Convention

Each submission file must be named using this format:

<TEAM_NAME>_<SUBMISSION_TYPE>_<LANGUAGE_PAIR>.txt

Where:

  • TEAM_NAME — Your registered team name (e.g., CNLP-NITS-PP)

  • SUBMISSION_TYPE — One of: primary, constrained, or contrastive

  • LANGUAGE_PAIR — Use format like en_to_as for English → Assamese

🔍 Example

For a primary system submission from team CNLP-NITS-PP for English → Assamese:

CNLP-NITS-PP_primary_en_to_as.txt

Repeat this pattern for all your submissions.
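
To avoid naming mistakes, the file name can be generated rather than typed by hand. This is an illustrative sketch only: the helper `submission_filename` is hypothetical, and the language-pair pattern (letters and hyphens around `_to_`) is an assumption inferred from the single `en_to_as` example above.

```python
import re

# Submission types listed in the naming convention.
SUBMISSION_TYPES = ("primary", "constrained", "contrastive")

# Assumption: language pairs look like "<src>_to_<tgt>", e.g. "en_to_as".
LANGUAGE_PAIR_RE = re.compile(r"[A-Za-z-]+_to_[A-Za-z-]+")


def submission_filename(team_name, submission_type, language_pair):
    """Build <TEAM_NAME>_<SUBMISSION_TYPE>_<LANGUAGE_PAIR>.txt."""
    if not team_name:
        raise ValueError("team_name must not be empty")
    if submission_type not in SUBMISSION_TYPES:
        raise ValueError(f"submission_type must be one of {SUBMISSION_TYPES}")
    if not LANGUAGE_PAIR_RE.fullmatch(language_pair):
        raise ValueError(f"unexpected language pair format: {language_pair!r}")
    return f"{team_name}_{submission_type}_{language_pair}.txt"


print(submission_filename("CNLP-NITS-PP", "primary", "en_to_as"))
```

For the example team this reproduces the file name shown above, CNLP-NITS-PP_primary_en_to_as.txt.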

🧾 System Description File (Mandatory!)

You must include a brief abstract system description in one of the following formats:

  • .pdf

  • .doc

  • .tex

  • .txt

This file is mandatory. Submissions without it will not be published in the final results.

📦 Packaging Your Submission

Place the following files in a single .zip archive:

  • All your system output files

  • The abstract system description file

Name the zip file using your team name:

<TEAM_NAME>.zip

📌 Example

For team CNLP-NITS-PP, name the file:

CNLP-NITS-PP.zip
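
The packaging step can be scripted with Python's standard `zipfile` module. The sketch below is one possible way to do it, not a required tool: the function `package_submission` and its parameters are hypothetical, and storing every file at the top level of the archive is an assumption, since the instructions do not specify an internal layout.

```python
import zipfile
from pathlib import Path

# Accepted formats for the system description file, as listed above.
DESCRIPTION_SUFFIXES = {".pdf", ".doc", ".tex", ".txt"}


def package_submission(team_name, output_files, description_file, dest_dir="."):
    """Create <TEAM_NAME>.zip containing the outputs and the description file."""
    output_files = [Path(p) for p in output_files]
    description_file = Path(description_file)
    if not output_files:
        raise ValueError("at least one system output file is required")
    if description_file.suffix.lower() not in DESCRIPTION_SUFFIXES:
        raise ValueError(f"description file must be one of {sorted(DESCRIPTION_SUFFIXES)}")

    archive_path = Path(dest_dir) / f"{team_name}.zip"
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in [*output_files, description_file]:
            # Assumption: files sit at the archive root, without subfolders.
            archive.write(path, arcname=path.name)
    return archive_path
```

Calling `package_submission("CNLP-NITS-PP", outputs, "description.pdf")` would write CNLP-NITS-PP.zip to the current directory, where `outputs` and `description.pdf` stand in for your own files.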

📧 Email Submission

Send your .zip file to the following address:

Email: lrilt.wmt@gmail.com

Subject Line:

<TEAM_NAME>: Submission File for Shared Task: Low-Resource Indic Language Translation

Example Subject Line:

CNLP-NITS-PP: Submission File for Shared Task: Low-Resource Indic Language Translation
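
For teams that prefer to prepare the message programmatically, the sketch below builds (but does not send) an email with the required subject line, using Python's standard `email` package. The function name, the sender address, and the body text are hypothetical; only the recipient address and the subject format come from the instructions above.

```python
from email.message import EmailMessage

SUBMISSION_ADDRESS = "lrilt.wmt@gmail.com"
SUBJECT_SUFFIX = "Submission File for Shared Task: Low-Resource Indic Language Translation"


def build_submission_email(team_name, zip_bytes, sender):
    """Return an EmailMessage with <TEAM_NAME>.zip attached; sending is left to you."""
    message = EmailMessage()
    message["From"] = sender  # hypothetical: use your own address
    message["To"] = SUBMISSION_ADDRESS
    message["Subject"] = f"{team_name}: {SUBJECT_SUFFIX}"
    message.set_content("Please find our shared task submission attached.")
    message.add_attachment(
        zip_bytes,
        maintype="application",
        subtype="zip",
        filename=f"{team_name}.zip",
    )
    return message
```

The resulting message can be handed to any mail client or SMTP library; attaching the archive manually in a webmail interface works equally well.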

🙏 Thank You!

Thank you for participating in the shared task! We look forward to your submissions and system descriptions.

CITATIONS

If you are using this data, please cite:

Paper Submission Process:

In line with WMT26

EVALUATION

Systems will undergo both automatic evaluation (using metrics such as BLEU, TER, and COMET) and human evaluation by native speakers for a comprehensive assessment of translation quality.

CONTACT

PAPER SUBMISSION

Your system paper submission should be prepared according to the WMT instructions and uploaded to START before the deadline (TBA, 2026); see the WMT main page.

ORGANIZERS

  • Santanu Pal, Wipro Innovation Network, Kolkata, India

  • Partha Pakray, National Institute of Technology, Silchar, India

  • Sandeep Kumar Dash, National Institute of Technology, Mizoram, India

  • Lenin Laitonjam, National Institute of Technology, Mizoram, India

  • Arnab Maji, North-Eastern Hill University, India

  • Saralin A Lyngdoh, North-Eastern Hill University, India

  • Riyanka Manna, Amrita Vishwa Vidyapeetham, Andhra Pradesh, India

  • Ajit Das, Bodoland University, India

  • Anupam Jamatia, National Institute of Technology, Agartala, India

  • Koj Sambyo, National Institute of Technology, Arunachal Pradesh, India

  • Sunita Warjri, Indian Institute of Information Technology Senapati, Manipur, India

  • Uzzal Sharma, Birangana Sati Sadhani Rajyik Vishwavidyalaya Golaghat, Assam, India

STUDENT COORDINATORS

  • Advaitha Vetagiri, National Institute of Technology, Silchar, India

  • Shyambabu Pandey, National Institute of Technology, Silchar, India

  • Kshetrimayum Boynao Singh, National Institute of Technology, Silchar, India

LANGUAGE RESOURCE CONTRIBUTORS

  • Raju Narzary, Bodoland University, Assam, India

  • Baneswar Baro, Bodoland University, Assam, India

  • Shwdwmshri Basumatary, Bodoland University, Assam, India

  • Jekolin Machahary, Bodoland University, Assam, India

  • Khusbu Basumatary, Bodoland University, Assam, India

  • Boney Moshahary, Bodoland University, Assam, India

  • Dwima Basumatary, Bodoland University, Assam, India

  • Jewel Basumatary, Bodoland University, Assam, India

  • Eusebius Lawai Lyngdoh, North-Eastern Hill University, Meghalaya, India

  • Ibadonbok Syiemlieh, North-Eastern Hill University, Meghalaya, India

  • Debabrata Khargharia, National Institute of Technology Silchar, Assam, India

  • Nibedita Kalita, National Institute of Technology Silchar, Assam, India

  • Many more to be added soon