During Fall 2024, we meet on Mondays at 11:00am in Hackerman 306.
Looking for a paper to present? see here
Zoom link: https://wse.zoom.us/j/7337865238
Date | Paper Presentation (~30min) | Topic | Research Presentation (~20min) |
December 2 | |||
November 25 | Thanksgiving Break | ||
November 18 | Weina Dai | TBA | Weina Dai |
November 11 | Dedicated Research Discussion Day | ||
November 4 | Bismarck Odoom | SEAMLESSEXPRESSIVELM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought (Gong et al., 2024) | Bismarck Odoom |
October 28 | Steven Tan | Past, Present, and Future of Multi-modal Multi-stream Modeling | Steven Tan |
October 21 | TaiMing Lu | Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability (Briakou et al., 2023) | |
October 14 | Neha Verma | SliceGPT: Compress Large Language Models by Deleting Rows and Columns (Ashkboos et al., 2024) | |
October 7 | Dedicated Research Discussion Day | ||
September 30 | Henry Li Xinyuan | Multilingual Multi-accented Multi-speaker TTS with RADTTS (Badlani et al., 2023) | Rachel Wicks |
September 23 | Henry Li Xinyuan | Direct speech-to-speech translation with discrete units (Lee et al., 2022) | Henry Li Xinyuan |
September 16 | Niyati Bafna | Understanding Data Augmentation in Neural Machine Translation: Two Perspectives towards Generalization (Li et al., 2019) | Neha Verma |
September 9 | Rachel Wicks | Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model (Ustun et al., 2024) | Nathaniel Robinson |
September 2 | Labor Day | ||
August 26 | Introductions |
Date | Presenter | Topic |
December 4 | Rachel Wicks, then Haoyue Guan | WMT Practice Talk, then Machine Translation between Spoken Languages and Signed Languages Represented in SignWriting (EACL 2023) [https://aclanthology.org/2023.findings-eacl.127] |
November 27 | Ashi Garg | Incorporating Probing Signals Into Multimodal Machine Translation via Visual Question-Answering Pairs (Arxiv 2023) [https://arxiv.org/pdf/2310.17133.pdf] |
November 20 | Fall Recess (no meeting) | |
November 13 | Liz Salesky | Local Byte Fusion for Neural Machine Translation (ACL 2023) https://aclanthology.org/2023.acl-long.397/ |
November 6 | Nate Robinson | Lego-MT https://arxiv.org/pdf/2212.10551.pdf |
October 30 | Amir Hussein | SeamlessM4T https://ai.meta.com/research/publications/seamlessm4t-massively-multilingual-multimodal-machine-translation |
October 23 | Niyati Bafna | When Does Translation Require Context? A Data-driven, Multilingual Exploration https://aclanthology.org/N18-1032.pdf |
October 16 | Zike Hu | Lin et al. (ACL 2021)Learning Language Specific Sub-network for Multilingual Machine Translation |
October 9 | Steven Tan | Rubenstein et al. (Arxiv 2023) AudioPaLM: A Large Language Model That Can Speak and Listen Borsos et al. (Arxiv 2023) AudioLM: a Language Modeling Approach to Audio Generation |
October 2 | Bismarck Odoom | Dong et al. (Arxiv 2023) PolyVoice: Language Models for Speech to Speech Translation |
September 25 | Chutong Meng | Zhang et al. (Findings of ACL 2023) DUB: Discrete Unit Back-translation for Speech Translation |
September 18 | Tianjian Li | Ji et al. (ICLR 2023) Tailoring Language Generation Models under Total Variation Distance and slides |
September 11 | Henry Li Xinyuan | Bahar et al. (Arxiv 2020) Tight Integrated End-to-end Training for Cascaded Speech Translation, Tran et al. (EMNLP 2022) Does Joint Training Really Help Cascaded Speech Translation?, and Dalmia et al. (NAACL 2021) Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks |
September 4 | Labor Day (no meeting) | |
August 28 | Fall sign up and introductions |
Date | Presenter | Topic |
April 25 | Marcin | Chat with Marcin Junczys-Dowmunt (Microsoft) - also: attend Shuoyang Ding's PhD Thesis defense at noon in Ames 234 |
April 18 | — | No meeting today - attend Shuo Sun's PhD Thesis defense instead |
April 11 | Jeremy Gwinnup | Voita et al (ACL 2021): Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation |
April 4 | Rachel Wicks | Fan et al. (JMLR 2021): Beyond English-Centric Multilingual Translation ; Might discuss Tran et al. (WMT 2021) Facebook AI’s WMT21 News Translation Task Submission |
March 28 | Stella Li | Yang et al. (EMNLP 2020): CSP:Code-Switching Pre-training for Neural Machine Translation |
March 21 | — | (spring break) |
March 14 | Liz Salesky | Dou & Neubig (EACL 2021): Word Alignment by Fine-tuning Embeddings on Parallel Corpora |
March 7 | Xutai Ma | Practice Talk |
February 28 | Neha Verma | Voita, Sennrich, and Titov (EMNLP 2021) Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the lens of Classical SMT |
February 21 | Steven Tan | Zhang et al. (ACL 2021): Crafting Adversarial Examples for Neural Machine Translation |
February 14 | Suzanna Sia | Eikema and Aziz (COLING 2020): Is MAP Decoding All You Need? The Inadequacy of the Mode in Neural Machine Translation, Eikema and Aziz (2021): Sampling-Based Minimum Bayes Risk Decoding for Neural Machine Translation |
February 7 | Shuoyang Ding | Nguyen et al. (ICLR 2021): Dataset Meta-Learning from Kernel Ridge-Regression |
January 31 | Xuan Zhang | Wang et al. (ACL 2021): Selective Knowledge Distillation for Neural Machine Translation |
January 24 | Kelly Marchisio | Current Work - GOAT for BLI |
January 17 | Rachel Wicks | Feng et al. (ACL 2021): Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation |
January 10 | — | (winter break) |
January 3 | Boyuan Zheng | Aditya et al. (ICLR 2021): Long-tail learning via logit adjustment |
(April 30 is the official last day of class)
Day | Presenter | Topic |
May 24 | Liz Salesky (my hero) | Practice Talk |
May 17 | (EMNLP deadline) | |
May 10 | Matt Post | Clark et al. (arXiv 2021) CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation |
May 3 | Shuoyang Ding | Some current work |
April 26 | Kevin Duh | Prato et al. (EMNLP Findings 2020) Fully Quantized Transformer for Machine Translation |
April 19 | Kelly Marchisio | Recent work: Embedding-Enhanced Giza++ |
April 12 | Matt Post | Some current work |
April 5 | Jake Bremerman | Yu et al. (TACL 2020): Better Document-Level Machine Translation with Bayes' Rule |
March 29 | ||
March 22 | (spring break) | |
March 15 | Jeremy Gwinnup | Ive et al. (EACL 2021): Exploring Supervised and Unsupervised Rewards in Machine Translation |
March 8 | Philipp Koehn | Meng et al. (WMT 2020): WeChat Neural Machine Translation Systems for WMT20 |
March 1 | Xutai Ma | Practice Talk |
February 22 | Philipp Koehn | Some highlights from WMT 2020 News Translation Shared Task sumissions |
February 15 | Rachel Wicks | Pei Zhang, Boxing Chen, Niyu Ge, Kai Fan (EMNLP 2020): Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation |
February 8 | Amrit Nidhi | Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam : Machine Translationese: Effects of Algorithmic Bias on Linguistic Complexity in Machine Translation |
February 1 | Shuoyang Ding | Jiatao Gu, Xiang Kong (arXiv 2020): Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade |
January 25 | First official day of class; intros | |
January 18 | Jeremy Gwinnup | Ozan Caglayan, Julia Ive, Veneta Haralampieva, Pranava Madhyastha, Loïc Barrault, Lucia Specia (EMNLP 2020): Simultaneous Machine Translation with Visual Context |
January 11 | Huda Khayrallah | practice talk |
Day | Presenter | Topic |
December 21 | Ankur Kejriwal | Markus Freitag and Orhan Firat(WMT'20):Complete Multilingual Neural Machine Translation |
December 14 | Ishita Tripathi | Marzieh Fadaee, Christof Monz (NGT @ ACL 2020): The Unreasonable Volatility of Neural Machine Translation Models |
December 7 | Milind Agarwal | Xinyi Wang, Yulia Tsvetkov, Graham Neubig (ACL 2020): Balancing Training for Multilingual Neural Machine Translation |
November 30 | Kelly Marchisio | Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty (EMNLP 2020): LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space |
November 23 | Thanksgiving Break -- no class (also, NAACL deadline) | |
November 16 | NAACL Paper Clinic | TBD |
November 9 | Shuoyang Ding | Recent Work |
November 2 | Jake Bremerman | Marina Fomicheva, Lucia Specia, Francisco Guzmán (ACL 2020): Multi-Hypothesis Machine Translation Evaluation |
October 26 | practice talks | |
October 19 | Amrit Nidhi | Jitao XU, Josep Crego, Jean Senellart (ACL 2020): Boosting Neural Machine Translation with Similar Translations |
October 12 | Practice talks | |
October 5 | Rachel Wicks | Wei Zou, Shujian Huang, Jun Xie, Xinyu Dai, Jiajun Chen (ACL 2020): A Reinforced Generation of Adversarial Examples for Neural Machine Translation |
September 28 | Philipp Koehn | Special sneak preview: Findings from WMT 2020 Shared Task on Parallel Sentence Pair Filtering |
September 21 | Ramchandran Muthukumar | Yong Cheng, Lu Jiang, Wolfgang Macherey, Jacob Eisenstein (ACL 2020): AdvAug: Robust Adversarial Augmentation for Neural Machine Translation |
September 14 | Xuan Zhang | Aji, Bogoychev, Heafield and Sennrich (ACL 2020): In Neural Machine Translation, What Does Transfer Learning Transfer? |
September 7 | Labor Day -- no class | |
August 31 | First day of class -- intros |
Day | Presenter | Topic |
August 24 | Matt Post | Tangled up in BLEU (Mathur et al., ACL 2020, Beyond Accuracy: Behavioral Testing of NLP Models with CheckList (Ribeiro et al., ACL 2020) |
August 17 | Philipp Koehn | Low Resource MT for DARPA LwLL |
August 10 | Paper Clinic | |
August 3 | Philipp Koehn | WNGT Shared Task on Efficient Decoding: Overview paper, Edinburgh's submission: |
July 27 | Liz Salesky | Kasai et al. (arXiv 2020): Deep Encoder, Shallow Decoder: Reevaluating the Speed-Quality Tradeoff in Machine Translation |
July 20 | Felicia Koerner | Zenkel et al. (ACL 2020): End-to-End Neural Word Alignment Outperforms GIZA++ |
July 6 & 13 | ACL Recap | |
June 29 | Shuoyang Ding | Edunov et al. (ACL 2020): On The Evaluation of Machine Translation Systems Trained With Back-Translation |
June 22 | Brian Thompson | Practice talk |
June 15 | ACL Practice talks | |
June 8 | Matt Post | Bapna & Firat (EMNLP 2019): Simple, Scalable Adaptation for Neural Machine Translation |
Day | Presenter | Topic |
Dec 16 | Ankur Kejriwal | A Universal Music Translation Network |
Dec 9 | - | ACL Deadline -- paper proofreading |
Dec 6 (friday) | - | ACL paper workshop |
Dec 2 | - | ACL paper workshop |
Nov 18 | Yash Kumar Lal | Kim et al. (2019): Pivot-based Transfer Learning for Neural Machine Translation between non-English Languages |
Nov 11 | Huda Khayrallah | current work |
Nov 4 | Liz Salesky | Provilkov et al. (2019): BPE-Dropout: Simple and Effective Subword Regularization |
Oct 28 | Xuan Zhang | practice talk |
Oct 21 | Pamela Shapiro | Wang et al. (2019): Multilingual Neural Machine Translation With Soft Decoupled Encoding |
Oct 14 | Brian | practice talk |
Oct 7 | Rachel Wicks | Guzmán et al. (2019): The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English |
Sep 30 | Matt Post | Zhang et al. (2018): Bridging the Gap between Training and Inference for Neural Machine Translation |
Sep 23 | Shuoyang Ding | practice talk |
Sep 16 | Kelly Marchisio | Artetxe et al. (2019): An Effective Approach to Unsupervised Machine Translation |
Sep 9 | - | EMNLP / MT summit / WMT recap |
Day | Presenter | Topic |
May 20 | - | EMNLP paper workshop at 11am in Hackerman 306 |
May 13 | - | EMNLP paper workshop |
May 6 | Vivian Tsai | Godin et al. (2018): Explaining Character-Aware Neural Networks for Word-Level Prediction: Do They Discover Linguistic Rules? |
April 29 | Cancelled (faculty meeting) | |
April 22 | S. Mielke | Cotterell et al. (2018): Are All Languages Equally Hard to Language-Model? / Current research |
April 15 | - | |
April 8 | Yash Kumar Lal | Edunov et al (2018): Understanding Backtranslation at Scale |
April 1 | Kelly Marchisio | Junczys-Dowmunt (2018): How I Learned to Stop Worrying and Love the Data (s Submission to the WMT2018 News Translation Task) |
March 25 | Huda Khayrallah | Practice talk |
March 18 | Arya McCarthy | Chen et al. (2018) The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation |
March 11 | Rebecca Knowles | Fadaee & Monz (2018): Back-Translation Sampling by Targeting Difficult Words in Neural Machine Translation |
March 4 | Note: this is the ACL deadline | |
February 25 | - | ACL paper workshop (review form) |
February 18 | Xuan Zhang | Shah et al. (NeurIPS 2018): Generative Neural Machine Translation |
February 11 | Shuoyang Ding | Zenkel et al. (2018) Adding Interpretable Attention to Neural Translation Models Improves Word Alignment |
February 4 | Gaurav Kumar | Current Research |
Day | Presenter | Topic |
December 17 | Pamela Shapiro | Deng et al. (NeurIPS 2018) Latent Alignment and Variational Attention |
December 10 | NAACL proofreading | |
December 3 | NAACL paper workshop (review form) | |
November 26 | Adi Renduchintala | Cherry et al. (EMNLP 2018) Revisiting Character-Based Neural Machine Translation with Capacity and Compression |
November 19 | EMNLP recap | |
November 12 | Xuan Zhang | Lample et al. (ICLR 2018): Unsupervised Machine Translation Using Monolingual Corpora Only |
October 22 | Huda Khayrallah | |
October 15 | Brian Thompson | WMT practice talk |
October 8 | Yash Kumar Lal | Platanios et al (EMNLP2018): Contextual Parameter Generation for Universal Neural Machine Translation |
October 1 | Kelly Marchisio | Neubig & Hu (EMNLP 2018): Rapid Adaptation of Neural Machine Translation to New Languages |
September 24 | Rebecca Knowles | Current Research |
September 17 | Brian Thompson | Kirkpatrick et al. (2017): Overcoming catastrophic forgetting in neural networks |
September 10 | Introductions |
Day | Presenter | Topic |
May 10 | Arya McCarthy | Passban et al. (NAACL 2018): Improving Character-based Decoding Using -Side Morphological Information for Neural Machine Translation |
May 3 | Pamela Shapiro | Review of Attention Mechanisms |
Apr 26 | Gaurav Kumar | Qi et al. (NAACL 2018): When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation? |
Apr 19 | Xutai Ma | Gu, et. al. (AAAI 2018) Search Engine Guided Neural Machine Translation, Zhang, et. al. (NAACL2018) Guiding Neural Machine Translation with Retrieved Translation Pieces |
Apr 12 | Adi Renduchintala | Yang, et. al. (NAACL2018): https://arxiv.org/pdf/1703.04887.pdf |
Apr 5 | Rebecca Knowles | Current research |
Mar 29 | Huda/Brian/Kevin | Current research |
Mar 22 | no meeting | |
Mar 15 | Becky Marvin & Steven Shearing | AMTA practice talks |
Mar 8 | Kevin Duh | Huang, et. al. (ICLR 2018): Towards Neural Phrase-based Machine Translation |
Mar 1 | Arya McCarthy | Wang et al. (2018): Translating Pro-Drop Languages with Reconstruction Models |
Feb 22 | Pamela Shapiro | Belinkov and Bisk (2018): Synthetic and Natural Noise Both Break Neural Machine Translation |
Feb 15 | Shuoyang Ding | Gu et al. (2018): Non-Autoregressive Neural Machine Translation |
Feb 8 | Juri Ganitkevitch | Juri Ganitkevitch PhD defense (9am, Malone 107): Large-Scale Paraphrasing for Text-to-Text Generation |
Feb 1 | Gaurav Kumar | Artetxe et al. (2017): Unsupervised Neural Machine Translation |
Day | Presenter | Topic |
Dec 13 | Xutai Ma/Shuoyang Ding | Current research |
Dec 6 | Adi Renduchintala | He et al. (NIPS 2016): Dual Learning for Machine Translation |
Nov 29 | Philipp Koehn | Ghader and Monz (IJCNLP 2017): What does Attention in Neural Machine Translation Pay Attention to? |
Nov 22 | No meeting: Thanksgiving break | |
Nov 15 | Huda Khayrallah | Practice Talk |
Nov 8 | Pamela Shapiro | Artxetxe et al. (ACL 2017): Learning bilingual word embeddings with (almost) no bilingual data |
Nov 1 | Becky Marvin | Nguyen and Chiang. (2017). Improving Lexical Choice in Neural Machine Translation |
Oct 25 | Rebecca Knowles | Carpuat et. al. (2017). Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation |
Oct 18 |