Publications

Publications in reversed chronological order. You can find an up-to-date list of publications on my Google Scholar profile.

2025

  1. Joint speech and text machine translation for up to 100 languages
    Seamless Communication, Loı̈c Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, and 64 more authors
    Nature, 2025

2024

  1. Large Concept Models: Language Modeling in a Sentence Representation Space
    LCM Team, Loı̈c Barrault, Paul-Ambroise Duquenne, Maha Elbayad, Artyom Kozhevnikov, and 7 more authors
    arXiv e-prints, Dec 2024
  2. TMLR
    zipit.png
    Merging Text Transformer Models from Different Initializations
    Neha Verma, and Maha Elbayad
    Transactions on Machine Learning Research, Nov 2024
  3. SpiRit-LM: Interleaved Spoken and Written Language Model
    Tu Anh Nguyen, Benjamin Muller, Bokai Yu, Marta R. Costa-jussa, Maha Elbayad, and 9 more authors
    Oct 2024
  4. Scaling neural machine translation to 200 languages
    Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, and 34 more authors
    Nature, Jun 2024
  5. EAMT
    toxicity.png
    Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation
    Marta Costa-jussà, David Dale, Maha Elbayad, and Bokai Yu
    In Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), Jun 2024

2023

  1. Seamless: Multilingual Expressive and Streaming Speech Translation
    arXiv preprint arXiv:2312.05187, Nov 2023
  2. SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
    arXiv preprint arXiv:2308.11596, Aug 2023
  3. Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity
    Haoran Xu, Maha Elbayad, Kenton Murray, Jean Maillard, and Vedanuj Goswami
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
  4. ACL
    causes_cures.png
    Causes and Cures for Interference in Multilingual Translation
    Uri Shaham, Maha Elbayad, Vedanuj Goswami, Omer Levy, and Shruti Bhosale
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
  5. ACL
    moe_overfit.png
    Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation
    Maha Elbayad, Anna Sun, and Shruti Bhosale
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  6. Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
    Simeng Sun, Maha Elbayad, Anna Sun, and James Cross
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023

2022

  1. No Language Left Behind: Scaling Human-Centered Machine Translation
    NLLB Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, and 34 more authors
    arXiv preprint arXiv:2207.04672, Jul 2022

2020

  1. Thesis
    thesis.png
    Rethinking the Design of Sequence-to-Sequence Models for Efficient Machine Translation
    Maha Elbayad
    Jun 2020
  2. Efficient Wait-k Models for Simultaneous Machine Translation
    Maha Elbayad, Laurent Besacier, and Jakob Verbeek
    In Interspeech 2020, Oct 2020
  3. Depth-Adaptive Transformer
    Maha Elbayad, Jiatao Gu, Edouard Grave, and Michael Auli
    In International Conference on Learning Representations, Apr 2020
  4. ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
    Maha Elbayad, Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Antoine Caubrière, and 3 more authors
    In Proceedings of the 17th International Conference on Spoken Language Translation, Jul 2020
  5. Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English
    Maha Elbayad, Michael Ustaszewski, Emmanuelle Esperança-Rodier, Francis Brunet-Manquat, Jakob Verbeek, and 1 more author
    In Proceedings of the 28th International Conference on Computational Linguistics, Dec 2020

2018

  1. Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction
    Maha Elbayad, Laurent Besacier, and Jakob Verbeek
    In Proceedings of the 22nd Conference on Computational Natural Language Learning, Oct 2018
  2. ACL
    tokseq.png
    Token-level and sequence-level loss smoothing for RNN language models
    Maha Elbayad, Laurent Besacier, and Jakob Verbeek
    In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2018