Sara Papi
Researcher
- E-mail: spapi@fbk.eu
- Website: https://sarapapi.github.io/
- Google Scholar: My citations
- Semantic Scholar: Profile
- Twitter: Profile
- LinkedIn: Profile
Short bio
I am an AI Researcher at FBK (Fondazione Bruno Kessler), working on speech processing and multimodal LLMs within the MEETWEEN and DVPS Horizon European projects. I received my PhD cum laude in Information Engineering and Computer Science from the University of Trento in 2024, with a focus on simultaneous speech translation and subtitling. My research interests span multimodal and crosslingual instruction-following models, speech foundation models, and LLMs. My work has been recognized with awards, including the Best PhD Graduate 2024 Award in Information and Communication Technology from the University of Trento, an Outstanding Paper and SAC Award at ACL 2024, and a Social Impact Paper Award at EMNLP 2024. I actively contribute to the community as an organizer of the IWSLT Evaluation Campaign and as an Area Chair or reviewer for major conferences in speech and NLP, such as *ACL and Interspeech.
Publications
-
Papi, Sara; Züfle, Maike; Gaido, Marco; Savoldi, Beatrice; Liu, Danni; Douros, Ioannis; Bentivogli, Luisa; Niehues, Jan,in «»,The Fourteenth International Conference on Learning Representations,,vol. ,n. ,2026, pp. -
-
Lam, Tsz Kin; Gaido, Marco; Papi, Sara; Bentivogli, Luisa; Haddow, Barry,in «»,Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers),,vol. ,n. ,2025, pp. 2994-3006
-
Verdini, Francesco; Melucci, Pierfrancesco; Perna, Stefano; Cariaggi, Francesco; Gaido, Marco; Papi, Sara; Mazurek, Szymon; Kasztelnik, Marek; Bentivogli, Luisa; Bratières, Sebastien; Merialdo, Paolo; Scardapane, Simone,in «»,Proc. Interspeech 2025,,vol. ,n. ,2025, pp. 1813-1817
-
Papi, Sara; Gaido, Marco; Bentivogli, Luisa; Brutti, Alessio; Cettolo, Mauro; Gretter, Roberto; Matassoni, Marco; Nabih, Mohamed; Negri, Matteo,in «»,Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025),,vol. ,n. ,2025, pp. -
-
Gaido, Marco; Papi, Sara; Bentivogli, Luisa; Brutti, Alessio; Cettolo, Mauro; Gretter, Roberto; Matassoni, Marco; Nabih, Mohamed; Negri, Matteo,in «»,Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025),,vol. ,n. ,2025, pp. 47-55
-
Abdulmumin, Idris; Agostinelli, Victor; Alumäe, Tanel; Anastasopoulos, Antonios; Bentivogli, Luisa; Bojar, Ondřej; Borg, Claudia; Bougares, Fethi; Cattoni, Roldano; Cettolo, Mauro; Chen, Lizhong; Chen, William; Dabre, Raj; Estève, Yannick; Federico, Marcello; Fishel, Mark; Gaido, Marco; Javorský, Dávid; Kasztelnik, Marek; Kponou, Fortuné; Krubiński, Mateusz; Kin Lam, Tsz; Liu, Danni; Matusov, Evgeny; Kumar Maurya, Chandresh; P. Mccrae, John; Mdhaffar, Salima; Moslem, Yasmin; Murray, Kenton; Nakamura, Satoshi; Negri, Matteo; Niehues, Jan; Kr. Ojha, Atul; Ortega, John E.; Papi, Sara; Pecina, Pavel; Polák, Peter; Połeć, Piotr; Sankar, Ashwin; Savoldi, Beatrice; Sethiya, Nivedita; Sikasote, Claytone; Sperber, Matthias; Stüker, Sebastian; Sudoh, Katsuhito; Thompson, Brian; Turchi, Marco; Waibel, Alex; Wilken, Patrick; Zevallos, Rodolfo; Zouhar, Vilém; Züfle, Maike,in «»,Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025),,vol. ,n. ,2025, pp. 412-481
-
Züfle, Maike; Papi, Sara; Savoldi, Beatrice; Gaido, Marco; Bentivogli, Luisa; Niehues, Jan,in «»,Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025),,vol. ,n. ,2025, pp. 19-32
-
Rao Koluguri, Nithin; Sekoyan, Monica; Zelenfroynd, George; Meister, Sasha; Ding, Shuoyang; Kostandian, Sofia; Huang, He; Karpov, Nikolay; Balam, Jagadeesh; Lavrukhin, Vitaly; Peng, Yifan; Papi, Sara; Gaido, Marco; Brutti, Alessio; Ginsburg, Boris,in «»,Proceedings of Interspeech,,vol. ,n. ,2025, pp. 3923-3927
-
Papi, Sara; Polák, Peter; Macháček, Dominik; Bojar, Ondřej,in «TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS»,,,vol. 13,n. ,2025, pp. 281-313
-
Papi, Sara; Gaido, Marco; Negri, Matteo; Bentivogli, Luisa,in «»,Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024),,vol. ,n. ,2024, pp. 72-79
-
Cettolo, Mauro; Piergentili, Andrea; Papi, Sara; Gaido, Marco; Negri, Matteo; Bentivogli, Luisa,in «»,Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024),,vol. ,n. ,2024, pp. -
-
Papi, Sara; Gaido, Marco; Pilzer, Andrea; Negri, Matteo,in «»,Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),,vol. ,n. ,2024, pp. 3657-3672
-
Ahmad, Ibrahim Said; Anastasopoulos, Antonios; Bojar, Ondřej; Borg, Claudia; Carpuat, Marine; Cattoni, Roldano; Cettolo, Mauro; Chen, William; Dong, Qianqian; Federico, Marcello; Haddow, Barry; Javorský, Dávid; Krubiński, Mateusz; Kim Lam, Tsz; Ma, Xutai; Mathur, Prashant; Matusov, Evgeny; Maurya, Chandresh; Mccrae, John; Murray, Kenton; Nakamura, Satoshi; Negri, Matteo; Niehues, Jan; Niu, Xing; Ojha, Atul Kr.; Ortega, John; Papi, Sara; Polák, Peter; Pospíšil, Adam; Pecina, Pavel; Salesky, Elizabeth; Sethiya, Nivedita; Sarkar, Balaram; Shi, Jiatong; Sikasote, Claytone; Sperber, Matthias; Stüker, Sebastian; Sudoh, Katsuhito; Thompson, Brian; Waibel, Alex; Watanabe, Shinji; Wilken, Patrick; Zemánek, Petr; Zevallos, Rodolfo,in «»,Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024),,vol. ,n. ,2024, pp. 1-11
-
Savoldi, Beatrice; Papi, Sara; Negri, Matteo; Guerberof-Arenas, Ana; Bentivogli, Luisa,in «»,Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing,,vol. ,n. ,2024, pp. 18048-18076
-
Gaido, Marco; Papi, Sara; Bentivogli, Luisa; Brutti, Alessio; Cettolo, Mauro; Gretter, Roberto; Matassoni, Marco; Nabih, Mohamed; Negri, Matteo,MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages,in «»,Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing,,vol. ,n. ,2024, pp. 13934-13947
-
Gaido, Marco; Papi, Sara; Negri, Matteo; Bentivogli, Luisa,in «»,Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),,vol. ,n. ,2024, pp. 14760-14778
-
Papi, Sara; Gaido, Marco; Negri, Matteo; Bentivogli, Luisa,StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection,in «»,Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),,vol. ,n. ,2024, pp. 3692-3707
-
Gaido, Marco; Papi, Sara; Negri, Matteo; Cettolo, Mauro; Bentivogli, Luisa,in «»,Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),,vol. ,n. ,2024, pp. 3673-3691
-
Gaido, Marco; Papi, Sara; Cettolo, Mauro; Cattoni, Roldano; Piergentili, Andrea; Negri, Matteo; Bentivogli, Luisa,in «»,Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024),,vol. ,n. ,2024, pp. 86-96
-
Papi, Sara; Wang, Peidong; Chen, Junkun; Xue, Jian; Kanda, Naoyuki; Li, Jinyu; Gaur, Yashesh,in «»,Proceedings of ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),IEEE,vol. ,n. ,2024, pp. -
-
Gaido, Marco; Papi, Sara; Negri, Matteo; Bentivogli, Luisa,in «»,Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024),,vol. ,n. ,2024, pp. 8184-8191
-
Gaido, Marco; Papi, Sara; Negri, Matteo; Turchi, Marco,Joint Speech Translation and Named Entity Recognition,INTERSPEECH 2023,2023, pp. 47-51
-
Papi, Sara; Negri, Matteo; Turchi, Marco,Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),2023, pp. 13340-13356
-
Papi, Sara; Gaido, Marco; Negri, Matteo,Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023),2023, pp. 159-168
-
Papi, Sara; Turchi, Marco; Negri, Matteo,INTERSPEECH 2023,2023, pp. 3974-3978
-
Fucci, Dennis; Gaido, Marco; Papi, Sara; Cettolo, Mauro; Negri, Matteo; Bentivogli, Luisa,in «»,Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing,,vol. ,n. ,2023, pp. 11505-11517
-
Papi, Sara; Gaido, Marco; Karakanta, Alina; Cettolo, Mauro; Negri, Matteo; Turchi, Marco,in «TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS»,vol. 11,2023, pp. 1355-1376
-
Papi, Sara; Wang, Peidon; Chen, Junku; Xue, Jian; Li, Jinyu; Gaur, Yashesh,Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments,Proceedings of 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),2023
-
Papi, Sara; Gaido, Marco; Negri, Matteo; Turchi, Marco,Findings of the Association for Computational Linguistics: EMNLP 2022,2022, pp. 141-153
-
Papi, Sara; Karakanta, Alina; Negri, Matteo; Turchi, Marco,Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers),Association for Computational Linguistics,2022, pp. 480-487
-
Papi, Sara; Gaido, Marco; Negri, Matteo; Turchi, Marco,Proceedings of the Third Workshop on Automatic Simultaneous Translation,Association for Computational Linguistics,2022, pp. 12-17
-
Gaido, Marco; Papi, Sara; Fucci, Dennis; Fiameni, Giuseppe; Negri, Matteo; Turchi, Marco,in «»,Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022),Association for Computational Linguistics,vol. ,n. ,2022, pp. 177-189
-
Papi, Sara; Gaido, Marco; Negri, Matteo; Turchi, Marco,Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021),2021, pp. 84-91
-
Karakanta, A.; Papi, S.; Negri, M.; Turchi, M.,Proceedings of MT Summit 2021,2021
-
Papi, S.; Negri, M.; Turchi, M.,Proceedings of the Eighth Italian Conference on Computational Linguistics,vol. 3033,2021
-
Papi, Sara; Gaido, Marco; Negri, Matteo; Turchi, Marco,Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,2021, pp. 1698-1706
-
Papi, Sara; Trentin, Edmondo; Gretter, Roberto; Matassoni, Marco; Falavigna, Daniele,Proceedings of Interspeech 2020,2020, pp. 3845-3849