We are happy to announce that Sara Papi is attending ICLR 2026 in Rio de Janeiro, Brazil ๐Ÿ‡ง๐Ÿ‡ท.


She will present our latest work, MCIF: the first human-annotated crosslingual + multimodal benchmark for instruction following in the scientific domain.
In this paper, we evaluate how current LLMs, SpeechLLMs, VideoLLMs, and MLLMs handle instructions across:
๐Ÿ”น Text, speech, and video
๐Ÿ”น English, German, Italian, and Chinese
๐Ÿ”น Recognition, translation, QA, and summarization
๐Ÿ”น Short-form and long-form inputs

This is the result of a collaboration with Maike Zรผfle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, and Jan Niehues from Machine Translation at FBK, Artificial Intelligence for Language Technologies (AI4LT), and Translated within the Meetween project!

More details about the paper and the poster session can be found here