Software

SubSONAR

SubSONAR evaluates the quality of SRT files using the multilingual multimodal SONAR model. The evaluation accounts for the semantic similarity (computed as a cosine similarity) between each subtitle block and the corresponding...

Read More

AQET

AQET (Adaptive Quality Estimation Tool) is an open-source package for performing Quality Estimation for Machine Translation able to continuously learn from post-edited sentences.

Read More

IRSTLM

The IRST Language Modeling (IRSTLM) Toolkit features algorithms and data structures suitable to estimate, store, and access very large n-gram language models.

Read More

MOSES

Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair.

Read More
Loading

🎙️ Two people. Two languages. One conversation! 
No delays. No switching languages. No one is left out.

This is what we are building.

#SpeechAI #MultilingualAI #HorizonEurope

🎉 We’re very happy to welcome our new postdoc @LucaCorbucci , who will be working on multimodal LLMs.
Looking forward to the exciting research ahead! 🚀

@FBK_research

Four years ago, NLLB set a milestone with MT for 200 languages. Today we present OMT: a family of models that extend support to 1600 languages while delivering competitive results in high/mid-resource language, with our 1B-8B models matching frontier and open 70B LLMs.

🧵(1/n)

📢I'm organizing a BoF session at #EACL2026 called Tokenization & Beyond, aiming to gather researchers exploring tokenization and alternatives such as byte-level and pixel-based approaches. Sign up using the form if you're interested! #NLProc @eaclmeeting

Load More