Software

SubSONAR

SubSONAR evaluates the quality of SRT files using the multilingual multimodal SONAR model. The evaluation accounts for the semantic similarity (computed as a cosine similarity) between each subtitle block and the corresponding...

Read More

AQET

AQET (Adaptive Quality Estimation Tool) is an open-source package for performing Quality Estimation for Machine Translation able to continuously learn from post-edited sentences.

Read More

IRSTLM

The IRST Language Modeling (IRSTLM) Toolkit features algorithms and data structures suitable to estimate, store, and access very large n-gram language models.

Read More

MOSES

Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair.

Read More
Loading

Our pick of the week by @FBKZhihangXie: "When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation" by Anna Min, et al, 2025.

Today's task: model compression!!

🎯 Goal: Compress a large, general-purpose multimodal model, making speech translation more efficient ⚡️, deployable 📲, and sustainable ♻️, while preserving translation quality ⭐️
#AI #SpeechTech #ModelCompression #LLMcompression

First up, a new task for 2025:
*Instruction-following for speech processing!*

Explore instruction-following for speech ⇨
Integrate speech foundation models with LLMs across tasks such as speech translation, recognition, summarization, and QA.

🔗:

📢Workshop gratuito 05/02: “Lo stato dell'arte nelle tecnologie per il riconoscimento del parlato.”
Diretta YouTube: https://www.youtube.com/live/i4x7w8fIIXo?si=wYvvrO3-MSh7Yik4
Registrazione: https://www.eventbrite.com/e/biglietti-lo-stato-dellarte-nelle-tecnologie-per-il-riconoscimento-del-parlato-1109098797359?aff=oddtdtcreator

Load More