Software

SubSONAR

SubSONAR evaluates the quality of SRT files using the multilingual multimodal SONAR model. The evaluation accounts for the semantic similarity (computed as a cosine similarity) between each subtitle block and the corresponding...

Read More

AQET

AQET (Adaptive Quality Estimation Tool) is an open-source package for performing Quality Estimation for Machine Translation able to continuously learn from post-edited sentences.

Read More

IRSTLM

The IRST Language Modeling (IRSTLM) Toolkit features algorithms and data structures suitable to estimate, store, and access very large n-gram language models.

Read More

MOSES

Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair.

Read More
Loading

🏝️ Yesterday at #LREC2026, Palma de Mallorca!
@lina_conti presented "Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation" at the poster session.
📄Paper:
💻Code: https://github.com/lina-conti/voice-bias-coreference
#SpeechTranslation #NLProc

How does the granularity of speech-text pairs impact SpeechLLM performance, and what is the optimal way to interleave tokens? Furthermore, what are the best practices for generating synthetic data to boost training?🧐

🎙️ Our paper on connecting Speech Foundation Models with LLMs is featured in the SpeechLMM Training Journal on Weights & Biases.

Read it 👉 https://bit.ly/4svG7ll

SpeechLMM 2.0 coming this summer. 👀

#Meetween #SpeechLMM #AI #NLP

Load More