simulstream
simulstream is a Python library for simultaneous/streaming speech recognition and translation. It...
Read Moreby Marco Gaido | Jan 20, 2026 | Software | 0
simulstream is a Python library for simultaneous/streaming speech recognition and translation. It...
Read Moreby Marco Gaido | May 17, 2024 | Software | 0
Open source repository with the code and models used in recent papers
Read Moreby Marco Gaido | May 17, 2024 | Software | 0
SubSONAR evaluates the quality of SRT files using the multilingual multimodal SONAR model. The evaluation accounts for the semantic similarity (computed as a cosine similarity) between each subtitle block and the corresponding...
Read Moreby Marco Gaido | May 17, 2024 | Software | 0
pangolinn is a Python library for neural network developers that contains test suites aimed at...
Read Moreby Matteo Negri | May 30, 2023 | Software | 0
A neural adaptive machine translation system that adapts to context and learns from corrections
Read Moreby Dennis Fucci | May 30, 2023 | Software | 0
AQET (Adaptive Quality Estimation Tool) is an open-source package for performing Quality Estimation for Machine Translation able to continuously learn from post-edited sentences.
Read Moreby Andrea Piergentili | May 30, 2023 | Software | 0
An extension of MGIZA++, which allows to align sentence pair in an online mode.
Read Moreby Dennis Fucci | May 30, 2023 | Software | 0
The IRST Language Modeling (IRSTLM) Toolkit features algorithms and data structures suitable to estimate, store, and access very large n-gram language models.
Read Moreby Dennis Fucci | May 30, 2023 | Software | 0
Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair.
Read More
🏝️ Yesterday at #LREC2026, Palma de Mallorca!
@lina_conti presented "Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation" at the poster session.
📄Paper:
💻Code: https://github.com/lina-conti/voice-bias-coreference
#SpeechTranslation #NLProc
How does the granularity of speech-text pairs impact SpeechLLM performance, and what is the optimal way to interleave tokens? Furthermore, what are the best practices for generating synthetic data to boost training?🧐
🎙️ Our paper on connecting Speech Foundation Models with LLMs is featured in the SpeechLMM Training Journal on Weights & Biases.
Read it 👉 https://bit.ly/4svG7ll
SpeechLMM 2.0 coming this summer. 👀
#Meetween #SpeechLMM #AI #NLP