FBK-Fairseq
Open source repository with the code and models used in recent papers
Read Moreby Marco Gaido | May 17, 2024 | Software | 0
Open source repository with the code and models used in recent papers
Read Moreby Marco Gaido | May 17, 2024 | Software | 0
SubSONAR evaluates the quality of SRT files using the multilingual multimodal SONAR model. The evaluation accounts for the semantic similarity (computed as a cosine similarity) between each subtitle block and the corresponding...
Read Moreby Marco Gaido | May 17, 2024 | Software | 0
pangolinn is a Python library for neural network developers that contains test suites aimed at...
Read Moreby Matteo Negri | May 30, 2023 | Software | 0
A neural adaptive machine translation system that adapts to context and learns from corrections
Read Moreby Dennis Fucci | May 30, 2023 | Software | 0
AQET (Adaptive Quality Estimation Tool) is an open-source package for performing Quality Estimation for Machine Translation able to continuously learn from post-edited sentences.
Read Moreby Andrea Piergentili | May 30, 2023 | Software | 0
An extension of MGIZA++, which allows to align sentence pair in an online mode.
Read Moreby Dennis Fucci | May 30, 2023 | Software | 0
The IRST Language Modeling (IRSTLM) Toolkit features algorithms and data structures suitable to estimate, store, and access very large n-gram language models.
Read Moreby Dennis Fucci | May 30, 2023 | Software | 0
Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair.
Read More
🇦🇹 I’ll be in Vienna for #ACL2025NLP!
Interested in training a SpeechLLM without a lot of params or data? Come to my poster:
🖼️ Mon, 18:00
Also into Speech Summarization? Join my IWSLT talk in collab with @fbk_mt:
🎤 Fri, 14:00
Happy to chat - come say hi! 😎
Papers in 🧵
Sara Papi, Maike Z\"ufle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues, "MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks,"
Our pick of the week by @mgaido91: "WhisperKit: On-device Real-time ASR with Billion-Scale Transformers" by Atila Orhon, Arda Okan, Berkin Durmus, @zachnagengast, and Eduardo Pacheco (ICML 2025)
#speech #speechtech #whisper #ASR #realtime
A couple of weeks before presenting our large-scale speech model compression task at IWSLT, here there is of the first attempts to bring large-scale models to the devices on the edge: https://arxiv.org/pdf/2507.10860... Hope to see more works along this direction!