The TC-STAR project is envisaged as a long-term effort to advance research in all core technologies for Speech-to-Speech Translation (SST). SST technology is a combination of Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text to Speech (TTS) (speech synthesis). The objectives of the project are ambitious: making a breakthrough in SST that significantly reduces the gap between human and machine translation performance. The project targets a selection of unconstrained conversational speech domains—speeches and broadcast news—and three languages: European English, European Spanish, and Mandarin Chinese. Accurate translation of unrestricted speech is well beyond the capability of today’s state-of-the-art research systems. Therefore, advances are needed to improve the state-of the-art technologies for speech recognition and speech translation.
Our pick of the week by @FBKZhihangXie: "#Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in #SpeechLLMs" by @WangDingdo2603, Junan Li, @HelenMeng_CUHK, et al. (#EMNLP2025)
#SLU #SpeechTech
🚀 New paper: Speech Discrete Tokens or Continuous Features?
📄 https://aclanthology.org/2025.emnlp-main.1266.pdf
🧩 A comprehensive benchmark of SpeechLLMs using HuBERT/WavLM with Qwen & LLaMA.
✨ Continuous features outperform overall, while discrete tokens excel at phoneme-level detail.
🚀 Exciting news from the @FBK_MT group!
Four of our members @BeatriceSavoldi, @lina_conti, @negri_teo & @luisabentivogli are attending #EMNLP2025 in Suzhou 🇨🇳 with 5 accepted papers!
Come to our sessions & let's connect:
🔗 https://mt.fbk.eu/fbk-mt-at-emnlp-2025/
We’re also hiring postdocs!⚡
🎉🎓Congratulations to our PhD student @DennisFucci on a very successful thesis defense! 👏
Many thanks to the evaluation committee members @debora_nozza, @mirco_ravanelli, and Leonardo Badino for their insightful feedback and appreciation of his work!
#nlproc