The TC-STAR project is envisaged as a long-term effort to advance research in all core technologies for Speech-to-Speech Translation (SST). SST technology is a combination of Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text to Speech (TTS) (speech synthesis). The objectives of the project are ambitious: making a breakthrough in SST that significantly reduces the gap between human and machine translation performance. The project targets a selection of unconstrained conversational speech domains—speeches and broadcast news—and three languages: European English, European Spanish, and Mandarin Chinese. Accurate translation of unrestricted speech is well beyond the capability of today’s state-of-the-art research systems. Therefore, advances are needed to improve the state-of the-art technologies for speech recognition and speech translation.
The 22nd edition of IWSLT will be co-located with @aclmeeting in Vienna, Austria on 31 July-1 Aug 2025!
Stay tuned for the CFP and more info about our 2025 shared tasks! Join our google group for periodic updates.
In "Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps," @BeatriceSavoldi, @DennisFucci, @dirk_hovy, and I show how speech recognition serves different gender groups differently and what to do about it.
Meet @sarapapi, @BeatriceSavoldi, and @negri_teo at EMNLP 2024 in Miami next week! 🌴
They will present two main conference papers about human-centered #MT and #genderbias, and #opensource #speech resources!
📍 Details here: https://mt.fbk.eu/our-postdocs-sara-papi-and-beatrice-savoldi-and-our-researcher-matteo-negri-at-emnlp-2024/
#NLProc #EMNLP2024
Weekly pick from the #MeetweenScientificWatch: "Vcoder: Versatile Vision Encoders for Multimodal LLMs" - A novel encoder boosts object perception in MLLMs, outperforming GPT-4V in visual reasoning! 🌆👀