The TOSCA-MP project aimed to develop user-centric content annotation and search tools for professionals in networked media production and archiving (television, radio, online), addressing their specific use cases and workflow requirements. The project brought together 10 partners from 6 European countries including industry partners providing solutions for the media industry, public service broadcasters as well as their European association, a university and research centres. TOSCA-MP investigated scalable and distributed content processing methods performing advanced multimodal information extraction and semantic enrichment. Other key technology areas included search methods across heterogeneous networked content repositories and novel user interfaces. An open standards based service oriented framework integrated the components of the system.
3️⃣ "Cross-Attention is Half Explanation in Speech-to-Text Models"
👥 @sarapapi, @DennisFucci, @mgaido91, @negri_teo, @luisabentivogli
🇪🇺 DVPS EU project
📄
1️⃣ "Do What I Say: A Spoken Prompt Dataset for Instruction-Following"
👥 @MaikeZufle, @sarapapi, Fabian Retkowski, Szymon Mazurek, @mkasztelnik, Alexander Waibel, @luisabentivogli, @_janius_
🇪🇺 Meetween EU project
📄
Our pick of the week by
@lina_conti
: "Greater accessibility can amplify discrimination in generative AI" by
@CarolinHolterm, @minhducbui_nlp, @KaitlynZhou, @vjhofmann, @kelina1124, @anne_lauscher
📰
#GenderBias #SpeechLLM
Pick of the week @fbk_mt: "Greater accessibility can amplify discrimination in generative AI"
Gender bias in speech-based LLMs examined from multiple angles: a user survey, automatic bias measurement, and pitch manipulation experiments.
https://arxiv.org/pdf/2603.22260