The TOSCA-MP project aimed to develop user-centric content annotation and search tools for professionals in networked media production and archiving (television, radio, online), addressing their specific use cases and workflow requirements. The project brought together 10 partners from 6 European countries including industry partners providing solutions for the media industry, public service broadcasters as well as their European association, a university and research centres. TOSCA-MP investigated scalable and distributed content processing methods performing advanced multimodal information extraction and semantic enrichment. Other key technology areas included search methods across heterogeneous networked content repositories and novel user interfaces. An open standards based service oriented framework integrated the components of the system.
Our pick of the week by
@lina_conti
: "Greater accessibility can amplify discrimination in generative AI" by
@CarolinHolterm, @minhducbui_nlp, @KaitlynZhou, @vjhofmann, @kelina1124, @anne_lauscher
📰
#GenderBias #SpeechLLM
Pick of the week @fbk_mt: "Greater accessibility can amplify discrimination in generative AI"
Gender bias in speech-based LLMs examined from multiple angles: a user survey, automatic bias measurement, and pitch manipulation experiments.
https://arxiv.org/pdf/2603.22260
Late update, but we had two great talks last month!
#MachineTranslation #FBK #NLProc #GenderBias #SpeechSynthesis
Our pick of the week by @dhairya_su47605
: "Scaling Laws for Precision" by @tanishqkumar07, Zachary Ankner, @bfspectorShiekh, @blake__bordelon, @Muennighoff, @mansiege, @CPehlevan, Christopher R´e, @AdtRaghunathan
📰
#Quantization #LLM #ScalingLaw
Pick of the week @fbk_mt
Super interesting paper on the limitations of quantization, demonstrating how post-training quantization scales poorly in data.
https://arxiv.org/abs/2411.04330
⭐ For our #PickOfTheWeek, this paper explores an important question for modern speech AI:
🎙️ Which Evaluation for Which Speech Model?
👥 Authors: @Maureendss , @EeshanDhekane
Speech foundation models are evolving rapidly, but evaluation practices are still fragmented.