The TOSCA-MP project aimed to develop user-centric content annotation and search tools for professionals in networked media production and archiving (television, radio, online), addressing their specific use cases and workflow requirements. The project brought together 10 partners from 6 European countries including industry partners providing solutions for the media industry, public service broadcasters as well as their European association, a university and research centres. TOSCA-MP investigated scalable and distributed content processing methods performing advanced multimodal information extraction and semantic enrichment. Other key technology areas included search methods across heterogeneous networked content repositories and novel user interfaces. An open standards based service oriented framework integrated the components of the system.
The 22nd edition of IWSLT will be co-located with @aclmeeting in Vienna, Austria on 31 July-1 Aug 2025!
Stay tuned for the CFP and more info about our 2025 shared tasks! Join our google group for periodic updates.
In "Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps," @BeatriceSavoldi, @DennisFucci, @dirk_hovy, and I show how speech recognition serves different gender groups differently and what to do about it.
Meet @sarapapi, @BeatriceSavoldi, and @negri_teo at EMNLP 2024 in Miami next week! 🌴
They will present two main conference papers about human-centered #MT and #genderbias, and #opensource #speech resources!
📍 Details here: https://mt.fbk.eu/our-postdocs-sara-papi-and-beatrice-savoldi-and-our-researcher-matteo-negri-at-emnlp-2024/
#NLProc #EMNLP2024
Weekly pick from the #MeetweenScientificWatch: "Vcoder: Versatile Vision Encoders for Multimodal LLMs" - A novel encoder boosts object perception in MLLMs, outperforming GPT-4V in visual reasoning! 🌆👀