Tasks that ultimately require knowledge-based multimedia techniques (content-oriented search, assessment, abstracting, etc.) are still to a major extent carried out manually. PATExpert’s overall scientific goal was to change the paradigm currently followed for patent processing from textual processing (viewing patents as text blocks enriched by “canned” picture material, sequences of morpho-syntactic tokens, or collections of syntactic structures) to semantic processing (viewing patents as multimedia knowledge objects). PATExpert developed a multimedia content representation formalism based on Semantic Web technologies for selected technology areas and investigated the retrieval, classification, multilingual generation of concise patent information, assessment, and visualization of patent material encoded in this formalism, taking into account the information needs of all user types as defined in a user typology. PATExpert’s technological goal was to develop a showcase that demonstrates the viability of PATExpert’s approach to content representation for real applications. The composition and the competence of the Consortium ensured the achievement of these goals.
Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025)
#SLU #speech #multimodal #LLM
Speech-language models show promise in multimodal tasks—but how well are speech & text actually aligned? 🤔
This paper https://arxiv.org/abs/2505.19937 proposes a new metric to measure layer-wise correlation between the two, with a focus on SLU tasks. 🔍🗣️📄
🔍 Hi! We're studying how AI is used in Italy, and to do that we've built a survey!
👉https://bocconi.eu.qualtrics.com/jfe/form/SV_2nTelXaXvJlinbg (it's anonymous and takes ~10 min; taking part or sharing it helps us a lot🙏)
We're also keen to reach people who don't work in AI!
Our pick of the week by @apierg: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" by Arjun Subramonian, Vagrant Gautam, Preethi Seshadri, Dietrich Klakow, @kaiwei_chang, @YizhouSun (2025).
#LLM #misgendering #gender
Super interesting paper by Subramonian et al.: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" https://arxiv.org/abs/2504.17075
Turns out, misgendering is messier than just pronouns. I'd love to see this analysis extended to grammatical gender languages! #LLM #AI #ethics @fbk_mt
🚀 New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian.
The models are live and ready to try on @huggingface 👇
🔗
#ASR #ST #OpenScience #MultilingualAI