Based on the philosophy of the Semantic Web, Ontotext uses text processing and automatic reasoning technologies to extract knowledge from texts and organise it conceptually in an ontology. Unlike common search engines, the Ontotext Portal directly accesses the concepts and entities of the ontology and presents the user with structured information instead of mere text fragments. For each entity, the Ontotext Portal offers four different views: Articles (lists all the documents in which the entity is mentioned), Citografo (shows how often the entity is mentioned), Opinions (shows how often opinions are expressed about the entity), and Record (provides extra information about the entity).
🤔 What Matters in Data for DPO? I asked myself this question a few days ago while trying to understand how to generate a dataset with preferences to run #DPO. This recent #NeurIPS paper answered some of my questions. The findings are simple but crucial for data creation:
🎓 Come and join our group! 🎓
We offer 2 fully funded PhD positions:
🌍 Human-Centred Evaluation Frameworks for Multilingual Technologies (A6)
🤖 Multimedia Personalization with Multimodal Large Language Models (A7)
⏰ Deadline: 15 May 2026
🔗 Details: https://iecs.unitn.it/education/admission/call-for-application
Our pick of the week by @FBKZhihangXie: "Detecting Hallucination in SpeechLLMs at Inference Time Using Attention Maps" by @JWaldendorf, Bashar Awwad Shiekh Hasan and Evgenii Tsymbalov
📰
#SpeechLLM #Hallucination
🚀 New paper: Detecting Hallucinations in SpeechLLMs at Inference Time Using Attention Maps
📄 http://arxiv.org/abs/2604.19565
🧩 Lightweight inference-time detection for SpeechLLM hallucinations via audio attention.
✨ Attention classifiers beat uncertainty baselines on ASR and S2TT.
🚀 New Shared Task: Model Compression for Machine Translation at #WMT2026 (co-located with #EMNLP2026)!
📅 Test data out on June 18th, submissions by July 2nd!
Can you shrink an LLM and keep translation quality high? 🧠🔧
👉 https://www2.statmt.org/wmt26/model-compression.html #NLP #ML #LLM #ModelCompression