-
Trentin, Edmondo; Cattoni, Roldano,Learning Perception for Indoor Robot Navigation with a Hybrid HMM/Recurrent Neural Networks Approach,1999, pp. 243-265
-
G., Andolfi; Aste, Marco; Boninsegna, Massimo; Cattoni, Roldano; Potrich, Alessandra; Caprile, Bruno Giovanni,The Advanced Visual Monitoring Project at IRST,Regazzoni C.S., Fabri G., Vernazza G. (eds.), Advanced Video-Based Surveillance Systems, The Kluwer International Series in Engineering and Computer Science,1999, pp. 130-139
-
Cattoni, Roldano; Potrich, Alessandra,From Standard Specifications to a Multi-agent Software System: Audio Video Entertainment Application in Practice,1999
-
G., Andolfi; Caprile, Bruno Giovanni; Cattoni, Roldano; O., Salvetti,Models for BBM-Based Inference of Visual Properties,1999
-
Cattoni, Roldano; Potrich, Alessandra; P., Charlton; E., Mamdani,Evaluating the FIPA Standards on the field: an Audio Video Entertainment Application,First Asia-Pacific Conference on Intelligent Agent Technology,World Scientific Publishing,1999
-
Cettolo, Mauro; A., Corazza,History Integration Into Semantic Classification,Computational Models of Speech Pattern Processing,Springer,vol. 169,1999, pp. 356-361
-
Cettolo, Mauro; A., Corazza; Lazzari, Giannino; Pianesi, Fabio; Pianta, Emanuele; L., Tovena,A Speech-to-Speech Translation based Interface for Tourism,ENTER `99,1999, pp. 191-200
-
J., Haas; V., Warnke; H., Niemann; Cettolo, Mauro; A., Corazza; Falavigna, Giuseppe Daniele; Lazzari, Giannino,Semantic Boundaries in Multiple Languages,Eurospeech `99,1999, pp. 535-538
-
Cettolo, Mauro,Segmentation and Classification of Italian Audio Broadcast News,1999
-
A., Corazza; Cettolo, Mauro; Lazzari, Giannino; Pianta, Emanuele; Pianesi, Fabio; Tovena, L. M.,The ITC-irst Speech Translation System,Workshop AI*IA - Elaborazione del Linguaggio e Riconoscimento del Parlato `99,AI*IA,1999, pp. 30-32
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Detection of Semantic Boundaries Based on Acoustic and Lexical Knowledge,ICSLP `98,1998, pp. 1551-1554
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Recognition of Spontaneous Speech Dialogues,ICSLP `98,1998, pp. 261-264
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Recognition as Search,Spoken Dialogues with Computers,Academic Press,1998, pp. 257-309
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Knowledge Integration,Spoken Dialogues with Computers,Academic Press,1998, pp. 231-256
-
Cettolo, Mauro; A., Corazza; R., De Mori,Language Portability of a Speech Understanding System,in «COMPUTER SPEECH AND LANGUAGE»,Elsevier,vol. 12,n. 1,1998, pp. 1-21
-
Rossi, Massimo; Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,Visual Routines for Real-Time Monitoring of Vehicle Behavior,in «MACHINE VISION AND APPLICATIONS»,vol. 11,n. 1,1998, pp. 16-23
-
Cattoni, Roldano; Tarcisio, Coianiz; Messelodi, Stefano; Modena, Carla Maria,1998
-
Cattoni, Roldano; Potrich, Alessandra,Bayesian Belief Networks: Introduction and Learning,1998
-
Cattoni, Roldano; T., Coianiz; F., Fignoni; Messelodi, Stefano; Modena, Carla Maria,Progetto CODICE: specifiche funzionali del sistema,1997
-
Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,VGS – Visual Gate Simulator Simulatore degli eventi di occlusione di fotocellule in varchi controllati Manuale d’uso Versione: 1.0,1997
Our pick of the week by @DennisFucci: "Speech Representation Analysis Based on Inter- and Intra-Model Similarities" by Yassine El Kheir, Ahmed Ali, and Shammur Absar Chowdhury (ICASSP Workshops 2024)
#speech #speechtech
Findings from https://ieeexplore.ieee.org/document/10669908 show that speech SSL models converge on similar embedding spaces, but via different routes. While overall representations align, individual neurons learn distinct localized concepts.
Interesting read! @fbk_mt
Cosa chiedono davvero gli italiani all’intelligenza artificiale?
FBK in collaborazione con RiTA lancia un’indagine aperta a tutte/i per capire usi reali, abitudini e bisogni.
Bastano 10 minuti per partecipare, scopri di più: https://magazine.fbk.eu/it/news/italiani-e-ia-cosa-chiediamo-veramente-allintelligenza-artificiale/
🚀 Last call for the Model Compression for Machine Translation task at #WMT2025 (co-located with #EMNLP2025)!
Test data out on June 19 ➡️ 2 weeks for evaluation!
Can you shrink an LLM and keep translation quality high?
👉 https://www2.statmt.org/wmt25/model-compression.html #NLP #ML #LLM #ModelCompression
Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025)
#SLU #speech #multimodal #LLM
Speech-language models show promise in multimodal tasks—but how well are speech & text actually aligned? 🤔
This paper https://arxiv.org/abs/2505.19937 proposes a new metric to measure layer-wise correlation between the two, with a focus on SLU tasks. 🔍🗣️📄