-
Trentin, Edmondo; Cattoni, Roldano,Learning Perception for Indoor Robot Navigation with a Hybrid HMM/Recurrent Neural Networks Approach,1999, pp. 243-265
-
G., Andolfi; Aste, Marco; Boninsegna, Massimo; Cattoni, Roldano; Potrich, Alessandra; Caprile, Bruno Giovanni,The Advanced Visual Monitoring Project at IRST,Regazzoni C.S., Fabri G., Vernazza G. (eds.), Advanced Video-Based Surveillance Systems, The Kluwer International Series in Engineering and Computer Science,1999, pp. 130-139
-
Cattoni, Roldano; Potrich, Alessandra,From Standard Specifications to a Multi-agent Software System: Audio Video Entertainment Application in Practice,1999
-
G., Andolfi; Caprile, Bruno Giovanni; Cattoni, Roldano; O., Salvetti,Models for BBM-Based Inference of Visual Properties,1999
-
Cattoni, Roldano; Potrich, Alessandra; P., Charlton; E., Mamdani,Evaluating the FIPA Standards on the field: an Audio Video Entertainment Application,First Asia-Pacific Conference on Intelligent Agent Technology,World Scientific Publishing,1999
-
Cettolo, Mauro; A., Corazza,History Integration Into Semantic Classification,Computational Models of Speech Pattern Processing,Springer,vol. 169,1999, pp. 356-361
-
Cettolo, Mauro; A., Corazza; Lazzari, Giannino; Pianesi, Fabio; Pianta, Emanuele; L., Tovena,A Speech-to-Speech Translation based Interface for Tourism,ENTER '99,1999, pp. 191-200
-
J., Haas; V., Warnke; H., Niemann; Cettolo, Mauro; A., Corazza; Falavigna, Giuseppe Daniele; Lazzari, Giannino,Semantic Boundaries in Multiple Languages,Eurospeech '99,1999, pp. 535-538
-
Cettolo, Mauro,Segmentation and Classification of Italian Audio Broadcast News,1999
-
A., Corazza; Cettolo, Mauro; Lazzari, Giannino; Pianta, Emanuele; Pianesi, Fabio; Tovena, L. M.,The ITC-irst Speech Translation System,Workshop AI*IA - Elaborazione del Linguaggio e Riconoscimento del Parlato '99,AI*IA,1999, pp. 30-32
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Detection of Semantic Boundaries Based on Acoustic and Lexical Knowledge,ICSLP '98,1998, pp. 1551-1554
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Recognition of Spontaneous Speech Dialogues,ICSLP '98,1998, pp. 261-264
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Recognition as Search,Spoken Dialogues with Computers,Academic Press,1998, pp. 257-309
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Knowledge Integration,Spoken Dialogues with Computers,Academic Press,1998, pp. 231-256
-
Cettolo, Mauro; A., Corazza; R., De Mori,Language Portability of a Speech Understanding System,in «COMPUTER SPEECH AND LANGUAGE»,Elsevier,vol. 12,n. 1,1998, pp. 1-21
-
Rossi, Massimo; Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,Visual Routines for Real-Time Monitoring of Vehicle Behavior,in «MACHINE VISION AND APPLICATIONS»,vol. 11,n. 1,1998, pp. 16-23
-
Cattoni, Roldano; Tarcisio, Coianiz; Messelodi, Stefano; Modena, Carla Maria,1998
-
Cattoni, Roldano; Potrich, Alessandra,Bayesian Belief Networks: Introduction and Learning,1998
-
Cattoni, Roldano; T., Coianiz; F., Fignoni; Messelodi, Stefano; Modena, Carla Maria,Progetto CODICE: specifiche funzionali del sistema,1997
-
Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,VGS – Visual Gate Simulator Simulatore degli eventi di occlusione di fotocellule in varchi controllati Manuale d’uso Versione: 1.0,1997
New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian.
The models are live and ready to try on @huggingface
#ASR #ST #OpenScience #MultilingualAI
Our pick of the week by @lina_conti: "Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically" by @soheunshim, Domenico De Cristofaro, Chengzhi Martin Hu, Alessandro Vietti, and @barbara_plank (2025).
#speech #SFM #multilingual #speechtech
Pick of the week @fbk_mt: https://arxiv.org/abs/2505.19606 by @soheunshim @DomenicoDeCris1. XAI work on cross-lingual alignment in speech-to-text models that disentangles phonetics and semantics. Plus: their XAI insights yield actionable improvements for low-resource language performance.
New shared task at #WMT2025 (co-located with @emnlpmeeting): Model Compression for Machine Translation!
Can you shrink an LLM and keep translation quality high?
Submit by July 3 and push the limits of efficient NLP!
https://www2.statmt.org/wmt25/model-compression.html #NLP #ML #LLM #ModelCompression
More great news!
Our paper “Echoes of Phonetics: Unveiling Relevant Acoustic Cues for ASR via Feature Attribution” was accepted at #Interspeech2025!
Interested in interpretability for speech models? Preprint coming soon!
✍🏼 @mgaido91, @negri_teo, M. Cettolo, @luisabentivogli