-
Trentin, Edmondo; Cattoni, Roldano,Learning Perception for Indoor Robot Navigation with a Hybrid HMM/Recurrent Neural Networks Approach,1999, pp. 243-265
-
G., Andolfi; Aste, Marco; Boninsegna, Massimo; Cattoni, Roldano; Potrich, Alessandra; Caprile, Bruno Giovanni,The Advanced Visual Monitoring Project at IRST,Regazzoni C.S., Fabri G., Vernazza G. (eds.), Advanced Video-Based Surveillance Systems, The Kluwer International Series in Engineering and Computer Science,1999, pp. 130-139
-
Cattoni, Roldano; Potrich, Alessandra,From Standard Specifications to a Multi-agent Software System: Audio Video Entertainment Application in Practice,1999
-
G., Andolfi; Caprile, Bruno Giovanni; Cattoni, Roldano; O., Salvetti,Models for BBM-Based Inference of Visual Properties,1999
-
Cattoni, Roldano; Potrich, Alessandra; P., Charlton; E., Mamdani,Evaluating the FIPA Standards on the field: an Audio Video Entertainment Application,First Asia-Pacific Conference on Intelligent Agent Technology,World Scientific Publishing,1999
-
Cettolo, Mauro; A., Corazza,History Integration Into Semantic Classification,Computational Models of Speech Pattern Processing,Springer,vol. 169,1999, pp. 356-361
-
Cettolo, Mauro; A., Corazza; Lazzari, Giannino; Pianesi, Fabio; Pianta, Emanuele; L., Tovena,A Speech-to-Speech Translation based Interface for Tourism,ENTER `99,1999, pp. 191-200
-
J., Haas; V., Warnke; H., Niemann; Cettolo, Mauro; A., Corazza; Falavigna, Giuseppe Daniele; Lazzari, Giannino,Semantic Boundaries in Multiple Languages,Eurospeech `99,1999, pp. 535-538
-
Cettolo, Mauro,Segmentation and Classification of Italian Audio Broadcast News,1999
-
A., Corazza; Cettolo, Mauro; Lazzari, Giannino; Pianta, Emanuele; Pianesi, Fabio; Tovena, L. M.,The ITC-irst Speech Translation System,Workshop AI*IA - Elaborazione del Linguaggio e Riconoscimento del Parlato `99,AI*IA,1999, pp. 30-32
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Detection of Semantic Boundaries Based on Acoustic and Lexical Knowledge,ICSLP `98,1998, pp. 1551-1554
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Recognition of Spontaneous Speech Dialogues,ICSLP `98,1998, pp. 261-264
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Recognition as Search,Spoken Dialogues with Computers,Academic Press,1998, pp. 257-309
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Knowledge Integration,Spoken Dialogues with Computers,Academic Press,1998, pp. 231-256
-
Cettolo, Mauro; A., Corazza; R., De Mori,Language Portability of a Speech Understanding System,in «COMPUTER SPEECH AND LANGUAGE»,Elsevier,vol. 12,n. 1,1998, pp. 1-21
-
Rossi, Massimo; Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,Visual Routines for Real-Time Monitoring of Vehicle Behavior,in «MACHINE VISION AND APPLICATIONS»,vol. 11,n. 1,1998, pp. 16-23
-
Cattoni, Roldano; Tarcisio, Coianiz; Messelodi, Stefano; Modena, Carla Maria,1998
-
Cattoni, Roldano; Potrich, Alessandra,Bayesian Belief Networks: Introduction and Learning,1998
-
Cattoni, Roldano; T., Coianiz; F., Fignoni; Messelodi, Stefano; Modena, Carla Maria,Progetto CODICE: specifiche funzionali del sistema,1997
-
Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,VGS – Visual Gate Simulator Simulatore degli eventi di occlusione di fotocellule in varchi controllati Manuale d’uso Versione: 1.0,1997
Today's task: model compression!!
🎯 Goal: Compress a large, general-purpose multimodal model, making speech translation more efficient ⚡️, deployable 📲, and sustainable ♻️, while preserving translation quality ⭐️
#AI #SpeechTech #ModelCompression #LLMcompression
First up, a new task for 2025:
*Instruction-following for speech processing!*
Explore instruction-following for speech ⇨
Integrate speech foundation models with LLMs across tasks such as speech translation, recognition, summarization, and QA.
🔗:
📢Workshop gratuito 05/02: “Lo stato dell'arte nelle tecnologie per il riconoscimento del parlato.”
Diretta YouTube: https://www.youtube.com/live/i4x7w8fIIXo?si=wYvvrO3-MSh7Yik4
Registrazione: https://www.eventbrite.com/e/biglietti-lo-stato-dellarte-nelle-tecnologie-per-il-riconoscimento-del-parlato-1109098797359?aff=oddtdtcreator
I'm happy to share that our paper "Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison" has been accepted at @naacl @naaclmeeting 2025! #NAACL2025
@Lam19Tk @mgaido91 👏
📃 Preprint:
⏰ Code will be released soon
#NLProc #Speech