-
Trentin, Edmondo; Cattoni, Roldano,Learning Perception for Indoor Robot Navigation with a Hybrid HMM/Recurrent Neural Networks Approach,1999, pp. 243-265
-
G., Andolfi; Aste, Marco; Boninsegna, Massimo; Cattoni, Roldano; Potrich, Alessandra; Caprile, Bruno Giovanni,The Advanced Visual Monitoring Project at IRST,Regazzoni C.S., Fabri G., Vernazza G. (eds.), Advanced Video-Based Surveillance Systems, The Kluwer International Series in Engineering and Computer Science,1999, pp. 130-139
-
Cattoni, Roldano; Potrich, Alessandra,From Standard Specifications to a Multi-agent Software System: Audio Video Entertainment Application in Practice,1999
-
G., Andolfi; Caprile, Bruno Giovanni; Cattoni, Roldano; O., Salvetti,Models for BBM-Based Inference of Visual Properties,1999
-
Cattoni, Roldano; Potrich, Alessandra; P., Charlton; E., Mamdani,Evaluating the FIPA Standards on the field: an Audio Video Entertainment Application,First Asia-Pacific Conference on Intelligent Agent Technology,World Scientific Publishing,1999
-
Cettolo, Mauro; A., Corazza,History Integration Into Semantic Classification,Computational Models of Speech Pattern Processing,Springer,vol. 169,1999, pp. 356-361
-
Cettolo, Mauro; A., Corazza; Lazzari, Giannino; Pianesi, Fabio; Pianta, Emanuele; L., Tovena,A Speech-to-Speech Translation based Interface for Tourism,ENTER `99,1999, pp. 191-200
-
J., Haas; V., Warnke; H., Niemann; Cettolo, Mauro; A., Corazza; Falavigna, Giuseppe Daniele; Lazzari, Giannino,Semantic Boundaries in Multiple Languages,Eurospeech `99,1999, pp. 535-538
-
Cettolo, Mauro,Segmentation and Classification of Italian Audio Broadcast News,1999
-
A., Corazza; Cettolo, Mauro; Lazzari, Giannino; Pianta, Emanuele; Pianesi, Fabio; Tovena, L. M.,The ITC-irst Speech Translation System,Workshop AI*IA - Elaborazione del Linguaggio e Riconoscimento del Parlato `99,AI*IA,1999, pp. 30-32
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Detection of Semantic Boundaries Based on Acoustic and Lexical Knowledge,ICSLP `98,1998, pp. 1551-1554
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Recognition of Spontaneous Speech Dialogues,ICSLP `98,1998, pp. 261-264
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Recognition as Search,Spoken Dialogues with Computers,Academic Press,1998, pp. 257-309
-
Cettolo, Mauro; Gretter, Roberto; R., De Mori,Knowledge Integration,Spoken Dialogues with Computers,Academic Press,1998, pp. 231-256
-
Cettolo, Mauro; A., Corazza; R., De Mori,Language Portability of a Speech Understanding System,in «COMPUTER SPEECH AND LANGUAGE»,Elsevier,vol. 12,n. 1,1998, pp. 1-21
-
Rossi, Massimo; Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,Visual Routines for Real-Time Monitoring of Vehicle Behavior,in «MACHINE VISION AND APPLICATIONS»,vol. 11,n. 1,1998, pp. 16-23
-
Cattoni, Roldano; Tarcisio, Coianiz; Messelodi, Stefano; Modena, Carla Maria,1998
-
Cattoni, Roldano; Potrich, Alessandra,Bayesian Belief Networks: Introduction and Learning,1998
-
Cattoni, Roldano; T., Coianiz; F., Fignoni; Messelodi, Stefano; Modena, Carla Maria,Progetto CODICE: specifiche funzionali del sistema,1997
-
Aste, Marco; Cattoni, Roldano; Caprile, Bruno Giovanni,VGS – Visual Gate Simulator Simulatore degli eventi di occlusione di fotocellule in varchi controllati Manuale d’uso Versione: 1.0,1997
Our pick of the week by @mgaido91: "WhisperKit: On-device Real-time ASR with Billion-Scale Transformers" by Atila Orhon, Arda Okan, Berkin Durmus, @zachnagengast, and Eduardo Pacheco (ICML 2025)
#speech #speechtech #whisper #ASR #realtime
A couple of weeks before presenting our large-scale speech model compression task at IWSLT, here there is of the first attempts to bring large-scale models to the devices on the edge: https://arxiv.org/pdf/2507.10860... Hope to see more works along this direction!
Our pick of the week by @FBKZhihangXie: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
#speech #speechprocessing #speechtech #translation
🚀 AdvST: Adversarial training aligns speech and text distributions without parallel data! Combines adversarial learning + hidden-state swapping to fix length mismatch & boost low-resource speech translation. https://ieeexplore.ieee.org/document/10888294
A special evening in Rome to talk about Physical AI and Europe’s role in shaping this new frontier.
Partners from across Europe came together to present the DVPS project, and connect with key people from public institutions, embassies, industries, national & international media.
Thrilled to be part of this amazing project and team!
🚀 DVPS has launched at Translated's HQ!
70 researchers from 20 institutions across 9 countries unite to build next-gen multimodal foundation models that learn from real-world interaction.
A new European AI journey begins.
#DVPS #PhysicalAI #HorizonEurope #MultimodalAI