- G., Antoniol; Cettolo, Mauro; Federico, Marcello. Robust and Reliable Speech Understanding in Restricted Domains. IEEE Workshop on Automatic Speech Recognition, 1993, pp. 103-104
- G., Antoniol; Cettolo, Mauro; Federico, Marcello. Techniques for Robust Recognition in Restricted Domains. Eurospeech '93, 1993, pp. 2219-2221
- A., Antoniol; Brugnara, Fabio; Cettolo, Mauro; R., Fiutem; Flor, Roberto; Lazzari, Giannino. ASR System Comparison on New High Performance Workstation. Avios '93, 1993, pp. 17-24
- G., Antoniol; Brugnara, Fabio; Cettolo, Mauro; R., Fiutem; Flor, Roberto; Lazzari, Giannino. Comparison of Automatic Speech Recognition System based on New High Speed RISC Workstations. ICSPAT '93, 1993, pp. 1473-1477
- G., Antoniol; G., Carli; Cettolo, Mauro; R., Fiutem. Tools for Development, Test and Analysis of ASRs. ICSPAT '92, 1992, pp. 1005-1010
- G., Antoniol; G., Carli; Cettolo, Mauro; R., Fiutem; Flor, Roberto; Lazzari, Giannino. Finite State Network, Isolated Word, Real Time Automatic Speech Recognizer Based on DSP32C. ICSPAT '92, 1992, pp. 1000-1004
- Franco, Blanchini; Cettolo, Mauro; Patrizio, Righini. Disturbance Rejection Problem for Discrete-Time Linear Systems: a Comparison between Linear and Nonlinear Control. ISIRS '91, 1991, pp. 28-32
- Cattoni, Roldano; E., Franconi. Walking through the Semantics of Frame-Based Description Languages: A Case Study. 1990
Our pick of the week by @mgaido91: "WhisperKit: On-device Real-time ASR with Billion-Scale Transformers" by Atila Orhon, Arda Okan, Berkin Durmus, @zachnagengast, and Eduardo Pacheco (ICML 2025)
#speech #speechtech #whisper #ASR #realtime
A couple of weeks before we present our large-scale speech model compression task at IWSLT, here is one of the first attempts to bring large-scale models to edge devices: https://arxiv.org/pdf/2507.10860... We hope to see more work in this direction!
Our pick of the week by @FBKZhihangXie: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
#speech #speechprocessing #speechtech #translation
🚀 AdvST: Adversarial training aligns speech and text distributions without parallel data! Combines adversarial learning + hidden-state swapping to fix length mismatch & boost low-resource speech translation. https://ieeexplore.ieee.org/document/10888294
A special evening in Rome to talk about Physical AI and Europe’s role in shaping this new frontier.
Partners from across Europe came together to present the DVPS project and connect with key people from public institutions, embassies, industry, and national and international media.
Thrilled to be part of this amazing project and team!
🚀 DVPS has launched at Translated's HQ!
70 researchers from 20 institutions across 9 countries unite to build next-gen multimodal foundation models that learn from real-world interaction.
A new European AI journey begins.
#DVPS #PhysicalAI #HorizonEurope #MultimodalAI