-
Cattoni, Roldano; Danieli, M.; Panizza, A.; Sandrini, Vanessa; Soria, C.,Building a corpus of annotated dialogues: the ADAM experience,Proceedings of Corpus Linguistics 2001,2001
-
Cattoni, Roldano; Federico, Marcello; Lavie, A.,Robust Analysis of Spoken Input combining Statistical and Knowledge-based Information Sources,Proceedings of the 2001 IEEE Workshop on Automatic Speech Recognition and Understanding,2001
-
Magnini, Bernardo; Negri, Matteo; Prevete, Roberto,Open Domain Question/Answering on the Web,Proceedings of AI*IA 2001: Advances in Artificial Intelligence, 7th Congress of the Italian Association for Artificial Intelligence,Springer,2001, pp. 273-284
-
Magnini, Bernardo; Negri, Matteo; Prevete, Roberto; Tanev, Hristo,Multilingual Question/Answering: the DIOGENE System,TREC 2001,2001, pp. 425-433
-
Bertoldi, Nicola; Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello; Giuliani, Diego,From Broadcast News to Spontaneous Dialogue Transcription: Portability Issues,ICASSP 2001,2001, pp. 37-40
-
Cettolo, Mauro,Speaker Tracking in a Broadcast News Corpus,2001: A Speaker Odyssey - The Speaker Recognition Workshop,2001, pp. 163-167
-
Cettolo, Mauro,Segmentation, Classification and Clustering of an Italian Broadcast News Corpus,RIAO 2000,2000, pp. 372-381
-
Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello; Giuliani, Diego,A System for the Segmentation and Transcription of Italian Radio News,RIAO 2000,2000, pp. 364-371
-
Cettolo, Mauro,You talk, I translate! Technical Aspects of the ITC-irst Speech Translation System,2000
-
Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello; Giuliani, Diego,Advances in Automatic Transcription of Italian Broadcast News,ICSLP 2000,2000, pp. 660-663
-
Cettolo, Mauro; Federico, Marcello,Model Selection Criteria for Acoustic Segmentation,ISCA ITRW ASR2000,2000, pp. 221-227
-
Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello; Giuliani, Diego,A Baseline for the Transcription of Italian Broadcast News,ICASSP 2000,2000, pp. 1667-1670
-
Bentivogli, Luisa; Pianesi, Fabio; Pianta, Emanuele,From Word-based to Concept-based Text Analysis: the PhiloNet Project,Workshop Informatica umanistica: filosofia e risorse digitali,2000
-
Bentivogli, Luisa; Pianta, Emanuele,Looking for Lexical Gaps,Ninth EURALEX International Congress [EURALEX 2000],2000, pp. 663-669
-
Bentivogli, Luisa; Pianta, Emanuele; Pianesi, Fabio,Coping with Lexical Gaps when Building Aligned Multilingual Wordnets,Second International Conference on Language Resources and Evaluation (LREC 2000),2000, pp. 993-997
-
Soria, C.; Cattoni, Roldano; Danieli, M.,ADAM: An Architecture for xml-based Dialogue Annotation on Multiple levels,Proceedings of the 1st SIGdial Workshop on Discourse and Dialogue, in conjunction with ACL-2000: The 38th Annual Meeting of the Association for Computational Linguistics,2000
-
Charlton, P.; Cattoni, Roldano; Potrich, Alessandra; Mamdani, E.,Evaluating the FIPA Standards and Its Role in Achieving Cooperation in Multi-agent Systems,Proceedings of Thirty-Third Annual Hawaii International Conference on System Sciences,2000
-
Negri, Matteo,La valutazione di moduli nell'elaborazione del linguaggio naturale: problemi e metodi,2000
-
Bentivogli, Luisa,Relazioni lessicali e semantiche nella costruzione di un lessico computazionale multilingue: Problematiche Tecniche e Filosofiche,1999
-
Trentin, Edmondo; Cattoni, Roldano,A Hybrid Framework for Indoor Robot Navigation,Marinaro M., Tagliaferri R. (eds.), Neural Nets WIRN-98, Section 6m Pattern Recognition and Signal Processing,1999, pp. 255-263
Our pick of the week by @mgaido91: "WhisperKit: On-device Real-time ASR with Billion-Scale Transformers" by Atila Orhon, Arda Okan, Berkin Durmus, @zachnagengast, and Eduardo Pacheco (ICML 2025)
#speech #speechtech #whisper #ASR #realtime
A couple of weeks before presenting our large-scale speech model compression task at IWSLT, here is one of the first attempts to bring large-scale models to edge devices: https://arxiv.org/pdf/2507.10860... Hope to see more work in this direction!
Our pick of the week by @FBKZhihangXie: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
#speech #speechprocessing #speechtech #translation
🚀 AdvST: Adversarial training aligns speech and text distributions without parallel data! Combines adversarial learning + hidden-state swapping to fix length mismatch & boost low-resource speech translation. https://ieeexplore.ieee.org/document/10888294
A special evening in Rome to talk about Physical AI and Europe’s role in shaping this new frontier.
Partners from across Europe came together to present the DVPS project and connect with key people from public institutions, embassies, industry, and national & international media.
Thrilled to be part of this amazing project and team!
🚀 DVPS has launched at Translated's HQ!
70 researchers from 20 institutions across 9 countries unite to build next-gen multimodal foundation models that learn from real-world interaction.
A new European AI journey begins.
#DVPS #PhysicalAI #HorizonEurope #MultimodalAI