-
Angelini, Bianca; Cettolo, Mauro; A., Corazza; Falavigna, Giuseppe Daniele; Lazzari, Giannino,Multilingual Person to Person Communication at IRST,ICASSP `97,1997, pp. 91-94
-
Cettolo, Mauro; A., Corazza,Automatic Detection of Semantic Boundaries,Eurospeech `97,1997, pp. 919-922
-
Cettolo, Mauro; Falavigna, Giuseppe Daniele,Automatic Speech Recognition of Spontaneous Dialogues,1997
-
Angelini, Bianca; Cettolo, Mauro; A., Corazza; Falavigna, Giuseppe Daniele; Lavelli, Alberto; Lazzari, Giannino; Pianesi, Fabio; Stock, Oliviero,Preliminary Results of the C-STAR Project at IRST,C-STAR `96 Workshop,1996
-
Cettolo, Mauro; A., Corazza; Lavelli, Alberto,C-Star Intermediate Report on the High-Level Modules,1996
-
Cettolo, Mauro; A., Corazza; R., De Mori,A Mixed Approach to Speech Understanding,ICSLP `96,1996, pp. 841-844
-
Antoniol, Giuliano; Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello,1996
-
Trentin, Edmondo; Cattoni, Roldano,An Hybrid HMM/Recurrent Neural Networks Approach to Indoor Robot Navigation,Proceedings of the 1997 Real Word Computing Symposium [RWC'97],1996, pp. 282-287
-
Cettolo, Mauro; A., Corazza; R., De Mori,Automatic Learning of Sentence Dependencies in Spoken Dialogues,ESCA Workshop on Spoken Dialogue Systems,1995, pp. 77-80
-
G., Antoniol; Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello,Language Models Comparison in a Robot Telecontrol Application,Speech Recognition and Coding,Springer Verlag,vol. 147,1995, pp. 244-247
-
Brugnara, Fabio; Cettolo, Mauro,Improvements in Tree-Based Language Model Representation,Eurospeech `95,1995, pp. 1797-1800
-
Cettolo, Mauro; A., Corazza,Versione Italiana di ATIS: Apprendere dai dati la comprensione del linguaggio parlato,1995
-
G., Antoniol; Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello,Language Model Representations for Beam-Search Decoding,ICASSP `95,1995, pp. 588-591
-
Federico, Marcello; Cettolo, Mauro; Brugnara, Fabio; G., Antoniol,Language Modelling for Efficient Beam-Search,in «COMPUTER SPEECH AND LANGUAGE»,Elsevier,vol. 9,n. 4,1995, pp. 353-379
-
G., Antoniol; Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello,Language Model Estimations and Representations for Real-Time Continuous Speech Recognition,ICSLP `94,1994, pp. 859-862
-
Angelini, Bianca; G., Antoniol; Brugnara, Fabio; Cettolo, Mauro; Federico, Marcello; R., Fiutem; Lazzari, Giannino,Radiological reporting by speech recognition: the A.Re.S. System,ICSLP `94,1994, pp. 1267-1270
-
Cattoni, Roldano; G., Di Caro; Aste, Marco; Caprile, Bruno Giovanni,Bridging the Gap between Planning and Reactivity: A Layered Architecture for Autonomous Indoor Navigation,Proceedings of the IEEE/RSJ/GI International Conference on Intelligent Robots and Systems [IROS'94], Advanced Robotic Systems and the Real World,1994, pp. 878-885
-
G., Antoniol; Cattoni, Roldano; Cettolo, Mauro; Federico, Marcello,Robust Speech Understanding for Robot Telecontrol,ICAR `93,1993, pp. 205-209
-
Cattoni, Roldano; T., Coianiz; Caprile, Bruno Giovanni,Reactivity and Planning for the Mobile Robot of MAIA,1993
-
Cattoni, Roldano,Behaviours as Bridges between Symbolic Reasoning and Reactivity,1993
Our pick of the week by @FBKZhihangXie: "PHRASED: Phrase Dictionary Biasing for Speech Translation" by Peidong Wang, Jian Xue, Rui Zhao, @ChenJunkun, Aswin Shanmugam Subramanian, and Jinyu Li (2025).
#Speech #SpeechAI #Translation #ST #SpeechTranslation
🚀 Boost rare-phrase translation in speech! Uses **bilingual dictionaries** to dynamically bias outputs.
✅ **+21%** recall in streaming ST
✅ **+85%** in multimodal LLMs
🔗: http://arxiv.org/abs/2506.09175
FAMA è il primo foundation model vocale open-science per ita e eng, sviluppato da FBK. Riconosce e traduce la voce usando solo dati e strumenti pubblici: oltre 150.000 ore di audio open, codice e processi completamente accessibili.
@fbk_stek @fbk_mt
https://magazine.fbk.eu/it/news/la-prima-famiglia-di-modelli-open-science-per-il-riconoscimento-vocale-e-la-traduzione-del-parlato/
Emanuele Pianta Award for the Best Master’s Thesis in Computational Linguistics submitted at an Italian university and defended between August 1st 2024 and July 31st 2025
- Deadline: August 1st, 2025 (11:59 pm CEST)
- All details online: https://clic2025.unica.it/emanuele-pianta-award-for-the-best-masters-thesis/
Our pick of the week by @DennisFucci: "Speech Representation Analysis Based on Inter- and Intra-Model Similarities" by Yassine El Kheir, Ahmed Ali, and Shammur Absar Chowdhury (ICASSP Workshops 2024)
#speech #speechtech
Findings from https://ieeexplore.ieee.org/document/10669908 show that speech SSL models converge on similar embedding spaces, but via different routes. While overall representations align, individual neurons learn distinct localized concepts.
Interesting read! @fbk_mt