Speech Analytics

May 30, 1999 | Projects

- PROJECT CLOSED

The project includes technology transfer activities from FBK-irst to Pervoice and development activities such as improvements of automatic transcription technology (rich transcription, automatic text polishing), speech analytics technologies for call centers (emotional state recognition, segmentation and classification of utterances, monitoring of transactions), and advanced acoustic normalization techniques.

MT Group at FBK Follow

#MachineTranslation Research Unit @FBK_research. #nlproc #deeplearning #ai

Retweet on Twitter MT Group at FBK Retweeted

Avatar Maike Züfle @maikezufle ·

26 Jul

🇦🇹 I’ll be in Vienna for #ACL2025NLP!

Interested in training a SpeechLLM without a lot of params or data? Come to my poster:
🖼️ Mon, 18:00

Also into Speech Summarization? Join my IWSLT talk in collab with @fbk_mt:
🎤 Fri, 14:00

Happy to chat - come say hi! 😎
Papers in 🧵

Reply on Twitter 1949080662016667735 Retweet on Twitter 1949080662016667735 3 Like on Twitter 1949080662016667735 7 Twitter 1949080662016667735

Retweet on Twitter MT Group at FBK Retweeted

Avatar arXiv Sound @arxivsound ·

11h

Sara Papi, Maike Z\"ufle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues, "MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks,"

Reply on Twitter 1950353795499827604 Retweet on Twitter 1950353795499827604 2 Like on Twitter 1950353795499827604 4 Twitter 1950353795499827604

Avatar MT Group at FBK @fbk_mt ·

16 Jul

Our pick of the week by @mgaido91: "WhisperKit: On-device Real-time ASR with Billion-Scale Transformers" by Atila Orhon, Arda Okan, Berkin Durmus, @zachnagengast, and Eduardo Pacheco (ICML 2025)

#speech #speechtech #whisper #ASR #realtime

Marco Gaido @mgaido91

A couple of weeks before presenting our large-scale speech model compression task at IWSLT, here there is of the first attempts to bring large-scale models to the devices on the edge: https://arxiv.org/pdf/2507.10860... Hope to see more works along this direction!

Reply on Twitter 1945464120620323275 Retweet on Twitter 1945464120620323275 Like on Twitter 1945464120620323275 3 Twitter 1945464120620323275

Avatar MT Group at FBK @fbk_mt ·

9 Jul

Our pick of the week by @FBKZhihangXie: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)

#speech #speechprocessing #speechtech #translation

Zhihang Xie @FBKZhihangXie

🚀 AdvST: Adversarial training aligns speech and text distributions without parallel data! Combines adversarial learning + hidden-state swapping to fix length mismatch & boost low-resource speech translation. https://ieeexplore.ieee.org/document/10888294

Reply on Twitter 1942964328593834393 Retweet on Twitter 1942964328593834393 Like on Twitter 1942964328593834393 2 Twitter 1942964328593834393