The project includes technology transfer activities from FBK-irst to Pervoice and development activities such as improvements of automatic transcription technology (rich transcription, automatic text polishing), speech analytics technologies for call centers (emotional state recognition, segmentation and classification of utterances, monitoring of transactions), and advanced acoustic normalization techniques.
Our pick of the week by
@lina_conti
: "Greater accessibility can amplify discrimination in generative AI" by
@CarolinHolterm, @minhducbui_nlp, @KaitlynZhou, @vjhofmann, @kelina1124, @anne_lauscher
📰
#GenderBias #SpeechLLM
Pick of the week @fbk_mt: "Greater accessibility can amplify discrimination in generative AI"
Gender bias in speech-based LLMs examined from multiple angles: a user survey, automatic bias measurement, and pitch manipulation experiments.
https://arxiv.org/pdf/2603.22260
Late update, but we had two great talks last month!
#MachineTranslation #FBK #NLProc #GenderBias #SpeechSynthesis
Our pick of the week by @dhairya_su47605
: "Scaling Laws for Precision" by @tanishqkumar07, Zachary Ankner, @bfspectorShiekh, @blake__bordelon, @Muennighoff, @mansiege, @CPehlevan, Christopher R´e, @AdtRaghunathan
📰
#Quantization #LLM #ScalingLaw
Pick of the week @fbk_mt
Super interesting paper on the limitations of quantization, demonstrating how post-training quantization scales poorly in data.
https://arxiv.org/abs/2411.04330
⭐ For our #PickOfTheWeek, this paper explores an important question for modern speech AI:
🎙️ Which Evaluation for Which Speech Model?
👥 Authors: @Maureendss , @EeshanDhekane
Speech foundation models are evolving rapidly, but evaluation practices are still fragmented.