AINEVA is the association of Italian regions and autonomous provinces which include Alps, whose goal is to coordinate efforts that local members play in the prevention and information in the field of snow and avalanches. On a daily basis, AINEVA members compile and make available bulletins of conditions and avalanche forecasts written in Italian. In order to allow their consultation to non-Italian people (e.g. foreign tourists), AINEVA members provide translations of original bulletins into languages such as English, French, German and Slovenian. AINEVA and FBK started a 20-month collaboration at the end of 2009 with the goal of developing a system for the automatic translation of such bulletins into English, French and German.
Our pick of the week by @dhairya_su47605
: "Scaling Laws for Precision" by @tanishqkumar07, Zachary Ankner, @bfspectorShiekh, @blake__bordelon, @Muennighoff, @mansiege, @CPehlevan, Christopher R´e, @AdtRaghunathan
📰
#Quantization #LLM #ScalingLaw
Pick of the week @fbk_mt
Super interesting paper on the limitations of quantization, demonstrating how post-training quantization scales poorly in data.
https://arxiv.org/abs/2411.04330
⭐ For our #PickOfTheWeek, this paper explores an important question for modern speech AI:
🎙️ Which Evaluation for Which Speech Model?
👥 Authors: @Maureendss , @EeshanDhekane
Speech foundation models are evolving rapidly, but evaluation practices are still fragmented.
🏝️ Yesterday at #LREC2026, Palma de Mallorca!
@lina_conti presented "Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation" at the poster session.
📄Paper:
💻Code: https://github.com/lina-conti/voice-bias-coreference
#SpeechTranslation #NLProc
How does the granularity of speech-text pairs impact SpeechLLM performance, and what is the optimal way to interleave tokens? Furthermore, what are the best practices for generating synthetic data to boost training?🧐