Beomseok Lee

PhD Student

E-mail: blee@fbk.eu
Google Scholar: My citations

Twitter: Profile

LinkedIn: Profile

Short bio

Beomseok Lee is a PhD student at the University of Trento, conducting research at Fondazione Bruno Kessler and NAVER LABS Europe.

Before starting his PhD, Beomseok worked as a full-time research engineer at the Global AI Center of SAMSUNG Research (the Samsung Electronics R&D hub) in Seoul, where he specialized in end-to-end (E2E) speech-to-text translation. His current research focuses on E2E spoken language understanding, with an emphasis on multi-task, multi-lingual, and multi-modal approaches. He holds a Master's degree in Computer Science (CS) from the Korea Advanced Institute of Science & Technology (KAIST, Korea) and a Bachelor's degree in CS from Sungkyunkwan University (SKKU, Korea).

Research topics

Spoken Language Understanding, Multi-modality

Publications

Lee, Beomseok; Gaido, Marco; Calapodescu, Ioan; Besacier, Laurent; Negri, Matteo,
Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection,
in Proceedings of the 31st International Conference on Computational Linguistics, 2025, pp. 6816-6826

Lee, Beomseok; Calapodescu, Ioan; Gaido, Marco; Negri, Matteo; Besacier, Laurent,
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond,
in Proceedings of Interspeech 2024, 2024, pp. 817-821
