Beomseok Lee

PhD Student

    Short bio

    Beomseok Lee is a PhD student at the University of Trento, conducting research at Fondazione Bruno Kessler and NAVER LABS Europe.

    Before starting his PhD, Beomseok worked as a full-time research engineer at SAMSUNG Research (Samsung Electronics R&D hub) Global AI Center, Seoul where he specialized in End-to-end (E2E) Speech-to-text translation. His current research focuses on E2E Spoken Language Understanding with an emphasis on multi-task, multi-lingual and multi-modal approaches. He holds a Computer Science (CS) Master's degree from Korea Advanced Institute of Science & Technology (KAIST, Korea) and a CS Bachelor's degree from Sungkyunkwan University (SKKU, Korea).

    Research topics

    Spoken Language Understanding, Multi-modality

    Publications

    1. Lee, Beomseok; Calapodescu, Ioan; Gaido, Marco; Negri, Matteo; Besacier, Laurent,
      in «»,
      Proceedings of Interspeech2024,
      ,
      vol. ,
      n. ,
      2024
      , pp. 817-
      821

    Weekly pick from the #MeetweenScientificWatch: “Video-SALMONN: Speech-enhanced audio-visual large language models” – Redefining video comprehension with speech-aware AV-LLMs and groundbreaking QA accuracy. 🎥🎤🤖

    I’m glad to announce that our work “How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?” has been accepted at the Transactions of @aclanthology (TACL)! 🎉

    The preprint is available here:

    The new @iwslt shared task on instruction following speech models is out! Test sets will be available on the 1st of April and participants have to submit their models by April 15th. Check out the description for more info (or get in touch with us):

    📢First Call for Papers 📢
    The 22nd @iwslt event will be co-located with @aclmeeting
    31 July-1 August 2025 –Vienna, Austria
    Scientific submission due March 15, 2025
    More details here:
    @marcfede @esalesk @ELRAnews @shashwatup9k @MarineCarpuat @_janius_

    Load More