News

Loading

Weekly pick from the #MeetweenScientificWatch: โ€œVideo-SALMONN: Speech-enhanced audio-visual large language modelsโ€ โ€“ Redefining video comprehension with speech-aware AV-LLMs and groundbreaking QA accuracy. ๐ŸŽฅ๐ŸŽค๐Ÿค–

Iโ€™m glad to announce that our work โ€œHow "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?โ€ has been accepted at the Transactions of @aclanthology (TACL)! ๐ŸŽ‰

The preprint is available here:

The new @iwslt shared task on instruction following speech models is out! Test sets will be available on the 1st of April and participants have to submit their models by April 15th. Check out the description for more info (or get in touch with us):

๐Ÿ“ขFirst Call for Papers ๐Ÿ“ข
The 22nd @iwslt event will be co-located with @aclmeeting
31 July-1 August 2025 โ€“Vienna, Austria
Scientific submission due March 15, 2025
More details here:
@marcfede @esalesk @ELRAnews @shashwatup9k @MarineCarpuat @_janius_

Load More