EC Short Clips
EC Short Clips is a test set dedicated to evaluate automatic subtitling systems.
Read Moreby Marco Gaido | Jul 7, 2023 | Corpora | 0
EC Short Clips is a test set dedicated to evaluate automatic subtitling systems.
Read Moreby Marco Gaido | Jul 7, 2023 | Corpora | 0
EuroParl Interviews is a test set dedicated to evaluate automatic subtitling systems.
Read Moreby Roldano Cattoni | Jul 4, 2023 | Corpora | 0
MuST-SHE: a multilingual benchmark allowing for a fine-grained analysis of gender bias in Machine Translation and Speech Translation.
Read Moreby Luisa Bentivogli | Jun 1, 2023 | Corpora | 0
MuST-SHE: a multilingual benchmark allowing for a fine-grained analysis of gender bias in Machine Translation and Speech Translation.
Read Moreby Matteo Negri | Jun 1, 2023 | Corpora | 0
Multilingual benchmark built from European Parliament speeches and annotated with Named Entities and Terminology
Read Moreby Mauro Cettolo | May 30, 2023 | Corpora | 0
Ready-to-use version for MT research purposes of the multilingual transcriptions of TED talks
Read Moreby Mauro Cettolo | May 30, 2023 | Corpora | 0
Annotation of dubbing segments based on the Heroes corpus
Read Moreby Beatrice Savoldi | May 30, 2023 | Corpora | 0
This multilingual dataset was created within the TOSCA-MP project as ground truth data for the evaluation of automatic transcription and spoken language translation technologies.
Read Moreby Marco Gaido | May 30, 2023 | Corpora | 0
MuST-C validation set specifically designed to train ST systems for experiments on gender translation.
Read Moreby Beatrice Savoldi | May 30, 2023 | Corpora | 0
MuST-Speakers is a resource designed to i) foster research around gender bias in speech translation (ST) and machine translation (MT), and ii) facilitate the development of gender-enhanced translation models.
Read Moreby Marco Gaido | May 30, 2023 | Corpora | 0
The largest freely-available Synthetic Corpus for Automatic Post-Editing
Read More
‼️ *Paper, code, models, and outputs out* for one of our last papers "AlignAtt" about Simultaneous Speech Translation recently published at #Interspeech2023!
📍Official Paper: https://www.isca-speech.org/archive/pdfs/interspeech_2023/papi23_interspeech.pdf
📍Repo (code, etc.): https://github.com/hlt-mt/FBK-fairseq/blob/master/fbk_works/ALIGNATT_SIMULST_AGENT_INTERSPEECH2023.md
#NLProc #NLP #speech #translation
Our pick of the week by @apierg: "Prompt-Driven Neural Machine Translation" by Li et al., Findings ACL 2022.
#machine #translation #MT #NLP #NLProc #ACL #computational #linguistics #prompt
The call for diversity and inclusion (D&I) subsidies for #EMNLP2023 is online!
EMNLP 2023 is providing D&I funds for registration, caregiving, bandwidth, travel and VPN subsidies.
Deadline: October 20, 2023 11:59pm (Anywhere on Earth)
Details:
#NLProc
One of the very first multilingual and multimodal model to obtain performance competitive with dedicated models, maybe the first of many? Anyway, a very interesting read: https://arxiv.org/pdf/2308.11466.pdf