Corpora

INES

The INclusive Evaluation Suite (INES) is a test set designed to assess MT systems' ability to produce gender-inclusive translations for the German→English language pair. By design, each German source sentence in INES includes an...


GeNTE

GeNTE (Gender-Neutral Translation Evaluation) is a natural, bilingual corpus designed to benchmark the ability of machine translation systems to generate gender-neutral translations. Built from European Parliament speeches,...


MuST-C

MuST-C is a multilingual speech translation corpus whose size and quality facilitate the training of end-to-end systems for speech translation from English into several languages. For each target language, MuST-C comprises...


Interested in speech translation evaluation?

Check out our LREC-COLING paper where we dive into the details, and release human assessments for @iwslt ST models!
Data on HuggingFace🤗

🧑‍🏫: Thurs 9:00-10:40 in Poster Area II
📝: https://aclanthology.org/2024.lrec-main.575.pdf
🤗: https://huggingface.co/datasets/IWSLT/da2023

🐾🐾 How do hyenas deal with human speech?

Discover how Hyena can be adapted to understand speech and how well it can transcribe and translate its content!

Visit our poster! The poster session @LrecColing is live now, Area 1!

#LRECCOLING2024 #speech

🐾 Do you wanna see a Hyena in action? 🐾

🥁 Tomorrow @mgaido91 will present the paper "How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena" at @LrecColing!

⏲️ Poster session 1: 11:00-12:40

🦴 See you there!

#NLProc #LRECCOLING2024 #ASR #ST
