The FACILE project aimed to develop a system for the precise and specific categorisation of finance and business news texts. The intended users were financial and commercial institutions with a vital interest in up-to-date business information from online news agencies and periodicals. An important consideration in FACILE was its use across country and language borders: the ability to process texts in various languages and to derive factual information in a language-independent, formatted form allowed information to be disseminated rapidly across borders.
Our pick of the week by @FBKZhihangXie: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
#speech #speechprocessing #speechtech #translation
AdvST: Adversarial training aligns speech and text distributions without parallel data! Combines adversarial learning + hidden-state swapping to fix length mismatch & boost low-resource speech translation. https://ieeexplore.ieee.org/document/10888294
A special evening in Rome to talk about Physical AI and Europe's role in shaping this new frontier.
Partners from across Europe came together to present the DVPS project and connect with key people from public institutions, embassies, industry, and national & international media.
Thrilled to be part of this amazing project and team!
DVPS has launched at Translated's HQ!
70 researchers from 20 institutions across 9 countries unite to build next-gen multimodal foundation models that learn from real-world interaction.
A new European AI journey begins.
#DVPS #PhysicalAI #HorizonEurope #MultimodalAI
Our pick of the week by @FBKZhihangXie: "PHRASED: Phrase Dictionary Biasing for Speech Translation" by Peidong Wang, Jian Xue, Rui Zhao, @ChenJunkun, Aswin Shanmugam Subramanian, and Jinyu Li (2025).
#Speech #SpeechAI #Translation #ST #SpeechTranslation
Boost rare-phrase translation in speech! Uses **bilingual dictionaries** to dynamically bias outputs.
**+21%** recall in streaming ST
**+85%** in multimodal LLMs
http://arxiv.org/abs/2506.09175