The aim of the project is to explore the use of large language models (LLMs) for machine translation, by asking two main questions: (i) in what scenarios can contextual information be effectively used via prompting? and (ii) for low-resource scenarios (with a focus on dialects and regional languages), can LLMs be effectively trained without any parallel data?
The project is led by Josep Crego (Systran).
My role in the project is leader for Inria and head of the work package on translation for low-resource translation without parallel data.
2023–2026
ANR
The aim of the project is to develop new methods for the machine translation (MT) of complete scientific documents, as well as automatic metrics to evaluate the quality of these translations. Our main application target is the translation of scientific articles between French and English, where linguistic resources can be exploited to obtain more reliable translations, both for publication purposes and for gisting and text mining.
The project is led by François Yvon (CNRS).
My role in the project is leader for Inria and head of the work package on the evaluation of the MT for scientific documents.
2023–2026
ANR
The Inria DEFI COLaF (Corpus and Tools for the Languages of France) aims to provide open-source datasets and tools for automatic text and speech processing for the languages and speakers of France.
The project is led by Benoît Sagot (ALMAnaCH team) and Slim Ouni (MULTISPEECH team).
2025–2028
DEFI
PR[AI]RIE-PSAI (Paris School of AI), a follow-up to the PRAIRIE institute, is the largest of the AI Clusters established as part of the France 2030 national strategy. Led by PSL University, the project aims to create an internationally renowned school specialising in artificial intelligence. Its goal is to advance knowledge in AI, provide world-class higher education, and produce groundbreaking innovations in this field.
My role in the project is fellow.
2025–2028
ANR