Open Language Data Initiative

community

AI & ML interests

Multilingual NLP, underserved languages

Open Language Data Initiative

Welcome!

The Open Language Data Initiative (OLDI) empowers language communities around the globe to contribute to a database that drives the foundation of today’s machine translation and natural language processing work. We invite community, academic, and industry members to contribute to key datasets that are imperative to the organic expansion of language technology’s reach.

For more information, visit oldi.org.

models

None public yet