By the end of the DoReCo project, we had finalized processing data sets from the following 51 languages. Since then, we have been able to add two more languages, Gurindji (Pama-Nyungan, corpus created by Felicity Meakins) and Totoli (Austronesian, corpus created by Maria Bardají, Christoph Bracks, Claudia Leto, Datra Hasan, Sonja Riesberg, Winarno S. Alamudi, and Nikolaus P. Himmelmann). See https://doreco.huma-num.fr/languages for the full list of the currently available corpora
| Language | Family/Phylum | Corpus creator(s) |
|---|---|---|
| Anal | Sino-Tibetan | Pavel Ozerov |
| Arapaho | Algic | Andrew Cowell |
| Asimjeeg Datooga | Nilotic | Richard Griscom |
| Baïnounk Gubëeher | Atlantic-Congo | Alexander Yao Cobbinah |
| Beja | Afro-Asiatic | Martine Vanhove |
| Bora | Boran | Frank Seifart |
| Cabécar | Chibchan | Juan Diego Quesada, Stavros Skopeteas, Carolina Pasamonik, Carolin Brokmann & Florian Fischer |
| Cashinahua | Panoan | Sabine Reiter |
| Daakie | Austronesian | Manfred Krifka |
| Dalabon | Gunwinyguan | Maïa Ponsonnet |
| Dolgan | Turkic | Chris Lasse Däbritz, Nina Kudryakova, Eugénie Stapert & Alexandre Arkhipov |
| English | Indo-European | Nils Norman Schiborr |
| Evenki | Tungusic | Olga Kazakevich & Elena Klyachko |
| Fanbyak | Austronesian | Mike Franjieh |
| French (Switzerland) | Indo-European | Mathieu Avanzi, Marie-José Béguelin, Gilles Corminboeuf, Federica Diémoz & Laure Anne Johnsen |
| Goemai | Afro-Asiatic | Birgit Hellwig |
| Gorwaa | Afro-Asiatic | Andrew Harvey |
| Hoocąk | Siouan | Iren Hartmann |
| Jahai | Austroasiatic | Niclas Burenhult |
| Jejuan | Koreanic | Soung-U Kim |
| Kakabe | Mande | Alexandra Vydrina |
| Kamas | Uralic | Valentin Gusev, Tiina Klooster, Beáta Wagner-Nagy & Alexandre Arkhipov |
| Komnzo | Yam | Christian Döhler |
| Light Warlpiri | (mixed) | Carmel O'Shannessy |
| Lower Sorbian | Indo-European | Hauke Bartels, Marcin Szczepański, Kamil Thorquint-Stumpf & Serbski institut |
| Mojeño Trinitario | Arawakan | Françoise Rose |
| Movima | (isolate) | Katharina Haude |
| Nafsan (South Efate) | Austronesian | Nick Thieberger |
| Nisvai | Austronesian | Jocelyn Aznar |
| Northern Alta | Austronesian | Alexandro Garcia Laguia |
| Northern Kurdish (Kurmanji) | Indo-European | Geoffrey Haig, Maria Vollmer & Hanna Thiele |
| Nǁng | Tuu | Tom Güldemann, Martina Ernszt, Sven Siegmund & Alena Witzlack-Makarevich |
| Pnar | Austroasiatic | Hiram Ring |
| Resígaro | Arawakan | Frank Seifart |
| Ruuli | Atlantic-Congo | Alena Witzlack-Makarevich, Saudah Namyalo, Anatol Kiriggwajjo, Zarina Molochieva & Amos Atuhairwe |
| Sadu | Sino-Tibetan | Xianming Xu, Bibo Bai & Yan Yang |
| Sanzhi Dargwa | Nakh-Daghestanian | Diana Forker & Nils Norman Schiborr |
| Savosavo | (isolate) | Claudia Wegener |
| Sümi | Sino-Tibetan | Amos Teo & H Salome Kinny |
| Svan | Kartvelian | Jost Gippert |
| Tabaq (Karko) | Nubian | Birgit Hellwig, Gertrud Schneider-Blum & Ismail Khaleel Bakheet Khaleel |
| Teop | Austronesian | Ulrike Mosel |
| Tabasaran | Nakh-Daghestanian | Natalia Bogomolova, Dmitry Ganenkov & Nils Norman Schiborr |
| Texistepec Popoluca | Zoque | Søren Wichmann |
| Urum | Turkic | Stavros Skopeteas, Violeta Moisidi, Nutsa Tsetereli, Johanna Lorenz, Stefanie Schröter |
| Vera'a | Austronesian | Stefan Schnell |
| Warlpiri | Pama-Nyungan | Carmel O'Shannessy |
| Yali (Apahapsili) | Trans-New-Guinea | Sonja Riesberg |
| Yongning Na | Sino-Tibetan | Alexis Michaud |
| Yucatec Maya | Mayan | Stavros Skopeteas |
| Yurakaré | (isolate) | Sonja Gipper & Jeremías Ballivián Torrico |