By the end of the project, we had finished processing data sets from the following 50 (plus one) languages:

Language | Family/Phylum | Corpus creator(s) |
---|---|---|
Anal | Sino-Tibetan | Pavel Ozerov |
Arapaho | Algic | Andrew Cowell |
Asimjeeg Datooga | Nilotic | Richard Griscom |
Baïnounk Gubëeher | Atlantic-Congo | Alexander Yao Cobbinah |
Beja | Afro-Asiatic | Martine Vanhove |
Bora | Boran | Frank Seifart |
Cabécar | Chibchan | Juan Diego Quesada, Stavros Skopeteas, Carolina Pasamonik, Carolin Brokmann & Florian Fischer |
Cashinahua | Panoan | Sabine Reiter |
Daakie | Austronesian | Manfred Krifka |
Dalabon | Gunwinyguan | Maïa Ponsonnet |
Dolgan | Turkic | Chris Lasse Däbritz, Nina Kudryakova, Eugénie Stapert & Alexandre Arkhipov |
English | Indo-European | Nils Norman Schiborr |
Evenki | Tungusic | Olga Kazakevich & Elena Klyachko |
Fanbyak | Austronesian | Mike Franjieh |
French (Switzerland) | Indo-European | Mathieu Avanzi, Marie-José Béguelin, Gilles Corminboeuf, Federica Diémoz & Laure Anne Johnsen |
Goemai | Afro-Asiatic | Birgit Hellwig |
Gorwaa | Afro-Asiatic | Andrew Harvey |
Hoocąk | Siouan | Iren Hartmann |
Jahai | Austroasiatic | Niclas Burenhult |
Jejuan | Koreanic | Soung-U Kim |
Kakabe | Mande | Alexandra Vydrina |
Kamas | Uralic | Valentin Gusev, Tiina Klooster, Beáta Wagner-Nagy & Alexandre Arkhipov |
Komnzo | Yam | Christian Döhler |
Light Warlpiri | (mixed) | Carmel O'Shannessy |
Lower Sorbian | Indo-European | Hauke Bartels, Marcin Szczepański, Kamil Thorquint-Stumpf & Serbski institut |
Mojeño Trinitario | Arawakan | Françoise Rose |
Movima | (isolate) | Katharina Haude |
Nafsan (South Efate) | Austronesian | Nick Thieberger |
Nisvai | Austronesian | Jocelyn Aznar |
Northern Alta | Austronesian | Alexandro Garcia Laguia |
Northern Kurdish (Kurmanji) | Indo-European | Geoffrey Haig, Maria Vollmer & Hanna Thiele |
Nǁng | Tuu | Tom Güldemann, Martina Ernszt, Sven Siegmund & Alena Witzlack-Makarevich |
Pnar | Austroasiatic | Hiram Ring |
Resígaro | Arawakan | Frank Seifart |
Ruuli | Atlantic-Congo | Alena Witzlack-Makarevich, Saudah Namyalo, Anatol Kiriggwajjo, Zarina Molochieva & Amos Atuhairwe |
Sadu | Sino-Tibetan | Xianming Xu, Bibo Bai & Yan Yang |
Sanzhi Dargwa | Nakh-Daghestanian | Diana Forker & Nils Norman Schiborr |
Savosavo | (isolate) | Claudia Wegener |
Sümi | Sino-Tibetan | Amos Teo & H Salome Kinny |
Svan | Kartvelian | Jost Gippert |
Tabaq (Karko) | Nubian | Birgit Hellwig, Gertrud Schneider-Blum & Ismail Khaleel Bakheet Khaleel |
Tabasaran | Nakh-Daghestanian | Natalia Bogomolova, Dmitry Ganenkov & Nils Norman Schiborr |
Teop | Austronesian | Ulrike Mosel |
Texistepec Popoluca | Mixe-Zoque | Søren Wichmann |
Urum | Turkic | Stavros Skopeteas, Violeta Moisidi, Nutsa Tsetereli, Johanna Lorenz & Stefanie Schröter |
Vera'a | Austronesian | Stefan Schnell |
Warlpiri | Pama-Nyungan | Carmel O'Shannessy |
Yali (Apahapsili) | Trans-New-Guinea | Sonja Riesberg |
Yongning Na | Sino-Tibetan | Alexis Michaud |
Yucatec Maya | Mayan | Stavros Skopeteas |
Yurakaré | (isolate) | Sonja Gipper & Jeremías Ballivián Torrico |