Languages

As of January 2020, we have received complete data sets for the following 43 languages and have begun processing them:

| Language | Family/Phylum | Corpus creator(s) |
| --- | --- | --- |
| Anal | Sino-Tibetan | Pavel Ozerov |
| Arapaho | Algonquian | Andrew Cowell |
| Asimjeeg Datooga | Nilotic | Richard Griscom |
| Bainounk Gujaher | Niger-Congo | Amadou Bèye |
| Beja | Afro-Asiatic | Martine Vanhove |
| Bora | Boran | Frank Seifart |
| Daakaka | Austronesian | Kilu von Prince |
| Daakie | Austronesian | Manfred Krifka |
| Fanbyak | Austronesian | Mike Franjieh |
| Goemai | Afro-Asiatic | Birgit Hellwig |
| Gorwaa | Afro-Asiatic | Andrew Harvey |
| Gurindji | Pama-Nyungan | Felicity Meakins |
| Gurindji Kriol | (Mixed) | Felicity Meakins |
| Jahai | Austroasiatic | Niclas Burenhult, Nicole Kruspe |
| Jakarta Indonesian | Austronesian | Bradley Taylor, David Gil |
| Kagate (Syuba) | Sino-Tibetan | Lauren Gawne |
| Kakabe | Niger-Congo | Alexandra Vydrina |
| Kamas | Uralic | Valentin Gusev, Tiina Klooster, Beáta Wagner-Nagy, Alexandre Arkhipov |
| Katla | Niger-Congo | Birgit Hellwig |
| Komnzo | Yam | Christian Döhler |
| Lower Sorbian | Indo-European | Hauke Bartels, Marcin Szczepański, Serbski institut |
| Mavea | Austronesian | Valérie Guérin |
| Mojeño Trinitario | Arawakan | Françoise Rose |
| Movima | (isolate) | Katharina Haude |
| Mwotlap | Austronesian | Alex François |
| Nafsan | Austronesian | Nick Thieberger, Ana Krajinovic |
| Northern Alta | Austronesian | Alexandro Garcia-Laguia |
| Pnar | Austroasiatic | Hiram Ring, Nicole Kruspe |
| Resígaro | Arawakan | Frank Seifart |
| Ruuli | Niger-Congo | Alena Witzlack-Makarevich, Saudah Namyalo, Anatol Kiriggwajjo, Zarina Molochieva, Amos Atuhairwe |
| Sadu | Sino-Tibetan | Xianming Xu |
| Savosavo | Austronesian | Claudia Wegener |
| Sumi | Sino-Tibetan | Amos Teo |
| Tabaq (Karko) | Nubian | Birgit Hellwig |
| Teop | Austronesian | Ulrike Mosel |
| Totoli | Austronesian | Maria Bardají i Farré |
| Urum | Turkic | Stavros Skopeteas |
| Vera'a | Austronesian | Stefan Schnell |
| Western Pantar | Timor-Alor-Pantar | Gary Holton |
| Yali | Trans-New-Guinea | Sonja Riesberg |
| Yanomami | Yanomamic | Helder Perri Ferreira |
| Yongning Na | Sino-Tibetan | Alexis Michaud |
| Yurakaré | (isolate) | Sonja Gipper |