Final lengthening in 17 DoReCo languages

We finalized processing of 17 languages and analyzed these regarding final lengthening. Results were presented at three conferences: the 12th International Seminar on Speech Production (poster), the 18th Old World Conference on Phonology (abstract), and the 43rd Annual Conference of the German Linguistic Society (DGfS) (workshop program). Thanks to the audiences for feedback! Here’s a snapshot of some of the results:

Congrats Asst Prof Easterday

It’s official: (Former) DoReCo project member Shelece Easterday will be
assistant professor at the University of Hawai’i. Congratulations,
Shelece! We’re looking forward to cooperating with you at U Hawai’i on
corpus-based, cross-linguistic studies on, e.g., phonological complexity.

50 languages!

Early into the second project year, we have now received data sets from more than 50 languages (see These data sets are currently at various stages of processing, but we have already fully processed and created alignments at the word and segment levels for the following five languages: Arapaho, Kamas, Svan, Urum, and Yongning Na. As the number of fully processed corpora grows, several exciting phonetic and morphological studies are already on their way, building on the research ideas described in Stay tuned for more info!

DoReCo workflow @ LREC

We are proud to announce our latest publication, in which we describe in detail DoReCo’s data processing workflow:
Paschen, Ludger, François Delafontaine, Christoph Draxler, Susanne Fuchs, Matthew Stave & Frank Seifart (2020). Building a Time-Aligned Cross-Linguistic Reference Corpus from LanguageDocumentation Data (DoReCo). Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), 2657–2666.
For a list of all DoReCo publications, see

PostDoc opportunity

DoReCo’s sister project QUEST in Berlin is looking for a PostDoc to work on optimizing fieldwork data for cross-linguistic research. We’re open to candidates proposing their own cross-linguistic, cross-corpus research questions for exploratory projects, using, e.g. DoReCo data. Check out details at

PhD opportunities

Our host institution in Berlin, Leibniz-ZAS, currently invites applications for PhD positions, deadline 15.3.2020: One possibility are PhD projects that exploit and further develop DoReCo. Potential applicants are welcome to contact Frank Seifart before applying. Spread the word among your students and colleagues!

Upcoming conference presentations

Come see DoReCo presenting our work at three venues over the next three months! On November 28-29 we will be presenting at the GDR-LIFT kick-off meeting in Orléans, France ( On December 13 we will be at the Workshop on Rate and Rhythm in Speech Recognition in Nijmegen, Netherlands ( And on January 2-5 we will be presenting at the Linguistics Society of America in New Orleans, USA (

We will be reporting, among other things, on our work with the MAUS system for phonemic time-alignment, developed by our project partners in Munich. Currently, the DoReCo corpus contains data from 40 languages, 20 of which have already been time-aligned, and many more on the way. If you’ll be at any of these three events, talk to us to find out more!