5th version of the Corpus of Dialects of the SNC released

The new version, dialekt-5.0, contains 980 643 tokens.

The CD SNC is neither lemmatised nor morphologically annotated. Users can browse the corpus by searching for a word or by using CQL. The transcribed texts include sociolinguistic metadata about the respondents and informants, and about the origin and content of each recording.
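Because the corpus has no lemma or morphological annotation, a CQL query can in practice only match on the surface word form. The following is a minimal sketch of how such queries might be assembled before pasting them into the search interface. The helper names `cql_token` and `cql_sequence` are hypothetical, the attribute name `word` is assumed to be the corpus's word-form attribute (the usual default in CQL-based tools), and the example word forms are purely illustrative.

```python
def cql_token(pattern: str, attribute: str = "word") -> str:
    """Build one CQL token expression such as [word="voda"].

    `pattern` is treated as a regular expression, as is usual in CQL;
    only double quotes are escaped so the expression stays well-formed.
    The attribute name "word" is an assumption about the corpus setup.
    """
    escaped = pattern.replace('"', '\\"')
    return f'[{attribute}="{escaped}"]'


def cql_sequence(patterns: list[str]) -> str:
    """Join several token expressions into a query for adjacent tokens."""
    return " ".join(cql_token(p) for p in patterns)


if __name__ == "__main__":
    # A single word form.
    print(cql_token("voda"))
    # Two adjacent tokens; the second matches any form starting with "dvor".
    print(cql_sequence(["na", "dvor.*"]))
```

Since dialect transcriptions often record several spellings of the same word, a regular expression over the word form (for example `dvor.*`) is one way to approximate what a lemma search would do in an annotated corpus.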

Searching the corpus requires free registration. For more information, please visit the following website.