Article list
The new version, web-6.0, currently contains 4 373 231 228 tokens. Each text is accompanied by basic information on its URL and time of retrieval. Free registration is required to search the corpus. For more information, please visit the following website.
The new version, s-hovor-7.0, contains 869 records comprising 851 hours of audio recordings and 7 852 469 tokens. The text transcriptions are lemmatized and morphologically annotated. The transcription metadata contain information about the participant and about the origin and content of the audio recording. A user can enter a word, lemma or pronunciation in the…
The new version, dialekt-5.0, contains 980 643 tokens. CD SNC is neither lemmatized nor morphologically annotated. Users can browse the corpus by searching for a word or by using CQL. The transcribed texts contain sociolinguistic metadata about respondents and informants, as well as the origin and content of the recording. Free registration is required to search the corpus. For more information, please visit the…
The new version of the corpus, hist-6.0, contains 916 743 tokens. Texts in the Corpus of Historical Slovak are neither lemmatized nor morphologically annotated; users can search for a word form or use CQL. Transcriptions include information about the origin of the text, its storage (or release) and date. Free registration is required to search the corpus. For more information,…