Article list
The new version web-6.0 currently contains 4 373 231 228 tokens. The texts are given basic information on URL and time of retrieval. To search the corpus free registration is required. For more information, please visit the following website.
The new version of the main corpus of written texts prim-10.0 contains 1 688 211 881 tokens. The current version uses more precise software tools and more accurate configuration, it also offers improved tools and models developed in the SNC Department and Ľudovít Štúr Institute of Linguistics SAS. To search the corpus free registration is required. For more information, please visit…
The new version s-hovor-7.0 contains 869 records, it is composed of 851 hours of audio recordings containing 7 852 469 tokens. The text transcriptions are lemmatized and morphologically anotated. The transcription metadata contain information about the participant, origin and content of the audio recording. A user can enter a word, lemma or pronunciation in the…
The new version dialekt-5.0 containing 980 643 tokens. CD SNC is not lemmatised nor morphologically annotated. User can browse the corpus by searching for a word or using CQL. The transcribed texts contain sociolinguistic metadata about respondents, informants, origin and content of record. To search the corpus free registration is required. For more information, please visit the…