Article list
The new version, s-hovor-7.0, contains 869 records comprising 851 hours of audio recordings and 7 852 469 tokens. The text transcriptions are lemmatized and morphologically annotated. The transcription metadata contain information about the participants, origin and content of the audio recording. A user can enter a word, lemma or pronunciation in the…
The new version, dialekt-5.0, contains 980 643 tokens. CD SNC is neither lemmatised nor morphologically annotated. Users can browse the corpus by searching for a word or by using CQL. The transcribed texts contain sociolinguistic metadata about the respondents, informants, origin and content of the recording. To search the corpus, free registration is required. For more information, please visit the…
The new version, par-skes-2.0, comprises 225 publications containing 35.6 million tokens (18.9 million tokens in the Spanish half and 16.7 million tokens in the Slovak half). To search the corpus, free registration is required. For more information, please visit the following website.
The Slovak-German parallel corpus has been extended by more than 170 publications, so the current version contains 468 million tokens (229.9 million tokens in the Slovak part and 238.1 million tokens in the German part). To search the corpus, free registration is required. For more information, please visit the following website.