7th version of the Corpus of Spoken Slovak released

The new version s-hovor-7.0 contains 869 records, it is composed of 851 hours of audio recordings containing 7 852 469 tokens.

The text transcriptions are lemmatized and morphologically anotated. The transcription metadata contain information about the participant, origin and content of the audio recording. A user can enter a word, lemma or pronunciation in the input field and the transcription will be displayed.

To search the corpus free registration is required. For more information, please visit the following website.