The pilot version of Literary Author Corpus released

The pilot version lak-1.0 was created in 2024, containing 177 938 tokens (140 536 words). The corpus is enriched by narratological annotation reflecting three literary annotation keys: narrator, direct speech, and embedded structures. It is comprised of the texts written by the prominent Slovak author Pavol Vilikovský (1941 – 2020).

The texts are also given information on style and genre annotation, they are lemmatized and morphologically annotated using Morphodita tagger.

More information can be found here.