The first version of the text corpus for foreigners learning Slovak as a foreign language released – ERRKORP

In cooperation with the Faculty of Arts of the Comenius University in Bratislava, the Faculty of Arts of University in Prešov and the Faculty of Arts of the Matej Bel University in Banská Bystrica, we made available the 1st version of the text corpus for foreigners learning Slovak as a foreign language.

The corpus errkorp-1.0 contains 347,395 tokens and you can find it among the SNK corpora in the section Written Corpus – Acquisition Corpus, after you sign in to your account.

The corpus is comprised of 1,063 texts written by students learning Slovak as a foreign language, with different mother tongues and different knowledge of Slovak. The current version contains, at the level of manual annotation of marked errors, qualitatively improved data, compared to the pilot version, and also new supplemented data.

To search the corpus free registration is required. For more information, please visit the following website.

Corpus creation has been supported by the Slovak Research and Development Agency as part of the project APVV-19-0155.