We have released errkorp-3.0, the third version of the acquisition corpus. It contains 953,156 tokens, which is 225,000 tokens more than the previous version. You can find it in the SNC Corpora – Learner Corpora section after logging into your account.
The corpus comprises 3,054 texts written by students with different mother tongues and different levels of proficiency in Slovak.
You can search:
- by correct and incorrect words;
- by specific error and correction tags;
- using CQL.
The corpus is free to use after registration.
After the APVV project ended, the Department of Slovak National Corpus of the Ľ. Štúr Institute of Linguistics prepared the third version of the acquisition corpus as part of the project Building and Development of the Slovak National Corpus (5th Stage).
For more information, please visit the website.