Article list
Slovak-Hungarian Parallel Corpus par-skhu-1.1 currently contains about 51 million tokens in the Slovak part and almost 48 million tokens in the Hungarian part. Compared to previous version, text annotation for both languages is more detailed and extensive. We believe that our corpora could be helpful in the comparative research, as well as in the field…
Please watch a short instructional video on How to search for the most frequent words beginning with “GEO”. The video is available on our YouTube channel, or Facebook website. User can also find it here.
The corpus errkorp–pilot contains 137 393 tokens. Errkorp-pilot can be accessed through NoSkE in the SNC user account (in part Written corpora Acquisition corpora). ERRKORP is compiled of texts of non-native speakers of Slovak so that the language errors can be observed and described, as well as the correlations between them explored. To search the…
The new version web-6.0 currently contains 4 373 231 228 tokens. The texts are given basic information on URL and time of retrieval. To search the corpus free registration is required. For more information, please visit the following website.