Kolokat

KOLOKAT is a tool for collocation visualization. It is used to visualize the distance between the components in (two) word combinations in text corpora.

Since some phrases are more fixed than other, the graphical representation of collocations shows whether the phrase is “tightly-knit” or “loose”, how many words occur in between the individual components and in how many instances. Some expressions are fixed (Kitty cat), in some, the second word may be modified by other attributes (be careful, but also be very careful, be extremely careful, be as careful as possible), sometimes the second member “must” be modified («Ambassador of the Kingdom» is practically non-existent, it is always the Ambassador of the United Kingdom, the Ambassador of the Kingdom of Spain, the Ambassador of the Kingdom of Belgium…), etc.

 

For collocation visualization, please click here.

 

The first version of KOLOKAT was made available in the SNC on October 30, 2014 (for Slovak only, prim-6.1-public-all).

The latest version was made available in March 2023 (ARANEA Corpora family). The tool has been currently developing by the Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, not within the project Building and Development of the Slovak National Corpus (5th Stage).