The current version of the corpus web-8.0 contains 5 889 464 749 tokens. Compared to the previous version, the corpus increased by approximately 600 000 000 tokens. The latest version of the web corpus can be found in your SNK account in the section Written Corpora – Web Corpora.
The corpus is currently available free of charge (after registration).
More information can be found here.
If you use the corpus in your work or wish to cite it, please use the following reference: