Earlier versions

  • prim-9.0 (2020), 1 650 million tokens, 73.96 % journalistic, 15.98 % fiction, 9.15 % professional, 0.91 % other texts
  • prim-8.0 (2018), 1 477 million tokens: 71.10 % journalistic, 15.22 % fiction, 8.51 % professional and 5.17 % other texts
  • prim-7.0 (2015), 1 250 million tokens: 65.1 % journalistic, 15.1 % fiction, 9.5 % professional and 10.3 % other texts
  • prim-6.1 (2013), 830 million tokens: 68.8 % journalistic, 13.9 % fiction, 15.3 % professional and 2 % other texts

  • prim-6.0 (2013), 1 155 million tokens: 77.8 % journalistic, 9.8 % fiction, 11 % professional and 1.4 % other texts

  • prim-5.0 (2011), 719 million tokens: 73% journalistic, 14% fiction, 12% professional and 1% other texts

  • prim-4.0 (2009), 526 million tokens: 65% journalistic, 17% fiction, 16% professional and 2% other texts

  • prim-3.0 (2007), 350 million tokens: 57% journalistic, 21.5% fiction, 18.5% professional and 3% other texts

  • prim-2.1 (2006), 300 million tokens: 63% journalistic, 20% fiction, 12% professional and 5% other texts

  • prim-2.0 (2005), 250 million tokens
  • prim1 (2004), 182 million tokens
  • prim0.2 (2003), 170 million tokens
  • prim0.1 (2003), 30 million tokens