Words statistics

Corpora token count

209,151,441

Corpus Token count PCT
Fiction 21,196,089 10.13%
Nonfiction 27,022,058 12.92%
Journalism 93,184,178 44.55%
Administrative literature 15,506,670 7.41%
Spoken language 715,176 0.34%