Words statistics

Corpora token count

208,459,634

Corpus Token count PCT
Fiction 21,196,089 10.17%
Nonfiction 27,022,058 12.96%
Journalism 93,194,172 44.71%
Administrative literature 15,506,670 7.44%
Spoken language 715,176 0.34%