Corpora token count |
208,387,526 |
| Corpus | Token count | PCT |
|---|---|---|
| Fiction | 21,196,089 | 10.17% |
| Nonfiction | 27,022,058 | 12.97% |
| Journalism | 93,147,343 | 44.7% |
| Administrative literature | 15,506,670 | 7.44% |
| Spoken language | 715,176 | 0.34% |