This page presents the lexicographical coverage of the Wikidata lexicographical data compared to a corpus of the given language. Unless the entry for the language says otherwise, the corpora are based on Wikipedia (available here ).
These pages are updated weekly on Wednesdays by NikkiBot , although no edit will be made if nothing has changed.
The code for the bot is at https://github.com/nikkiwd/lexcover and is based on the original PAWS notebook by Denny. Report issues to Nikki (either on User talk:Nikki or on Telegram). Requests for additional languages, improvements and suggestions are also welcome.
Words can be filtered out by adding them to the "Filter" subpage for the language (e.g. Wikidata:Lexicographical coverage/nb/Filter ). The entries in the list can be customised, e.g. to add search links, by editing the "Missing/row" subpage (e.g. Wikidata:Lexicographical coverage/nb/Missing/row ). It is also possible to add things before and after the list, e.g. if you want the output to be a table, by editing the "Missing/head" and "Missing/foot" subpages.
More information:
More statistics:
Forms in Wikidata: 2,777
Forms in Wikipedia: 246,598
Tokens: 69,840,956
Covered forms: 574 (0.2%)
Missing forms: 246,024 (99.8%)
Covered tokens: 1,938,703 (2.8%)
Missing tokens: 67,902,253 (97.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 233
Forms in Wikipedia: 118,514
Tokens: 33,132,887
Covered forms: 200 (0.2%)
Missing forms: 118,314 (99.8%)
Covered tokens: 775,767 (2.3%)
Missing tokens: 32,357,120 (97.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 9,937
Forms in Wikipedia: 9,552
Tokens: 1,459,030
Covered forms: 2,014 (21.1%)
Missing forms: 7,538 (78.9%)
Covered tokens: 1,080,960 (74.1%)
Missing tokens: 378,070 (25.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
(This analysis was performed separately from all the others on this page, using the corpus linked here and custom counting code.)
Forms in Wikidata: 46,504
Forms in Wikipedia: 5,34,894
Tokens: 1,33,06,025
Covered forms: 13,900 (2.60%)
Missing forms: 5,20,994 (97.40%)
Covered tokens: 50,79,352 (38.17%)
Missing tokens: 82,26,673 (61.83%)
Most frequent missing forms
Covered: 13,900
Missing: 520,994
Covered: 5,079,352
Missing: 8,226,673
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 7
Forms in Wikipedia: 35,431
Tokens: 3,876,195
Covered forms: 4 (0.0%)
Missing forms: 35,427 (100.0%)
Covered tokens: 392 (0.0%)
Missing tokens: 3,875,803 (100.0%)
Most frequent missing forms
Covered: 4
Missing: 35,427
Covered: 392
Missing: 3,875,803
Forms in Wikidata: 254
Forms in Wikipedia: 176,311
Tokens: 108,297,498
Covered forms: 185 (0.1%)
Missing forms: 176,126 (99.9%)
Covered tokens: 22,360,767 (20.6%)
Missing tokens: 85,936,731 (79.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 212,419
Forms in Wikipedia: 261,374
Tokens: 74,084,890
Covered forms: 53,642 (20.5%)
Missing forms: 207,732 (79.5%)
Covered tokens: 51,472,250 (69.5%)
Missing tokens: 22,612,640 (30.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 169
Forms in Wikipedia: 10,844
Tokens: 1,442,683
Covered forms: 99 (0.9%)
Missing forms: 10,745 (99.1%)
Covered tokens: 120,454 (8.3%)
Missing tokens: 1,322,229 (91.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 586,235
Forms in Wikipedia: 111,139
Tokens: 30,879,404
Covered forms: 63,389 (57.0%)
Missing forms: 47,750 (43.0%)
Covered tokens: 28,632,672 (92.7%)
Missing tokens: 2,246,732 (7.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 206,983
Forms in Wikipedia: 1,008,036
Tokens: 596,433,479
Covered forms: 109,798 (10.9%)
Missing forms: 898,238 (89.1%)
Covered tokens: 478,573,493 (80.2%)
Missing tokens: 117,859,986 (19.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 38,904
Forms in Wikipedia: 129,276
Tokens: 40,452,744
Covered forms: 16,793 (13.0%)
Missing forms: 112,483 (87.0%)
Covered tokens: 18,421,537 (45.5%)
Missing tokens: 22,031,207 (54.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 113,054
Forms in Wikipedia: 965,225
Tokens: 1,508,248,447
Covered forms: 78,319 (8.1%)
Missing forms: 886,906 (91.9%)
Covered tokens: 1,404,551,605 (93.1%)
Missing tokens: 103,696,842 (6.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 8,928
Forms in Wikipedia: 27,201
Tokens: 4,222,541
Covered forms: 3,632 (13.4%)
Missing forms: 23,569 (86.6%)
Covered tokens: 2,654,687 (62.9%)
Missing tokens: 1,567,854 (37.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 565,813
Forms in Wikipedia: 372,589
Tokens: 405,914,020
Covered forms: 93,209 (25.0%)
Missing forms: 279,380 (75.0%)
Covered tokens: 372,914,517 (91.9%)
Missing tokens: 32,999,503 (8.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 2,637,179
Forms in Wikipedia: 123,073
Tokens: 16,832,892
Covered forms: 72,767 (59.1%)
Missing forms: 50,306 (40.9%)
Covered tokens: 13,698,989 (81.4%)
Missing tokens: 3,133,903 (18.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 1,002,660
Forms in Wikipedia: 26,466
Tokens: 3,138,442
Covered forms: 16,173 (61.1%)
Missing forms: 10,293 (38.9%)
Covered tokens: 2,381,172 (75.9%)
Missing tokens: 757,270 (24.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 36,898
Forms in Wikipedia: 100,251
Tokens: 44,426,012
Covered forms: 7,885 (7.9%)
Missing forms: 92,366 (92.1%)
Covered tokens: 11,574,855 (26.1%)
Missing tokens: 32,851,157 (73.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 9,506
Forms in Wikipedia: 276,898
Tokens: 46,847,582
Covered forms: 5,359 (1.9%)
Missing forms: 271,539 (98.1%)
Covered tokens: 13,003,345 (27.8%)
Missing tokens: 33,844,237 (72.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 254,980
Forms in Wikipedia: 465,138
Tokens: 474,988,250
Covered forms: 55,990 (12.0%)
Missing forms: 409,148 (88.0%)
Covered tokens: 416,827,377 (87.8%)
Missing tokens: 58,160,873 (12.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 1,483
Forms in Wikipedia: 4,816
Tokens: 859,259
Covered forms: 535 (11.1%)
Missing forms: 4,281 (88.9%)
Covered tokens: 344,400 (40.1%)
Missing tokens: 514,859 (59.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 320,226
Forms in Wikipedia: 249,890
Tokens: 76,643,376
Covered forms: 53,635 (21.5%)
Missing forms: 196,255 (78.5%)
Covered tokens: 43,946,047 (57.3%)
Missing tokens: 32,697,329 (42.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 7,863
Forms in Wikipedia: 54,443
Tokens: 18,734,831
Covered forms: 3,153 (5.8%)
Missing forms: 51,290 (94.2%)
Covered tokens: 12,480,752 (66.6%)
Missing tokens: 6,254,079 (33.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 5,704
Forms in Wikipedia: 135,627
Tokens: 28,543,040
Covered forms: 3,092 (2.3%)
Missing forms: 132,535 (97.7%)
Covered tokens: 13,695,305 (48.0%)
Missing tokens: 14,847,735 (52.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 158
Forms in Wikipedia: 274,652
Tokens: 64,674,851
Covered forms: 104 (0.0%)
Missing forms: 274,548 (100.0%)
Covered tokens: 270,693 (0.4%)
Missing tokens: 64,404,158 (99.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 393,739
Forms in Wikipedia: 100,137
Tokens: 40,049,055
Covered forms: 16,877 (16.9%)
Missing forms: 83,260 (83.1%)
Covered tokens: 23,075,996 (57.6%)
Missing tokens: 16,973,059 (42.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 3,754
Forms in Wikipedia: 1,153
Tokens: 113,878
Covered forms: 458 (39.7%)
Missing forms: 695 (60.3%)
Covered tokens: 79,771 (70.0%)
Missing tokens: 34,107 (30.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 412,568
Forms in Wikipedia: 341,080
Tokens: 284,500,580
Covered forms: 102,138 (29.9%)
Missing forms: 238,942 (70.1%)
Covered tokens: 263,389,055 (92.6%)
Missing tokens: 21,111,525 (7.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 573
Forms in Wikipedia: 290,844
Tokens: 34,282,183
Covered forms: 465 (0.2%)
Missing forms: 290,379 (99.8%)
Covered tokens: 2,491,378 (7.3%)
Missing tokens: 31,790,805 (92.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 778,856
Forms in Wikipedia: 11,551
Tokens: 1,031,544
Covered forms: 8,326 (72.1%)
Missing forms: 3,225 (27.9%)
Covered tokens: 884,102 (85.7%)
Missing tokens: 147,442 (14.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 1,032
Forms in Wikipedia: 10,240
Tokens: 1,365,293
Covered forms: 417 (4.1%)
Missing forms: 9,823 (95.9%)
Covered tokens: 541,377 (39.7%)
Missing tokens: 823,916 (60.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 96
Forms in Wikipedia: 92,063
Tokens: 13,288,668
Covered forms: 46 (0.0%)
Missing forms: 92,017 (100.0%)
Covered tokens: 63,094 (0.5%)
Missing tokens: 13,225,574 (99.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 2,176
Forms in Wikipedia: 60,189
Tokens: 8,004,635
Covered forms: 1,299 (2.2%)
Missing forms: 58,890 (97.8%)
Covered tokens: 2,485,421 (31.0%)
Missing tokens: 5,519,214 (69.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 748,805
Forms in Wikipedia: 28,789
Tokens: 1,992,352
Covered forms: 8,837 (30.7%)
Missing forms: 19,952 (69.3%)
Covered tokens: 1,074,456 (53.9%)
Missing tokens: 917,896 (46.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 8,899
Forms in Wikipedia: 51,515
Tokens: 16,143,659
Covered forms: 5,797 (11.3%)
Missing forms: 45,718 (88.7%)
Covered tokens: 13,100,699 (81.2%)
Missing tokens: 3,042,960 (18.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 631
Forms in Wikipedia: 5,941
Tokens: 371,515
Covered forms: 298 (5.0%)
Missing forms: 5,643 (95.0%)
Covered tokens: 112,211 (30.2%)
Missing tokens: 259,304 (69.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 42
Forms in Wikipedia: 5,791
Tokens: 1,119,898
Covered forms: 2 (0.0%)
Missing forms: 5,789 (100.0%)
Covered tokens: 63 (0.0%)
Missing tokens: 1,119,835 (100.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 160,313
Forms in Wikipedia: 153,555
Tokens: 49,620,256
Covered forms: 48,545 (31.6%)
Missing forms: 105,010 (68.4%)
Covered tokens: 44,395,505 (89.5%)
Missing tokens: 5,224,751 (10.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 13,581
Forms in Wikipedia: 260,266
Tokens: 130,343,371
Covered forms: 7,932 (3.0%)
Missing forms: 252,334 (97.0%)
Covered tokens: 92,887,092 (71.3%)
Missing tokens: 37,456,279 (28.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 91,980
Forms in Wikipedia: 23,956
Tokens: 4,198,152
Covered forms: 8,962 (37.4%)
Missing forms: 14,994 (62.6%)
Covered tokens: 3,440,465 (82.0%)
Missing tokens: 757,687 (18.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 15,858
Forms in Wikipedia: 21,156
Tokens: 4,611,923
Covered forms: 3,358 (15.9%)
Missing forms: 17,798 (84.1%)
Covered tokens: 3,382,864 (73.4%)
Missing tokens: 1,229,059 (26.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 19,441
Forms in Wikipedia: 333,225
Tokens: 117,356,732
Covered forms: 7,624 (2.3%)
Missing forms: 325,601 (97.7%)
Covered tokens: 43,485,887 (37.1%)
Missing tokens: 73,870,845 (62.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 15,908
Forms in Wikipedia: 21,465
Tokens: 5,029,117
Covered forms: 1,868 (8.7%)
Missing forms: 19,597 (91.3%)
Covered tokens: 2,442,022 (48.6%)
Missing tokens: 2,587,095 (51.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 35,334
Forms in Wikipedia: 214,847
Tokens: 158,056,230
Covered forms: 13,944 (6.5%)
Missing forms: 200,903 (93.5%)
Covered tokens: 118,464,506 (75.0%)
Missing tokens: 39,591,724 (25.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 111
Forms in Wikipedia: 119,245
Tokens: 40,889,103
Covered forms: 87 (0.1%)
Missing forms: 119,158 (99.9%)
Covered tokens: 502,923 (1.2%)
Missing tokens: 40,386,180 (98.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 914,665
Forms in Wikipedia: 651,825
Tokens: 290,067,562
Covered forms: 141,255 (21.7%)
Missing forms: 510,570 (78.3%)
Covered tokens: 193,338,165 (66.7%)
Missing tokens: 96,729,397 (33.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 978
Forms in Wikipedia: 11,146
Tokens: 1,533,326
Covered forms: 81 (0.7%)
Missing forms: 11,065 (99.3%)
Covered tokens: 391,591 (25.5%)
Missing tokens: 1,141,735 (74.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 57,033
Forms in Wikipedia: 1,705
Tokens: 95,453
Covered forms: 53 (3.1%)
Missing forms: 1,652 (96.9%)
Covered tokens: 3,518 (3.7%)
Missing tokens: 91,935 (96.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 128,106
Forms in Wikipedia: 109,573
Tokens: 18,366,700
Covered forms: 46,034 (42.0%)
Missing forms: 63,539 (58.0%)
Covered tokens: 12,451,506 (67.8%)
Missing tokens: 5,915,194 (32.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 104
Forms in Wikipedia: 106,577
Tokens: 19,924,659
Covered forms: 77 (0.1%)
Missing forms: 106,500 (99.9%)
Covered tokens: 235,954 (1.2%)
Missing tokens: 19,688,705 (98.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 1,067
Forms in Wikipedia: 183,777
Tokens: 42,439,136
Covered forms: 847 (0.5%)
Missing forms: 182,930 (99.5%)
Covered tokens: 6,592,297 (15.5%)
Missing tokens: 35,846,839 (84.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 295,661
Forms in Wikipedia: 219,718
Tokens: 72,173,155
Covered forms: 71,097 (32.4%)
Missing forms: 148,621 (67.6%)
Covered tokens: 64,607,415 (89.5%)
Missing tokens: 7,565,740 (10.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 6,641
Forms in Wikipedia: 31,721
Tokens: 2,539,025
Covered forms: 1,104 (3.5%)
Missing forms: 30,617 (96.5%)
Covered tokens: 285,118 (11.2%)
Missing tokens: 2,253,907 (88.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 73
Forms in Wikipedia: 9,793
Tokens: 1,252,518
Covered forms: 20 (0.2%)
Missing forms: 9,773 (99.8%)
Covered tokens: 3,397 (0.3%)
Missing tokens: 1,249,121 (99.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 18
Forms in Wikipedia: 27,089
Tokens: 2,068,858
Covered forms: 15 (0.1%)
Missing forms: 27,074 (99.9%)
Covered tokens: 5,104 (0.2%)
Missing tokens: 2,063,754 (99.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 28
Forms in Wikipedia: 20,893
Tokens: 3,583,109
Covered forms: 20 (0.1%)
Missing forms: 20,873 (99.9%)
Covered tokens: 21,843 (0.6%)
Missing tokens: 3,561,266 (99.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 2,574
Forms in Wikipedia: 151,341
Tokens: 30,211,406
Covered forms: 1,511 (1.0%)
Missing forms: 149,830 (99.0%)
Covered tokens: 6,902,621 (22.8%)
Missing tokens: 23,308,785 (77.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 238,655
Forms in Wikipedia: 356,409
Tokens: 114,386,141
Covered forms: 26,816 (7.5%)
Missing forms: 329,593 (92.5%)
Covered tokens: 17,494,040 (15.3%)
Missing tokens: 96,892,101 (84.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection .
Forms in Wikidata: 7,941
Forms in Wikipedia: 17,576
Tokens: 4,872,849
Covered forms: 1,377 (7.8%)
Missing forms: 16,199 (92.2%)
Covered tokens: 2,282,909 (46.8%)
Missing tokens: 2,589,940 (53.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 55
Forms in Wikipedia: 60,377
Tokens: 75,656,151
Covered forms: 37 (0.1%)
Missing forms: 60,340 (99.9%)
Covered tokens: 3,191,611 (4.2%)
Missing tokens: 72,464,540 (95.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Graphs are temporarily unavailable due to technical issues.