This page shows how well Wikidata's lexicographical data covers a corpus of each language. Unless the entry for the language says otherwise, the corpora are based on Wikipedia (available here).
These pages are updated weekly on Wednesdays by NikkiBot, although no edit is made if nothing has changed.
The code for the bot is at https://github.com/nikkiwd/lexcover and is based on the original PAWS notebook by Denny. Report issues to Nikki (either on User talk:Nikki or on Telegram). Requests for additional languages, improvements and suggestions are also welcome.
Words can be filtered out by adding them to the "Filter" subpage for the language (e.g. Wikidata:Lexicographical coverage/nb/Filter). The entries in the list can be customised, e.g. to add search links, by editing the "Missing/row" subpage (e.g. Wikidata:Lexicographical coverage/nb/Missing/row). It is also possible to add content before and after the list, e.g. to turn the output into a table, by editing the "Missing/head" and "Missing/foot" subpages.
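As a rough illustration of how the figures in each entry below relate to one another, here is a minimal Python sketch. It is not the bot's actual code (that lives in the lexcover repository). The function names, the exact-match comparison, and the choice to drop filtered words before counting are all assumptions made for this example.

```python
from collections import Counter


def coverage(corpus_counts, wikidata_forms, filtered=frozenset(), top_n=10):
    """Compare corpus word forms against a set of known forms.

    corpus_counts:  mapping of word form -> number of occurrences (tokens)
    wikidata_forms: set of form strings taken from Wikidata lexemes
    filtered:       words to ignore, e.g. from a "Filter" subpage
                    (assumption: they are dropped before any counting)
    """
    counts = {w: c for w, c in corpus_counts.items() if w not in filtered}
    # Assumption: a corpus form is "covered" on an exact string match.
    covered = {w: c for w, c in counts.items() if w in wikidata_forms}
    missing = Counter({w: c for w, c in counts.items() if w not in wikidata_forms})
    return {
        "forms": len(counts),
        "tokens": sum(counts.values()),
        "covered_forms": len(covered),
        "missing_forms": len(missing),
        "covered_tokens": sum(covered.values()),
        "missing_tokens": sum(missing.values()),
        "most_frequent_missing": missing.most_common(top_n),
    }


def percent(part, whole):
    """Format a share as a percentage with one decimal place."""
    return f"{100 * part / whole:.1f}%" if whole else "n/a"


# Toy corpus, purely illustrative.
corpus = Counter("the cat sat on the mat the end".split())
stats = coverage(corpus, wikidata_forms={"the", "cat", "dog"}, filtered={"end"})
print("Covered forms:", stats["covered_forms"],
      percent(stats["covered_forms"], stats["forms"]))
print("Covered tokens:", stats["covered_tokens"],
      percent(stats["covered_tokens"], stats["tokens"]))
```

The sketch shows why token coverage is usually much higher than form coverage: a few very frequent forms account for most tokens, so covering them raises the token percentage far more than the form percentage.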
More information:
More statistics:
Forms in Wikidata: 328
Forms in Wikipedia: 246,598
Tokens: 69,840,956
Covered forms: 87 (0.0%)
Missing forms: 246,511 (100.0%)
Covered tokens: 4,035,083 (5.8%)
Missing tokens: 65,805,873 (94.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 189
Forms in Wikipedia: 118,514
Tokens: 33,132,887
Covered forms: 166 (0.1%)
Missing forms: 118,348 (99.9%)
Covered tokens: 367,667 (1.1%)
Missing tokens: 32,765,220 (98.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 6,814
Forms in Wikipedia: 9,552
Tokens: 1,459,030
Covered forms: 1,282 (13.4%)
Missing forms: 8,270 (86.6%)
Covered tokens: 840,487 (57.6%)
Missing tokens: 618,543 (42.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
(This analysis was performed separately from all the others on this page, using the corpus linked here and custom counting code.)
Forms in Wikidata: 46,504
Forms in Wikipedia: 534,894
Tokens: 13,306,025
Covered forms: 13,900 (2.60%)
Missing forms: 520,994 (97.40%)
Covered tokens: 5,079,352 (38.17%)
Missing tokens: 8,226,673 (61.83%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 7
Forms in Wikipedia: 35,431
Tokens: 3,876,195
Covered forms: 4 (0.0%)
Missing forms: 35,427 (100.0%)
Covered tokens: 392 (0.0%)
Missing tokens: 3,875,803 (100.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 172
Forms in Wikipedia: 176,311
Tokens: 108,297,498
Covered forms: 127 (0.1%)
Missing forms: 176,184 (99.9%)
Covered tokens: 14,810,125 (13.7%)
Missing tokens: 93,487,373 (86.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 190,677
Forms in Wikipedia: 261,374
Tokens: 74,084,890
Covered forms: 45,489 (17.4%)
Missing forms: 215,885 (82.6%)
Covered tokens: 46,754,558 (63.1%)
Missing tokens: 27,330,332 (36.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 125
Forms in Wikipedia: 10,844
Tokens: 1,442,683
Covered forms: 79 (0.7%)
Missing forms: 10,765 (99.3%)
Covered tokens: 27,189 (1.9%)
Missing tokens: 1,415,494 (98.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 104,514
Forms in Wikipedia: 111,139
Tokens: 30,879,404
Covered forms: 30,238 (27.2%)
Missing forms: 80,901 (72.8%)
Covered tokens: 26,219,946 (84.9%)
Missing tokens: 4,659,458 (15.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 201,911
Forms in Wikipedia: 1,008,036
Tokens: 596,433,479
Covered forms: 107,957 (10.7%)
Missing forms: 900,079 (89.3%)
Covered tokens: 473,191,695 (79.3%)
Missing tokens: 123,241,784 (20.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 38,826
Forms in Wikipedia: 129,276
Tokens: 40,452,744
Covered forms: 16,755 (13.0%)
Missing forms: 112,521 (87.0%)
Covered tokens: 15,462,404 (38.2%)
Missing tokens: 24,990,340 (61.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 109,937
Forms in Wikipedia: 965,225
Tokens: 1,508,248,447
Covered forms: 75,978 (7.9%)
Missing forms: 889,247 (92.1%)
Covered tokens: 1,401,256,022 (92.9%)
Missing tokens: 106,992,425 (7.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 5,620
Forms in Wikipedia: 27,201
Tokens: 4,222,541
Covered forms: 2,474 (9.1%)
Missing forms: 24,727 (90.9%)
Covered tokens: 2,449,107 (58.0%)
Missing tokens: 1,773,434 (42.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 332,954
Forms in Wikipedia: 372,589
Tokens: 405,914,020
Covered forms: 67,259 (18.1%)
Missing forms: 305,330 (81.9%)
Covered tokens: 364,150,808 (89.7%)
Missing tokens: 41,763,212 (10.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 2,728,483
Forms in Wikipedia: 123,073
Tokens: 16,832,892
Covered forms: 72,698 (59.1%)
Missing forms: 50,375 (40.9%)
Covered tokens: 13,667,980 (81.2%)
Missing tokens: 3,164,912 (18.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 1,002,628
Forms in Wikipedia: 26,466
Tokens: 3,138,442
Covered forms: 16,168 (61.1%)
Missing forms: 10,298 (38.9%)
Covered tokens: 2,341,144 (74.6%)
Missing tokens: 797,298 (25.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 36,557
Forms in Wikipedia: 100,251
Tokens: 44,426,012
Covered forms: 7,910 (7.9%)
Missing forms: 92,341 (92.1%)
Covered tokens: 24,873,018 (56.0%)
Missing tokens: 19,552,994 (44.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 8,308
Forms in Wikipedia: 276,898
Tokens: 46,847,582
Covered forms: 4,913 (1.8%)
Missing forms: 271,985 (98.2%)
Covered tokens: 11,606,809 (24.8%)
Missing tokens: 35,240,773 (75.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 248,459
Forms in Wikipedia: 465,138
Tokens: 474,988,250
Covered forms: 52,085 (11.2%)
Missing forms: 413,053 (88.8%)
Covered tokens: 393,929,109 (82.9%)
Missing tokens: 81,059,141 (17.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 66
Forms in Wikipedia: 4,816
Tokens: 859,259
Covered forms: 31 (0.6%)
Missing forms: 4,785 (99.4%)
Covered tokens: 8,706 (1.0%)
Missing tokens: 850,553 (99.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 328,184
Forms in Wikipedia: 249,890
Tokens: 76,643,376
Covered forms: 54,457 (21.8%)
Missing forms: 195,433 (78.2%)
Covered tokens: 41,905,265 (54.7%)
Missing tokens: 34,738,111 (45.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 5,468
Forms in Wikipedia: 54,443
Tokens: 18,734,831
Covered forms: 2,418 (4.4%)
Missing forms: 52,025 (95.6%)
Covered tokens: 12,130,886 (64.8%)
Missing tokens: 6,603,945 (35.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 3,658
Forms in Wikipedia: 135,627
Tokens: 28,543,040
Covered forms: 2,195 (1.6%)
Missing forms: 133,432 (98.4%)
Covered tokens: 12,286,356 (43.0%)
Missing tokens: 16,256,684 (57.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 154
Forms in Wikipedia: 274,652
Tokens: 64,674,851
Covered forms: 100 (0.0%)
Missing forms: 274,552 (100.0%)
Covered tokens: 268,172 (0.4%)
Missing tokens: 64,406,679 (99.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 392,729
Forms in Wikipedia: 100,137
Tokens: 40,049,055
Covered forms: 16,454 (16.4%)
Missing forms: 83,683 (83.6%)
Covered tokens: 22,653,503 (56.6%)
Missing tokens: 17,395,552 (43.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 3,048
Forms in Wikipedia: 1,153
Tokens: 113,878
Covered forms: 410 (35.6%)
Missing forms: 743 (64.4%)
Covered tokens: 74,137 (65.1%)
Missing tokens: 39,741 (34.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 9,411
Forms in Wikipedia: 341,080
Tokens: 284,500,580
Covered forms: 8,231 (2.4%)
Missing forms: 332,849 (97.6%)
Covered tokens: 148,288,404 (52.1%)
Missing tokens: 136,212,176 (47.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 455
Forms in Wikipedia: 290,844
Tokens: 34,282,183
Covered forms: 376 (0.1%)
Missing forms: 290,468 (99.9%)
Covered tokens: 2,456,564 (7.2%)
Missing tokens: 31,825,619 (92.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 778,496
Forms in Wikipedia: 11,551
Tokens: 1,031,544
Covered forms: 8,308 (71.9%)
Missing forms: 3,243 (28.1%)
Covered tokens: 884,512 (85.7%)
Missing tokens: 147,032 (14.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 867
Forms in Wikipedia: 10,240
Tokens: 1,365,293
Covered forms: 363 (3.5%)
Missing forms: 9,877 (96.5%)
Covered tokens: 487,187 (35.7%)
Missing tokens: 878,106 (64.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 46
Forms in Wikipedia: 92,063
Tokens: 13,288,668
Covered forms: 25 (0.0%)
Missing forms: 92,038 (100.0%)
Covered tokens: 56,590 (0.4%)
Missing tokens: 13,232,078 (99.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 1,480
Forms in Wikipedia: 60,189
Tokens: 8,004,635
Covered forms: 848 (1.4%)
Missing forms: 59,341 (98.6%)
Covered tokens: 2,117,085 (26.4%)
Missing tokens: 5,887,550 (73.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 745,517
Forms in Wikipedia: 28,789
Tokens: 1,992,352
Covered forms: 8,532 (29.6%)
Missing forms: 20,257 (70.4%)
Covered tokens: 1,043,543 (52.4%)
Missing tokens: 948,809 (47.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 3,774
Forms in Wikipedia: 51,515
Tokens: 16,143,659
Covered forms: 3,056 (5.9%)
Missing forms: 48,459 (94.1%)
Covered tokens: 11,701,517 (72.5%)
Missing tokens: 4,442,142 (27.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 520
Forms in Wikipedia: 5,941
Tokens: 371,515
Covered forms: 272 (4.6%)
Missing forms: 5,669 (95.4%)
Covered tokens: 125,274 (33.7%)
Missing tokens: 246,241 (66.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 40
Forms in Wikipedia: 5,791
Tokens: 1,119,898
Covered forms: 2 (0.0%)
Missing forms: 5,789 (100.0%)
Covered tokens: 63 (0.0%)
Missing tokens: 1,119,835 (100.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 126,144
Forms in Wikipedia: 153,555
Tokens: 49,620,256
Covered forms: 43,860 (28.6%)
Missing forms: 109,695 (71.4%)
Covered tokens: 44,114,546 (88.9%)
Missing tokens: 5,505,710 (11.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 357
Forms in Wikipedia: 260,266
Tokens: 130,343,371
Covered forms: 275 (0.1%)
Missing forms: 259,991 (99.9%)
Covered tokens: 26,475,599 (20.3%)
Missing tokens: 103,867,772 (79.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 49,520
Forms in Wikipedia: 23,956
Tokens: 4,198,152
Covered forms: 7,114 (29.7%)
Missing forms: 16,842 (70.3%)
Covered tokens: 3,315,321 (79.0%)
Missing tokens: 882,831 (21.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 4,931
Forms in Wikipedia: 21,156
Tokens: 4,611,923
Covered forms: 1,703 (8.0%)
Missing forms: 19,453 (92.0%)
Covered tokens: 2,951,231 (64.0%)
Missing tokens: 1,660,692 (36.0%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 17,707
Forms in Wikipedia: 333,225
Tokens: 117,356,732
Covered forms: 6,827 (2.0%)
Missing forms: 326,398 (98.0%)
Covered tokens: 40,040,899 (34.1%)
Missing tokens: 77,315,833 (65.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 4,946
Forms in Wikipedia: 21,465
Tokens: 5,029,117
Covered forms: 1,025 (4.8%)
Missing forms: 20,440 (95.2%)
Covered tokens: 2,171,329 (43.2%)
Missing tokens: 2,857,788 (56.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 33,880
Forms in Wikipedia: 214,847
Tokens: 158,056,230
Covered forms: 13,463 (6.3%)
Missing forms: 201,384 (93.7%)
Covered tokens: 117,347,485 (74.2%)
Missing tokens: 40,708,745 (25.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 50
Forms in Wikipedia: 119,245
Tokens: 40,889,103
Covered forms: 41 (0.0%)
Missing forms: 119,204 (100.0%)
Covered tokens: 360,899 (0.9%)
Missing tokens: 40,528,204 (99.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 910,674
Forms in Wikipedia: 651,825
Tokens: 290,067,562
Covered forms: 138,679 (21.3%)
Missing forms: 513,146 (78.7%)
Covered tokens: 176,738,096 (60.9%)
Missing tokens: 113,329,466 (39.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 185
Forms in Wikipedia: 11,146
Tokens: 1,533,326
Covered forms: 44 (0.4%)
Missing forms: 11,102 (99.6%)
Covered tokens: 351,155 (22.9%)
Missing tokens: 1,182,171 (77.1%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 57,028
Forms in Wikipedia: 1,705
Tokens: 95,453
Covered forms: 50 (2.9%)
Missing forms: 1,655 (97.1%)
Covered tokens: 3,279 (3.4%)
Missing tokens: 92,174 (96.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 128,106
Forms in Wikipedia: 109,573
Tokens: 18,366,700
Covered forms: 46,034 (42.0%)
Missing forms: 63,539 (58.0%)
Covered tokens: 12,451,506 (67.8%)
Missing tokens: 5,915,194 (32.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 78
Forms in Wikipedia: 106,577
Tokens: 19,924,659
Covered forms: 76 (0.1%)
Missing forms: 106,501 (99.9%)
Covered tokens: 114,079 (0.6%)
Missing tokens: 19,810,580 (99.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 32
Forms in Wikipedia: 183,777
Tokens: 42,439,136
Covered forms: 23 (0.0%)
Missing forms: 183,754 (100.0%)
Covered tokens: 127,781 (0.3%)
Missing tokens: 42,311,355 (99.7%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 249,842
Forms in Wikipedia: 219,718
Tokens: 72,173,155
Covered forms: 64,904 (29.5%)
Missing forms: 154,814 (70.5%)
Covered tokens: 63,979,055 (88.6%)
Missing tokens: 8,194,100 (11.4%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 1,930
Forms in Wikipedia: 31,721
Tokens: 2,539,025
Covered forms: 330 (1.0%)
Missing forms: 31,391 (99.0%)
Covered tokens: 106,963 (4.2%)
Missing tokens: 2,432,062 (95.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 1,447
Forms in Wikipedia: 9,793
Tokens: 1,252,518
Covered forms: 431 (4.4%)
Missing forms: 9,362 (95.6%)
Covered tokens: 269,508 (21.5%)
Missing tokens: 983,010 (78.5%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 11
Forms in Wikipedia: 27,089
Tokens: 2,068,858
Covered forms: 10 (0.0%)
Missing forms: 27,079 (100.0%)
Covered tokens: 4,330 (0.2%)
Missing tokens: 2,064,528 (99.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 15
Forms in Wikipedia: 20,893
Tokens: 3,583,109
Covered forms: 7 (0.0%)
Missing forms: 20,886 (100.0%)
Covered tokens: 8,448 (0.2%)
Missing tokens: 3,574,661 (99.8%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 215
Forms in Wikipedia: 151,341
Tokens: 30,211,406
Covered forms: 159 (0.1%)
Missing forms: 151,182 (99.9%)
Covered tokens: 529,857 (1.8%)
Missing tokens: 29,681,549 (98.2%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 238,578
Forms in Wikipedia: 356,409
Tokens: 114,386,141
Covered forms: 26,762 (7.5%)
Missing forms: 329,647 (92.5%)
Covered tokens: 16,763,131 (14.7%)
Missing tokens: 97,623,010 (85.3%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 5,586
Forms in Wikipedia: 17,576
Tokens: 4,872,849
Covered forms: 1,163 (6.6%)
Missing forms: 16,413 (93.4%)
Covered tokens: 2,262,845 (46.4%)
Missing tokens: 2,610,004 (53.6%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 37
Forms in Wikipedia: 60,377
Tokens: 75,656,151
Covered forms: 28 (0.0%)
Missing forms: 60,349 (100.0%)
Covered tokens: 3,106,167 (4.1%)
Missing tokens: 72,549,984 (95.9%)
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.