This page presents the coverage of Wikidata's lexicographical data compared with a corpus for each language. Unless the entry for a language says otherwise, the corpora are based on Wikipedia (available here).
These pages are updated weekly on Wednesdays by NikkiBot, although no edit is made if nothing has changed.
The code for the bot is at https://github.com/nikkiwd/lexcover and is based on the original PAWS notebook by Denny. Report issues to Nikki (either on User talk:Nikki or on Telegram). Requests for additional languages, improvements and suggestions are also welcome.
Words can be filtered out by adding them to the "Filter" subpage for the language (e.g. Wikidata:Lexicographical coverage/nb/Filter). The entries in the list can be customised, e.g. to add search links, by editing the "Missing/row" subpage (e.g. Wikidata:Lexicographical coverage/nb/Missing/row). It is also possible to add things before and after the list, e.g. if you want the output to be a table, by editing the "Missing/head" and "Missing/foot" subpages.
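As a rough illustration of what the statistics on this page measure, the following minimal sketch compares the distinct word forms of a corpus against a set of known forms and ignores anything on a filter list. It is not the bot's actual code: the function name `coverage`, its parameters, the toy data, and the assumption that filtered words are dropped before counting are all hypothetical.

```python
from collections import Counter

def coverage(corpus_counts, wikidata_forms, filtered=frozenset()):
    """Compare corpus word forms against a set of known forms.

    corpus_counts: Counter mapping each distinct form to its token count.
    wikidata_forms: set of forms known to Wikidata (hypothetical input).
    filtered: forms to ignore entirely (assumed effect of the Filter subpage).
    """
    # Drop filtered words before any counting (an assumption, see above).
    counts = Counter({f: n for f, n in corpus_counts.items() if f not in filtered})
    covered = {f for f in counts if f in wikidata_forms}
    total_tokens = sum(counts.values())
    covered_tokens = sum(counts[f] for f in covered)
    missing = Counter({f: n for f, n in counts.items() if f not in covered})
    return {
        "forms": len(counts),
        "tokens": total_tokens,
        "covered_forms": len(covered),
        "missing_forms": len(counts) - len(covered),
        "covered_tokens": covered_tokens,
        "missing_tokens": total_tokens - covered_tokens,
        "most_frequent_missing": missing.most_common(3),
    }

# Toy corpus: each key is a distinct form, each value its number of occurrences.
corpus = Counter({"the": 50, "cat": 5, "cats": 3, "http": 2, "xyzzy": 1})
stats = coverage(corpus, wikidata_forms={"the", "cat", "dog"}, filtered={"http"})
print(stats)
```

In this toy case a few very frequent forms account for most tokens, which is why token coverage can be far higher than form coverage in the statistics below.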
Forms in Wikidata: 1,315,979
Forms in Wikipedia: 246,598
Tokens: 69,840,956
Covered forms: 1,162 (0.5%)
Missing forms: 245,436 (99.5%)
Covered tokens: 2,099,753 (3.0%)
Missing tokens: 67,741,203 (97.0%)
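The percentages appear to be shares of the corpus forms and tokens, not of the forms in Wikidata, rounded to one decimal place. That reading is inferred from the figures themselves rather than from the bot's code. A small sketch that reproduces the numbers above under that assumption (the helper `pct` is hypothetical):

```python
def pct(part, whole):
    """Format part/whole as a percentage with one decimal place."""
    return f"{100 * part / whole:.1f}%"

# Figures taken from the statistics listed above.
forms_in_corpus = 246_598
covered_forms = 1_162
tokens = 69_840_956
covered_tokens = 2_099_753

# Missing counts are whatever remains after subtracting the covered counts.
missing_forms = forms_in_corpus - covered_forms
missing_tokens = tokens - covered_tokens

print("Covered forms:", pct(covered_forms, forms_in_corpus))
print("Missing forms:", missing_forms, pct(missing_forms, forms_in_corpus))
print("Covered tokens:", pct(covered_tokens, tokens))
print("Missing tokens:", missing_tokens, pct(missing_tokens, tokens))
```

This also explains how a language can have more forms in Wikidata than in the corpus and still show low coverage: only forms that actually occur in the corpus count as covered.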
Most frequent missing forms
Graphs are temporarily unavailable due to technical issues.
Forms in Wikidata: 236
Forms in Wikipedia: 118,514
Tokens: 33,132,887
Covered forms: 203 (0.2%)
Missing forms: 118,311 (99.8%)
Covered tokens: 778,430 (2.3%)
Missing tokens: 32,354,457 (97.7%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 11,372
Forms in Wikipedia: 9,552
Tokens: 1,459,030
Covered forms: 2,254 (23.6%)
Missing forms: 7,298 (76.4%)
Covered tokens: 1,108,768 (76.0%)
Missing tokens: 350,262 (24.0%)
Most frequent missing forms
(This analysis was performed separately from all the others on this page, using the corpus linked here and custom counting code.)
Forms in Wikidata: 46,504
Forms in Wikipedia: 534,894
Tokens: 13,306,025
Covered forms: 13,900 (2.60%)
Missing forms: 520,994 (97.40%)
Covered tokens: 5,079,352 (38.17%)
Missing tokens: 8,226,673 (61.83%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 7
Forms in Wikipedia: 35,431
Tokens: 3,876,195
Covered forms: 4 (0.0%)
Missing forms: 35,427 (100.0%)
Covered tokens: 392 (0.0%)
Missing tokens: 3,875,803 (100.0%)
Most frequent missing forms
Forms in Wikidata: 488
Forms in Wikipedia: 176,311
Tokens: 108,297,498
Covered forms: 335 (0.2%)
Missing forms: 175,976 (99.8%)
Covered tokens: 24,562,099 (22.7%)
Missing tokens: 83,735,399 (77.3%)
Most frequent missing forms
Forms in Wikidata: 218,567
Forms in Wikipedia: 261,374
Tokens: 74,084,890
Covered forms: 55,568 (21.3%)
Missing forms: 205,806 (78.7%)
Covered tokens: 52,413,118 (70.7%)
Missing tokens: 21,671,772 (29.3%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 169
Forms in Wikipedia: 10,844
Tokens: 1,442,683
Covered forms: 99 (0.9%)
Missing forms: 10,745 (99.1%)
Covered tokens: 120,454 (8.3%)
Missing tokens: 1,322,229 (91.7%)
Most frequent missing forms
Forms in Wikidata: 603,099
Forms in Wikipedia: 111,139
Tokens: 30,879,404
Covered forms: 64,023 (57.6%)
Missing forms: 47,116 (42.4%)
Covered tokens: 28,811,730 (93.3%)
Missing tokens: 2,067,674 (6.7%)
Most frequent missing forms
Forms in Wikidata: 789,308
Forms in Wikipedia: 1,008,036
Tokens: 596,433,479
Covered forms: 227,310 (22.5%)
Missing forms: 780,726 (77.5%)
Covered tokens: 508,537,869 (85.3%)
Missing tokens: 87,895,610 (14.7%)
Most frequent missing forms
Forms in Wikidata: 38,928
Forms in Wikipedia: 129,276
Tokens: 40,452,744
Covered forms: 16,802 (13.0%)
Missing forms: 112,474 (87.0%)
Covered tokens: 18,430,379 (45.6%)
Missing tokens: 22,022,365 (54.4%)
Most frequent missing forms
Forms in Wikidata: 121,142
Forms in Wikipedia: 965,225
Tokens: 1,508,248,447
Covered forms: 79,852 (8.3%)
Missing forms: 885,373 (91.7%)
Covered tokens: 1,406,993,840 (93.3%)
Missing tokens: 101,254,607 (6.7%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 9,025
Forms in Wikipedia: 27,201
Tokens: 4,222,541
Covered forms: 3,657 (13.4%)
Missing forms: 23,544 (86.6%)
Covered tokens: 2,686,846 (63.6%)
Missing tokens: 1,535,695 (36.4%)
Most frequent missing forms
Forms in Wikidata: 568,227
Forms in Wikipedia: 372,589
Tokens: 405,914,020
Covered forms: 93,727 (25.2%)
Missing forms: 278,862 (74.8%)
Covered tokens: 373,046,684 (91.9%)
Missing tokens: 32,867,336 (8.1%)
Most frequent missing forms
Forms in Wikidata: 2,637,231
Forms in Wikipedia: 123,073
Tokens: 16,832,892
Covered forms: 72,787 (59.1%)
Missing forms: 50,286 (40.9%)
Covered tokens: 13,703,830 (81.4%)
Missing tokens: 3,129,062 (18.6%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 1,002,661
Forms in Wikipedia: 26,466
Tokens: 3,138,442
Covered forms: 16,174 (61.1%)
Missing forms: 10,292 (38.9%)
Covered tokens: 2,381,388 (75.9%)
Missing tokens: 757,054 (24.1%)
Most frequent missing forms
Forms in Wikidata: 35,294
Forms in Wikipedia: 100,251
Tokens: 44,426,012
Covered forms: 7,091 (7.1%)
Missing forms: 93,160 (92.9%)
Covered tokens: 8,236,995 (18.5%)
Missing tokens: 36,189,017 (81.5%)
Most frequent missing forms
Forms in Wikidata: 9,605
Forms in Wikipedia: 276,898
Tokens: 46,847,582
Covered forms: 5,396 (1.9%)
Missing forms: 271,502 (98.1%)
Covered tokens: 13,016,760 (27.8%)
Missing tokens: 33,830,822 (72.2%)
Most frequent missing forms
Forms in Wikidata: 255,605
Forms in Wikipedia: 465,138
Tokens: 474,988,250
Covered forms: 56,905 (12.2%)
Missing forms: 408,233 (87.8%)
Covered tokens: 417,200,217 (87.8%)
Missing tokens: 57,788,033 (12.2%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 1,727
Forms in Wikipedia: 4,816
Tokens: 859,259
Covered forms: 611 (12.7%)
Missing forms: 4,205 (87.3%)
Covered tokens: 409,126 (47.6%)
Missing tokens: 450,133 (52.4%)
Most frequent missing forms
Forms in Wikidata: 316,373
Forms in Wikipedia: 249,890
Tokens: 76,643,376
Covered forms: 53,317 (21.3%)
Missing forms: 196,573 (78.7%)
Covered tokens: 43,977,544 (57.4%)
Missing tokens: 32,665,832 (42.6%)
Most frequent missing forms
Forms in Wikidata: 7,913
Forms in Wikipedia: 54,443
Tokens: 18,734,831
Covered forms: 3,172 (5.8%)
Missing forms: 51,271 (94.2%)
Covered tokens: 12,491,855 (66.7%)
Missing tokens: 6,242,976 (33.3%)
Most frequent missing forms
Forms in Wikidata: 5,719
Forms in Wikipedia: 135,627
Tokens: 28,543,040
Covered forms: 3,104 (2.3%)
Missing forms: 132,523 (97.7%)
Covered tokens: 13,725,613 (48.1%)
Missing tokens: 14,817,427 (51.9%)
Most frequent missing forms
Forms in Wikidata: 166
Forms in Wikipedia: 274,652
Tokens: 64,674,851
Covered forms: 105 (0.0%)
Missing forms: 274,547 (100.0%)
Covered tokens: 270,778 (0.4%)
Missing tokens: 64,404,073 (99.6%)
Most frequent missing forms
Forms in Wikidata: 393,773
Forms in Wikipedia: 100,137
Tokens: 40,049,055
Covered forms: 16,888 (16.9%)
Missing forms: 83,249 (83.1%)
Covered tokens: 23,028,968 (57.5%)
Missing tokens: 17,020,087 (42.5%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 3,862
Forms in Wikipedia: 1,153
Tokens: 113,878
Covered forms: 477 (41.4%)
Missing forms: 676 (58.6%)
Covered tokens: 80,844 (71.0%)
Missing tokens: 33,034 (29.0%)
Most frequent missing forms
Forms in Wikidata: 413,474
Forms in Wikipedia: 341,080
Tokens: 284,500,580
Covered forms: 102,453 (30.0%)
Missing forms: 238,627 (70.0%)
Covered tokens: 263,446,125 (92.6%)
Missing tokens: 21,054,455 (7.4%)
Most frequent missing forms
Forms in Wikidata: 1,041
Forms in Wikipedia: 290,844
Tokens: 34,282,183
Covered forms: 665 (0.2%)
Missing forms: 290,179 (99.8%)
Covered tokens: 2,643,552 (7.7%)
Missing tokens: 31,638,631 (92.3%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 778,975
Forms in Wikipedia: 11,551
Tokens: 1,031,544
Covered forms: 8,332 (72.1%)
Missing forms: 3,219 (27.9%)
Covered tokens: 884,298 (85.7%)
Missing tokens: 147,246 (14.3%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 1,054
Forms in Wikipedia: 10,240
Tokens: 1,365,293
Covered forms: 426 (4.2%)
Missing forms: 9,814 (95.8%)
Covered tokens: 543,520 (39.8%)
Missing tokens: 821,773 (60.2%)
Most frequent missing forms
Forms in Wikidata: 96
Forms in Wikipedia: 92,063
Tokens: 13,288,668
Covered forms: 46 (0.0%)
Missing forms: 92,017 (100.0%)
Covered tokens: 63,094 (0.5%)
Missing tokens: 13,225,574 (99.5%)
Most frequent missing forms
Forms in Wikidata: 3,023
Forms in Wikipedia: 60,189
Tokens: 8,004,635
Covered forms: 1,737 (2.9%)
Missing forms: 58,452 (97.1%)
Covered tokens: 2,827,052 (35.3%)
Missing tokens: 5,177,583 (64.7%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 749,595
Forms in Wikipedia: 28,789
Tokens: 1,992,352
Covered forms: 8,838 (30.7%)
Missing forms: 19,951 (69.3%)
Covered tokens: 1,074,470 (53.9%)
Missing tokens: 917,882 (46.1%)
Most frequent missing forms
Forms in Wikidata: 10,120
Forms in Wikipedia: 51,515
Tokens: 16,143,659
Covered forms: 6,609 (12.8%)
Missing forms: 44,906 (87.2%)
Covered tokens: 13,347,789 (82.7%)
Missing tokens: 2,795,870 (17.3%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 636
Forms in Wikipedia: 5,941
Tokens: 371,515
Covered forms: 298 (5.0%)
Missing forms: 5,643 (95.0%)
Covered tokens: 112,211 (30.2%)
Missing tokens: 259,304 (69.8%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 42
Forms in Wikipedia: 5,791
Tokens: 1,119,898
Covered forms: 2 (0.0%)
Missing forms: 5,789 (100.0%)
Covered tokens: 63 (0.0%)
Missing tokens: 1,119,835 (100.0%)
Most frequent missing forms
Forms in Wikidata: 165,289
Forms in Wikipedia: 153,555
Tokens: 49,620,256
Covered forms: 49,149 (32.0%)
Missing forms: 104,406 (68.0%)
Covered tokens: 44,422,279 (89.5%)
Missing tokens: 5,197,977 (10.5%)
Most frequent missing forms
Forms in Wikidata: 16,409
Forms in Wikipedia: 260,266
Tokens: 130,343,371
Covered forms: 8,968 (3.4%)
Missing forms: 251,298 (96.6%)
Covered tokens: 95,350,111 (73.2%)
Missing tokens: 34,993,260 (26.8%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 98,448
Forms in Wikipedia: 23,956
Tokens: 4,198,152
Covered forms: 9,244 (38.6%)
Missing forms: 14,712 (61.4%)
Covered tokens: 3,456,085 (82.3%)
Missing tokens: 742,067 (17.7%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 15,873
Forms in Wikipedia: 21,156
Tokens: 4,611,923
Covered forms: 3,360 (15.9%)
Missing forms: 17,796 (84.1%)
Covered tokens: 3,382,927 (73.4%)
Missing tokens: 1,228,996 (26.6%)
Most frequent missing forms
Forms in Wikidata: 19,675
Forms in Wikipedia: 333,225
Tokens: 117,356,732
Covered forms: 7,682 (2.3%)
Missing forms: 325,543 (97.7%)
Covered tokens: 43,491,447 (37.1%)
Missing tokens: 73,865,285 (62.9%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 15,919
Forms in Wikipedia: 21,465
Tokens: 5,029,117
Covered forms: 1,869 (8.7%)
Missing forms: 19,596 (91.3%)
Covered tokens: 2,442,132 (48.6%)
Missing tokens: 2,586,985 (51.4%)
Most frequent missing forms
Forms in Wikidata: 36,187
Forms in Wikipedia: 214,847
Tokens: 158,056,230
Covered forms: 14,617 (6.8%)
Missing forms: 200,230 (93.2%)
Covered tokens: 120,028,325 (75.9%)
Missing tokens: 38,027,905 (24.1%)
Most frequent missing forms
Forms in Wikidata: 119
Forms in Wikipedia: 119,245
Tokens: 40,889,103
Covered forms: 88 (0.1%)
Missing forms: 119,157 (99.9%)
Covered tokens: 504,492 (1.2%)
Missing tokens: 40,384,611 (98.8%)
Most frequent missing forms
Forms in Wikidata: 915,844
Forms in Wikipedia: 651,825
Tokens: 290,067,562
Covered forms: 141,596 (21.7%)
Missing forms: 510,229 (78.3%)
Covered tokens: 195,860,664 (67.5%)
Missing tokens: 94,206,898 (32.5%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 978
Forms in Wikipedia: 11,146
Tokens: 1,533,326
Covered forms: 81 (0.7%)
Missing forms: 11,065 (99.3%)
Covered tokens: 391,591 (25.5%)
Missing tokens: 1,141,735 (74.5%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 57,037
Forms in Wikipedia: 1,705
Tokens: 95,453
Covered forms: 53 (3.1%)
Missing forms: 1,652 (96.9%)
Covered tokens: 3,518 (3.7%)
Missing tokens: 91,935 (96.3%)
Most frequent missing forms
Forms in Wikidata: 128,106
Forms in Wikipedia: 109,573
Tokens: 18,366,700
Covered forms: 46,034 (42.0%)
Missing forms: 63,539 (58.0%)
Covered tokens: 12,451,506 (67.8%)
Missing tokens: 5,915,194 (32.2%)
Most frequent missing forms
Forms in Wikidata: 104
Forms in Wikipedia: 106,577
Tokens: 19,924,659
Covered forms: 77 (0.1%)
Missing forms: 106,500 (99.9%)
Covered tokens: 235,954 (1.2%)
Missing tokens: 19,688,705 (98.8%)
Most frequent missing forms
Forms in Wikidata: 1,072
Forms in Wikipedia: 183,777
Tokens: 42,439,136
Covered forms: 850 (0.5%)
Missing forms: 182,927 (99.5%)
Covered tokens: 6,594,649 (15.5%)
Missing tokens: 35,844,487 (84.5%)
Most frequent missing forms
Forms in Wikidata: 338,218
Forms in Wikipedia: 219,718
Tokens: 72,173,155
Covered forms: 75,750 (34.5%)
Missing forms: 143,968 (65.5%)
Covered tokens: 65,060,021 (90.1%)
Missing tokens: 7,113,134 (9.9%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 6,679
Forms in Wikipedia: 31,721
Tokens: 2,539,025
Covered forms: 1,113 (3.5%)
Missing forms: 30,608 (96.5%)
Covered tokens: 286,727 (11.3%)
Missing tokens: 2,252,298 (88.7%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 45
Forms in Wikipedia: 9,793
Tokens: 1,252,518
Covered forms: 6 (0.1%)
Missing forms: 9,787 (99.9%)
Covered tokens: 456 (0.0%)
Missing tokens: 1,252,062 (100.0%)
Most frequent missing forms
Forms in Wikidata: 21
Forms in Wikipedia: 27,089
Tokens: 2,068,858
Covered forms: 16 (0.1%)
Missing forms: 27,073 (99.9%)
Covered tokens: 5,122 (0.2%)
Missing tokens: 2,063,736 (99.8%)
Most frequent missing forms
Forms in Wikidata: 43
Forms in Wikipedia: 20,893
Tokens: 3,583,109
Covered forms: 27 (0.1%)
Missing forms: 20,866 (99.9%)
Covered tokens: 27,395 (0.8%)
Missing tokens: 3,555,714 (99.2%)
Most frequent missing forms
Forms in Wikidata: 2,630
Forms in Wikipedia: 151,341
Tokens: 30,211,406
Covered forms: 1,890 (1.2%)
Missing forms: 149,451 (98.8%)
Covered tokens: 12,256,017 (40.6%)
Missing tokens: 17,955,389 (59.4%)
Most frequent missing forms
Forms in Wikidata: 238,686
Forms in Wikipedia: 356,409
Tokens: 114,386,141
Covered forms: 26,832 (7.5%)
Missing forms: 329,577 (92.5%)
Covered tokens: 17,624,025 (15.4%)
Missing tokens: 96,762,116 (84.6%)
Most frequent missing forms
These statistics use corpus data from the Leipzig Corpora Collection.
Forms in Wikidata: 7,992
Forms in Wikipedia: 17,576
Tokens: 4,872,849
Covered forms: 1,388 (7.9%)
Missing forms: 16,188 (92.1%)
Covered tokens: 2,287,028 (46.9%)
Missing tokens: 2,585,821 (53.1%)
Most frequent missing forms
Forms in Wikidata: 73
Forms in Wikipedia: 60,377
Tokens: 75,656,151
Covered forms: 41 (0.1%)
Missing forms: 60,336 (99.9%)
Covered tokens: 3,250,357 (4.3%)
Missing tokens: 72,405,794 (95.7%)
Most frequent missing forms