Wikidata:Lexicographical data/Ideas of queries

From Wikidata
Jump to navigation Jump to search

This page is a list of maintenance and cool queries that could be built with lexicographical data on Wikidata.

Consider adding your signature to queries you insert to help users contact the author with questions, etc.

Maintenance and repairing[edit]


For more lexeme statistics, see Wikidata:Lexicographical_data/Statistics

LinguaLibre audio recording[edit]

You can use a wikidata query url with Form-ID in External Tools option in LinguaLibre, and then the User:Lingua Libre Bot will automatically add pronunciation audio file in the respective Form-id of lexeme entry. The query should have both ?id and ?label parameters representing form-id and form label respectively. You have to copy paste the "URL" to the query (from "</> Code" button or your address bar in the given form. Example queries:

  • All forms of a language missing pronunciation audio: query (replace with your language Q-id)
  • All lemma forms missing pronunciation: query (replace with your language Q-id)
  • Latest lexeme forms by a user: query (Replace with your own username)


Specific languages[edit]


  • A dynamic map of all the variations and pronunciation of the word "ki" (dog) depending on the geographical zone in Brittany (similar to this)


  • Antonyms which differs only in tone



  • peas. To quote Wiktionary (wikt:en:pease#English, CC BY-SA 3.0): “The original singular was pease, and the plural was peasen. Over the centuries, pease became used as the plural, peasen was dropped, pea was created as a new singular, and finally pease was respelled peas.” I have no clue how this will be modeled, but whatever it is I’m sure there’ll be some interesting queries to write about this and similar words :)
  • Acronyms deriving from latin words, like e. g., i. e., cf., etc.
  • Noun lexemes which do not have a plural form
  • Noun lexemes which have an invalid plural form (see en:English plurals and [2] for rules)
  • Verb lexemes which do not have a simple form (see [3])
  • Verb lexemes which do not have a 3rd person singular present tense form (see [4])
  • Verb lexemes which do not have a present participle and gerund form (see [5])
  • Regular verb lexemes which do not have a past tense and past participle form (see [6])


  • Out of all feminine French nouns how many end by an "e"?
    • query
    • Reverse: how many words end in "e" are feminine?
  • Words ending in "-té" are usually feminine (but "un comité"@fr) or in "-age" are masculine (but "une page"@fr *but* "un page" for the person), but what are the exceptions and how many are there? maybe search for a link with etymology (most words in -té are abstract and come from a latin in -tas, exceptions probably come from elsewhere)


  • Plurals ending in just “n” (without an “e” before the “n”), like „Triumvirn“.
    • query, now annotated with comments, including in its scope all varieties of German, limited to nomitative-case forms, and excluding words whose plural forms are not obtained by appending “n” but just happen to end in “n” (like
  • Verbs ending in just “n”, not “en”
  • Acronyms that don’t adopt the grammatical gender of the last word in the expanded form.
  • Word triples with the same pronunciation and different genders, like „der Coup“, „die Kuh“, „das Q“ (title of a book by CUS (Q1024556))
  • Word quintuples with one vowel exchanged, like „Wart/Wert/Wirt/Wort/Wurt“ (from that same book) or „Zacken/Zecken/Zicken/Zocken/Zucken“ (inspiration) (probably very expensive)


See Wikidata:Lexicographical data/Malayalam/Queries


  • Распределение существительных по классу склонения, с примерами


  • Lexemes without forms
  • Lexemes with forms without hyphenation