Wikidata:WikiProject Names

From Wikidata
Jump to: navigation, search

Other languages:
Deutsch • ‎English • ‎español • ‎فارسی • ‎français • ‎Հայերեն • ‎italiano • ‎日本語 • ‎македонски • ‎Nederlands • ‎polski • ‎português • ‎português do Brasil • ‎русский • ‎српски / srpski • ‎Türkçe • ‎українська • ‎中文

Home

 

Maintenance reports

 

Reference lists

 
First Names Dictionary

Purpose[edit]

This WikiProject aims to improve the structure of name related data in Wikidata. Priority is given to first names.

Participants[edit]

[+] Add yourself to the list

The participants listed below can be notified using the following template in discussions:

{{Ping project|Names}}

Help[edit]

See http://ashtree.eu/wordpress/prenom-wikidata/ (in French)

How to create a new item for a given name[edit]

How to clean up a given name item[edit]

items with P31 = Wikimedia disambiguation page (Q4167410)[edit]

Sometimes given name items include information about a similar family name or links to disambiguation pages. It needs to be decided what to do with the item depending on its content, linked items and already other available items. Options are:

  1. keep it as a disambiguation
  1. keep it as a given name (first name) item
  2. keep it as a name item
  3. keep it as a family name (surname) item

Items should not be re-purposed: the item should be kept as a disambiguation and new items created for given names and family names.

  • Uses of the item should be moved to (new) given name/family name items.
  • If the disambiguation empty remains without any links, eventually it will be deleted.

Given name items[edit]

For items that are for given names (first names), the following clean-up steps can be needed:

  • clean up instance of (P31)
  • add native label (P1705)
  • clean up labels (for items with native label (P1705) in Roman script, all labels in Roman script should be identical)
  • clean up descriptions (remove descriptions related to family names or disambiguation)
  • clean up aliases (languages in Roman script generally would not have any aliases for spelling variations).
  • clean up interwiki links:
    • if an article listed in the interwikis spells the name differently than the label of the item: create a new item for this variant of the name and move the interwiki there
    • if interwikis link to disambiguation pages: these should be moved to another item
  • clean up uses of the item:
    • if the name of the person is spelled differently in the item of the person using it, replace it in given name (P735) with the appropriate item
    • if the item is used with family name (P734), replace it with the appropriate item for the family name

How to clean up given name items (top-down approach)[edit]

Sample approach (by Jura):


Main elements[edit]

Main items[edit]

Properties[edit]

Title ID Data type Description Examples Inverse
given name P735 Item given name: first name or another given name of this person. Values used with the property shouldn't link disambiguations nor family names. George Washington <given name> George -
family name P734 Item family name: surname or last name of a person George Washington <family name> Washington -
birth name P1477 Monolingual text name at birth: full name of a person at birth, if different from their current, generally used name (samples: John Peter Doe for Joe Doe, Ann Smith for Ann Miller) Mark Twain <birth name> Samuel Langhorne Clemens (language: en) -
noble family P53 Item noble family: include dynasty and nobility houses Genghis Khan <noble family> Borjigin -
pseudonym P742 String pseudonym: alias used by someone or by which this person is universally known Mark Twain <pseudonym> Mark Twain -
noble title P97 Item royal or noble rank: titles held by the person William Mansfield, 1st Baron Sandhurst <noble title> Baron Sandhurst -
honorific prefix P511 Item honorific and title of honor: word or expression used before a name, in addressing or referring to a person Douglas Haig, 1st Earl Haig <honorific prefix> The Right Honourable -
family name identical to this first name P1533 Item last name that is the same as a given first name. Use on items for given names. Sylvain <family name identical to this first name> Sylvain -
name in native language P1559 Monolingual text name: name of a person in their native language Barack Obama <name in native language> Barack Hussein Obama II (language: en) -
given name version for other gender P1560 Item equivalent name (with respect to the meaning of the name) in the same language: female version of a male first name, male version of a female first name. Add primarily the closest matching one Riccardo <given name version for other gender> Riccarda -
name day P1750 Item name day: day of the year associated with a first/given name. A qualifier should be used to identify the calendar that is being used. Distinguish from "feast day" (P:P841) Lucy <name day> December 13 -
language of work or name P407 Item for works (for original language use P364 and for persons P103 and P1412) Charles <language of work or name> English -
writing system P282 Item writing system: alphabet, character set or other system of writing used by subject language William <writing system> Latin script -
second surname in Spanish name P1950 Item second or maternal family name in Spanish names (do not use for other double barrelled names) Gabriel García Márquez <second surname in Spanish name> Márquez -
significant event P793 Item key event: significant or notable events associated with the subject
Statistics about a first name: use qualifiers to indicate rank series ordinal (P1545) and/or occurrences quantity (P1114)
Cornelis <significant event> most frequent first names at birth in Rotterdam (1811-1913) -
significant event P793 Item key event: significant or notable events associated with the subject
Status in Iceland: use given name authorized in Iceland (Q26959205) as value.
Arnar <significant event> given name authorized in Iceland -

Uses[edit]

For a given name[edit]

Title ID Data type Description Examples Inverse
instance of P31 Item instance of: this item is a specific example and a member of that class. Not to be confused with Property:P279 (subclass of). John <instance of> male given name -
language of work or name P407 Item for works (for original language use P364 and for persons P103 and P1412) John <language of work or name> English -
said to be the same as P460 Item Wikimedia duplicated page: this item is said to be the same as that item, but the statement is disputed John <said to be the same as> Jean 460
part of P361 Item part: subject is a part of that object. Inverse property of "has part" (P527). Sylvain <part of> Sylvain has as part
family name identical to this first name P1533 Item last name that is the same as a given first name. Use on items for given names. Sylvain <family name identical to this first name> Sylvain -
given name version for other gender P1560 Item equivalent name (with respect to the meaning of the name) in the same language: female version of a male first name, male version of a female first name. Add primarily the closest matching one Riccardo <given name version for other gender> Riccarda 1560
opposite of P461 Item opposite and antonym: item that is the opposite of this item Jean <opposite of> Jean 461
  • Create a new item when none is available for a name.
  • Labels for given names should be the same in all languages with Roman script.
  • Alias: languages in Roman script generally would not have any aliases.
  • People with translated given names could have several items for the same given name.
  • People with several given names will have several values in given name (P735).
  • Avoid adding items with Wikimedia disambiguation page (Q4167410) or items that link to disambiguation pages in one of the languages.
  • The property should only be used on items for persons (humans or fictional humans).

For a family name/surname[edit]

Title ID Data type Description Examples Inverse
instance of P31 Item instance of: this item is a specific example and a member of that class. Not to be confused with Property:P279 (subclass of). Fisher <instance of> family name -
named after P138 Item eponym: entity or event that inspired the subject's name, or namesake (in at least one language) Fisher <named after> fisherman -
part of P361 Item part: subject is a part of that object. Inverse property of "has part" (P527). Sylvain <part of> Sylvain has as part
  • Labels for family name should be the same in all languages with Roman script.
  • Data in the property "family name" is not meant to indicate any sort of kinship between people with the same surname.
  • Avoid adding items with Wikimedia disambiguation page (Q4167410) or items that link to disambiguation pages in one of the languages.
  • The property should only be used on items for person (humans or fictional humans).

For a combined family name with given name[edit]

The approach is the following:

Specific naming conventions[edit]

Roman names[edit]

Roman names generally have the following parts: praenomen, nomen, and cognomen. Sometimes, an agnomen is used as well. → en:Roman naming conventions#The tria nomina.


Points being developed[edit]

For given names:

An item for each first name
create an item for every variation of a given name.
Labels
use identical labels across languages with Roman script. Tools like the new LabelLister or the list at Person names allows to check them and nameGuzzler to define them.
Descriptions
use standardized descriptions such as "male given name", "female given name". Tools like the new LabelLister can help check them and remove old ones, Autoedit can add standardized ones. This helps identify and resolve inconsistent or duplicate items. Normalize descriptions with terms used on given name (Q202444), male given name (Q12308941), female given name (Q11879590)
De-mix disambiguation items
avoid having items in given name (P735) with "disambiguation" in instance of (P31). Create new items for these given names instead.
Distinguish between items for given names and items for family names
separate items should be available for each. family name identical to this first name (P1533) can link them.
List related given names
similar first names from other languages can be listed with said to be the same as (P460).
Complete missing names in the main languages
Add_name_labels works fine for this
Distribution maps
maps from Commons added
Reasonator
improved handling of "see also" on given names (displaying languages)
Lady/Sir/Lord/…
given names for items with labels starting with "Sir", "Lord", "Lady", etc.
Japanese names
Romanized Japanese names can mix several names. Example: "Yuriko" for ゆりこ or ゆり子 or 百合子. Each of these three should have an item. An additional undifferentiated item can be used if it can't be determined which one applies and it can also link a possible article about all three at enwiki.
Pronunciation
link files available at Commons
Japanese names
description of the approach currently used

Statistics[edit]

date humans [1] first name [2]  % [3] [4] family names [5]  % [6] Todo[7]
2014-09-06 2,492,359 129,304 5.1 % 24,830 0.9 %
2014-09-27 2,525,530 563,129 22.3 % 16,545 7,103 30,902 1.2 % 38,821
2014-11-01 2,582,649 1,054,292 40.8 % 17,450 7,130 76,914 2.9 % 39,204
2014-11-29 2,630,939 1,385,520 52.7 % 17,559 7,068 96,326 3.7 % 40,090


2014-12-31 2,654,912 1,583,475 59.6 % 20,296 7,008 97,696 3.7 % 40,056
2015-01-31 2,692,070 1,611,459 59.9 % 20,613 6,972 99,745 3.7 % 40,088
2015-02-28 [8] 2,702,106 1,632,296 60.4 % 17,768 3,682 100,555 3.7 % 40,080
2015-03-31 2,762,253 1,735,275 62.8 % 18,522 3,743 105,934 3.8 % 40,245
2015-05-01 2,773,802 1,920,034 69.2 % 19,547 4,111 142,423 5.1 % 40,424
2015-05-31 2,797,258 2,000,204 71.5 % 23,185 3,679 145,961 5.2 % 40,382
2015-06-30 2,829,073 2,026,845 71.6 % 23,742 3,548 153,344 5.4 % 40,559
2015-08-02 2,846,056 2,047,949 72.0 % 23,935 3,416 163,252 5.7 % 40,678
2015-09-01 2,860,916 2,060,315 72.0 % 24,186 3,355 168,659 5.9 % 40,786
2015-10-01 2,900,428 2,087,605 72.0 % 24,659 3,324 169,926 5.9 % 43,966
2015-11-01 2,937,180 2,120,839 72.2 % 25,428 129 171,896 5.9 % 44,975 8.6%
2015-12-01 3,008,267 2,186,636 72.7 % 27,197 118 176,052 5.9 % 44,785 8.0%
2016-01-06 3,039,909 2,197,616 72.3 % 28,035 107 179,666 5.9 % 44,840 8.1%
2016-02-28 3,082,264 2,238,719 72.6%
  1. Humans: items with P31:Q5
  2. Humans with given name (P735) or P2358
  3. Total number of given names (used or not)
  4. mixed given name items. Ideally = 0. See also Constraint_violations/P735 and P734
  5. Humans with surnames (P734)
  6. surnames
  7. People with given names and nationalities in selected list
  8. data incomplete
done: 1,287,905 (91.9%) to do: 113,902 (8.1%)Circle frame white.svg
  •   done: 1,287,905 (91.9%)
  •   to do: 113,902 (8.1%)
with: 2,197,616 (72.3%) without: 842,293 (27.7%)Circle frame white.svg
  •   with: 2,197,616 (72.3%)
  •   without: 842,293 (27.7%)
1,000,000
2,000,000
3,000,000
4,000,000
08
2014
09
10
11
12
01
2015
02
03
04
05
06
07
08
09
10
11
12
  •   with given names
  •   without given names

Tasks and task forces[edit]

Regularly updated reports and dynamic lists[edit]

Points to develop[edit]

For given names:

How to cover combined given names?
Given names can be combined with a dash (sample: "Jean-Paul") or without a dash (sample: "John Paul"). For names like "Jean-Paul", there would probably be a single item in given name (P735). For "John Paul", there might be three: "John", "John Paul" and "Paul". Various names should probably be checked to match one or the other way.
How to ensure given names are correctly set for languages that add them after the family name (e.g. Chinese, Korean, Japanese, Hungarian)?
We will need to make sure that we don't place family names as given names for these. Sample: "Kim Jong-il" (Kim Jong-il (Q10665)) has the family name "Kim" and not a given name "Kim". Hungarian names are being worked on.
"Master of altar of the church"
review, possibly, localize names including "Master of.." etc.
Senior/Junior
Localize (or not) when included in name.
Module/infobox
build a sample infobox. Possible solution: add given name (P735) with "no value" and qualifier instance of (P31): Master of … (Q19968968)
Integrated Icelandic last names
tbd
Roman cognomen
determine how to handle them
Indian names
check names in the format <family name> - <given name>
instance of (P31)
define usage for items like mononymous person (Q2985549)
Inclusion of etymology
this should be included in a structured way. It is currently left to Wikipedia infobox and/or Wiktionary (July 2015 property proposal, May 2015 Wiktionary absorption proposal). Please don't add etymologies to descriptions instead.
Inclusion of the new transliteration properties
recently a series of transliteration properties became available, these should be included in the model.

Tools[edit]

See also[edit]