Wikidata:Property proposal/JRC Names id


JRC Names id

Originally proposed at Wikidata:Property proposal/Authority control

   Done: JRC Names id (P6640) (Talk and documentation)
Description: ID in the JRC Names gazetteer, which provides spelling variants and EMM news about the entity (220k news items per day). Alias: JRCN
Represents: JRC Names (Q62084229)
Data type: External identifier
Domain: person, organization
Example 1: Johannes Bosco (Q146183) → 100006
Example 2: Tim Berners-Lee (Q80) → 61072
Example 3: Met Office (Q1851405) → 100029
Example 4: Dignitas Personae (Q631612) → 1000097
Format and edit filter validation: \d+
Number of IDs in source: 620,138 person names, 42,296 organization names
Expected completeness: always incomplete (Q21873886)
Formatter URL: http://emm.newsbrief.eu/NewsBrief/entityedition/en/$1.html
Robot and gadget jobs: Two download formats are available, see below
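
As a rough illustration of how the format constraint and the formatter URL above fit together, the following Python sketch checks an ID against \d+ and substitutes it for $1. The function name jrc_names_url is hypothetical and not part of any Wikidata tooling; only the pattern and the URL template come from this proposal.

```python
import re

# Format constraint and formatter URL as given in the proposal.
ID_PATTERN = re.compile(r"\d+")
FORMATTER_URL = "http://emm.newsbrief.eu/NewsBrief/entityedition/en/$1.html"


def jrc_names_url(jrc_id: str) -> str:
    """Return the entity profile URL for a JRC Names id.

    Hypothetical helper: raises ValueError if the id is not all digits.
    """
    if not ID_PATTERN.fullmatch(jrc_id):
        raise ValueError(f"not a valid JRC Names id: {jrc_id!r}")
    return FORMATTER_URL.replace("$1", jrc_id)


# Example 2 from the proposal: Tim Berners-Lee (Q80) has id 61072.
print(jrc_names_url("61072"))
# prints http://emm.newsbrief.eu/NewsBrief/entityedition/en/61072.html
```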

Motivation

JRC Names (JRCN) is a comprehensive resource of 620,138 person names and 42,296 organization names. It is related to EMM (Europe Media Monitor), which indexes 220k news items per day. For each entity, JRCN provides a profile page showing its name variants and the news about it.

Two download formats are available (see JRC Names (Q62084229)):

  • JRCN TSV is up to date and very simple, with four fields: id, lang (most are "u"), type (P for person, O for organization), and name (replace "+" with " ").
  • JRCN RDF was last updated in May 2015 and is a lot more complex, but it includes about 96k links on entities plus 243k non-disambiguated links to DBpedia, which can be leveraged to link to Wikidata:
    • DBpedia links on entities (total): 95721
    • DBpedia links on entities (strict): 64199
    • DBpedia links on entities (with disamb on type): 31427
    • DBpedia links on entities (with disamb on title): 95
    • DBpedia non disambiguated: 242912
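
A minimal sketch of reading the TSV format described above, in Python. It assumes the file is tab-separated with the four fields in the order listed (id, lang, type, name) and no quoting; the actual file layout should be checked before relying on this. The names JrcName and parse_jrc_tsv are hypothetical, and the two sample rows are constructed from the examples in this proposal rather than copied from the real download.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class JrcName(NamedTuple):
    jrc_id: str
    lang: str
    kind: str  # "P" (person) or "O" (organization)
    name: str


def parse_jrc_tsv(lines: Iterable[str]) -> Iterator[JrcName]:
    """Yield one JrcName per data row, with "+" in names turned into spaces."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Skip rows that do not have four fields with a numeric id.
        if len(row) != 4 or not row[0].isdigit():
            continue
        jrc_id, lang, kind, name = row
        yield JrcName(jrc_id, lang, kind, name.replace("+", " "))


# Constructed sample rows (assumed layout, not taken from the real file).
sample = io.StringIO("61072\tu\tP\tTim+Berners-Lee\n100029\tu\tO\tMet+Office\n")
for entry in parse_jrc_tsv(sample):
    print(entry.jrc_id, entry.kind, entry.name)
```

Grouping the yielded rows by jrc_id would collect all spelling variants of one entity, which is the form needed when matching names against Wikidata labels and aliases.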

Vladimir Alexiev (talk) 12:57, 17 March 2019 (UTC)

Discussion

  • Support David (talk) 06:59, 18 March 2019 (UTC)
@Vladimir Alexiev, ديفيد عادل وهبة خليل 2: JRC Names id (P6640) has been created. Regards, ZI Jony (Talk) 09:09, 30 March 2019 (UTC)