Wikidata:WikiProject Infoboxes

WikiProject Infoboxes subpages:
  • Persons
  • Organizations
  • Events
  • Works
  • Terms
  • Places

The aim of Wikidata phase II is
  • to facilitate auto-translation to Wikipedia infobox templates.
The aim of this WikiProject is
  • to identify good examples of Wikipedia infoboxes and Wikidata items as use cases for phase II of Wikidata.
  • to map and harmonize Wikidata properties to common infobox parameters.
  • to suggest new Wikidata properties that are useful for auto-translation of Wikipedia infobox data.
  • to suggest and list Phase II bot activities for collecting data from specific infoboxes and external sources.

The inclusion of infoboxes and changes to existing infoboxes should be discussed on the infobox talk pages of the respective Wikipedias.


This is the list of entities (also known as domains, or high level entities) used by the Integrated Authority File (GND).

Type | German | Item
p | Person (individualisiert) | person (Q215627)
k | Körperschaft | organization (Q43229)
v | Veranstaltung | event (Q1656682)
w | Werk | work (Q386724)
s | Sachbegriff | term (Q1969448)
g | Geografikum | geographical object (Q618123)
n | Name (nicht individualisiert) | don't use: Ambiguity (Q1140419)

Note: The terms "person", "term" etc. are meant to generalize. Please don't take them literally. (Examples)
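
As an illustration, the table above could be held as a small lookup structure. This is only a sketch: the names GND_TYPE_TO_ITEM, DISCOURAGED_TYPES and main_type_item are hypothetical and not part of any Wikidata tool.

```python
# GND entity type codes mapped to the Wikidata items listed in the table above.
GND_TYPE_TO_ITEM = {
    "p": ("person", "Q215627"),
    "k": ("organization", "Q43229"),
    "v": ("event", "Q1656682"),
    "w": ("work", "Q386724"),
    "s": ("term", "Q1969448"),
    "g": ("geographical object", "Q618123"),
}

# Type "n" (non-individualized name) is marked "don't use" in the table.
DISCOURAGED_TYPES = {"n"}


def main_type_item(gnd_type: str) -> str:
    """Return the Wikidata item ID for a GND type code (hypothetical helper)."""
    code = gnd_type.strip().lower()
    if code in DISCOURAGED_TYPES:
        raise ValueError(f"GND type {code!r} is ambiguous and should not be used")
    try:
        return GND_TYPE_TO_ITEM[code][1]
    except KeyError:
        raise ValueError(f"unknown GND type code: {gnd_type!r}") from None
```

Rejecting "n" explicitly, rather than mapping it to the Ambiguity item, follows the "don't use" remark in the table.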


Properties
Infoboxes

Data sources / Quellen / Source

Data types

Authority control / Normdaten / Autorité

Authority control: Mockup 1 by Theopolisme
See Wikidata:List of properties#Authority control for related properties, and Authority control.js for a tool that automatically gathers authority control data from the Wikipedias.
English

In library science, authority control is a process that organizes library catalogs and bibliographic information by using a single, distinct identification code for each topic. Wikidata is itself a kind of authority file (library catalogue), since it connects names and terms with one unique number.

If the code is based on a hierarchical library classification scheme, it also provides unambiguous relationships between high-level entity types (classes) and their low-level sub-classes and instances. This is key to organizing the Wikidata knowledge base into an ontology for the semantic web, which makes machine analysis and auto-translation easier. Hierarchical classification may also give Wikidata bots a way to check that only relevant properties are assigned to each Wikidata item, especially if items that correspond to the same Wikipedia infobox are grouped into the same class. Assigning classification properties to items is difficult in the first release of the software, however, because "item" is the only available property datatype (free text is not supported), and only items that have Wikipedia articles are allowed under the Wikidata:Notability policy.
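
The kind of check a bot might run against a class hierarchy can be sketched as follows. Everything here is hypothetical: the class names, the property names and the helper functions are invented for illustration and do not describe an existing bot or an agreed classification.

```python
# Hypothetical class hierarchy: child class -> parent class (None = root).
SUBCLASS_OF = {
    "person": None,
    "politician": "person",
    "place": None,
}

# Hypothetical sets of properties considered relevant at each level.
ALLOWED_PROPERTIES = {
    "person": {"date of birth", "place of birth"},
    "politician": {"political party"},
    "place": {"coordinates"},
}


def allowed_for(cls):
    """Collect the properties allowed for a class, including inherited ones."""
    allowed = set()
    while cls is not None:
        allowed |= ALLOWED_PROPERTIES.get(cls, set())
        cls = SUBCLASS_OF.get(cls)  # walk up to the parent class
    return allowed


def irrelevant_properties(cls, used):
    """Return the properties used on an item that its class does not allow."""
    return set(used) - allowed_for(cls)
```

With these assumed tables, an item of class "politician" that carries "coordinates" would be flagged, while "date of birth" would be accepted because it is inherited from "person".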

German (translated)

An authority file or library classification is a directory of standardized names and terms. It is a controlled vocabulary that specifies which form of heading is to be used when cataloguing materials.

Incidentally, Wikidata is itself a kind of authority file, since it links names and terms to an identifier. Example: Q2516 = Helmut Schmidt (Bundeskanzler), ヘルムート・シュミット or Гельмут Шмидт. Unlike the lemma (article title) in Wikipedia, the identifier "Q2516" is fixed.

Authority files used in Wikipedia / In Wikipedia verwendete Normdaten

As of 2012, over 250,000 Wikipedia articles are already linked to authority records by means of the Authority control template, which supports identification schemes such as:

  • en:Template:Authority control parameters:
    • VIAF (All types; aggregates GND, LCCN, etc.; not an original source)
    • GND (All types; German, Austrian and Swiss system)
    • LCCN (People, American)
    • SELIBR (People, Swedish)
    • ORCID (People)
    • BNF (People, French)
    • BPN (People, Dutch)
    • RID (Researchers)
    • BIBSYS (People, Norwegian)
    • ULAN (Artists)
  • de:Vorlage:Normdaten parameters:
    • GND (All types; German, Austrian and Swiss system)
    • LCCN (People, American)
    • NDL (People, Japanese)
    • VIAF (All types; aggregates GND, LCCN, etc.; not an original source)
Book identification (not part of authority control) and library classification
  • en:Template:Infobox_book parameters:
    • ISBN (books)
    • OCLC (books)
    • Dewey (subject classification)
    • Congress (subject classification)

Wikidata main type of item

As a first approach to Wikidata authority control, the types (also known as domains or high-level entity types) of the Integrated Authority File (Gemeinsame Normdatei, GND) specification are used as the basis for Property:P107, Wikidata main type of item. See the table above. As of April 2012, a GND type and a library catalogue identification number have been assigned to 9.5 million items in the German national library catalogue.[1][2]

Example

On the German Wikipedia (de:), Helmut Schmidt is linked to the following library catalogue systems (authority files) through the Authority control (Normdaten) template:

  • GND type: p ("person")
  • GND identifier: 118608819
  • LCCN: n50006770
  • NDL: 00474593
  • VIAF: 12314986

Template parameters on German Wikipedia: {{Normdaten|TYP=p|GND=118608819|LCCN=n/50/6770|NDL=00474593|VIAF=12314986}}

See the corresponding page in the German national library catalogue: GND 118608819 (in German).
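
A minimal sketch of how a tool might read the template call above and derive the compact LCCN shown in the list. The functions parse_template and normalize_lccn are hypothetical helpers, not part of Authority control.js or any bot. The six-digit zero-padding of the serial number is an assumption inferred from the two LCCN spellings in this example (n/50/6770 and n50006770).

```python
import re


def parse_template(wikitext: str) -> dict:
    """Parse a flat {{Name|KEY=value|...}} call into a dict of named parameters.

    Nested templates and positional parameters are not handled.
    """
    match = re.fullmatch(r"\{\{\s*([^|{}]+?)\s*\|(.*)\}\}", wikitext.strip(), re.DOTALL)
    if match is None:
        raise ValueError("not a simple template call")
    params = {}
    for part in match.group(2).split("|"):
        key, sep, value = part.partition("=")
        if sep:  # skip positional parameters, which have no "="
            params[key.strip()] = value.strip()
    return params


def normalize_lccn(slashed: str) -> str:
    """Turn 'prefix/year/serial' into the compact form, padding the serial to six digits."""
    prefix, year, serial = slashed.split("/")
    return f"{prefix}{year}{int(serial):06d}"


params = parse_template(
    "{{Normdaten|TYP=p|GND=118608819|LCCN=n/50/6770|NDL=00474593|VIAF=12314986}}"
)
print(params["GND"])                   # 118608819
print(normalize_lccn(params["LCCN"]))  # n50006770
```

The identifiers are kept as strings throughout so that leading zeros, as in the NDL value, are preserved.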


Phase 2 technical information

Data model

For an introduction to how data is represented in Wikidata, see the Data model primer. The data model can be summarized as follows:

An entity (or data set) is one of the following three types of Wikidata pages:

  1. Item (a page in the main namespace) consisting of:
    1. Item identifier (number prefixed with q)
    2. Multilingual label ("names", incorrectly called "titles" in the user interface)
    3. Multilingual description (the combination of label+description in a certain language must be unique for each entity)
    4. Multilingual aliases ("also known as")
    5. Interwiki links
    6. Claims, consisting of:
      1. Statements, each consisting of:
        1. Property
        2. Property value
        3. Qualifiers (additional property values)
      2. References
  2. Property (a page in the Property namespace), consisting of:
    1. Property identifier (number prefixed with p)
    2. Multilingual property label
    3. Multilingual property description
    4. Multilingual property aliases ("also known as")
    5. Datatype*
  3. Query**, consisting of:
    1. Query identifier (number prefixed with y)
    2. Multilingual query label
    3. Multilingual query description
    4. Multilingual query aliases ("also known as")

*) Not all datatypes are yet deployed at Wikidata.org. See Special:ListDatatypes
**) Not yet deployed at Wikidata.org.
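
The item part of the outline above can be sketched as plain data classes. This is an illustration only: the class and field names are assumptions chosen to mirror the outline, not the actual Wikibase implementation, and find_label_conflicts is a hypothetical helper for the label+description uniqueness rule.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Statement:
    property_id: str                                   # e.g. "p107"
    value: str                                         # property value
    qualifiers: Dict[str, str] = field(default_factory=dict)


@dataclass
class Claim:
    # Mirrors the outline: a claim groups statements and references.
    statements: List[Statement] = field(default_factory=list)
    references: List[str] = field(default_factory=list)


@dataclass
class Item:
    item_id: str                                                   # e.g. "q2516"
    labels: Dict[str, str] = field(default_factory=dict)           # language -> label
    descriptions: Dict[str, str] = field(default_factory=dict)     # language -> description
    aliases: Dict[str, List[str]] = field(default_factory=dict)    # language -> aliases
    interwiki_links: Dict[str, str] = field(default_factory=dict)  # wiki -> page title
    claims: List[Claim] = field(default_factory=list)


def find_label_conflicts(items: List[Item], lang: str) -> List[Tuple[str, str]]:
    """Return pairs of item IDs that share the same label+description in `lang`."""
    seen: Dict[Tuple[str, str], str] = {}
    conflicts = []
    for item in items:
        label = item.labels.get(lang)
        description = item.descriptions.get(lang)
        if label is None or description is None:
            continue  # nothing to compare in this language
        key = (label, description)
        if key in seen:
            conflicts.append((seen[key], item.item_id))
        else:
            seen[key] = item.item_id
    return conflicts
```

For example, Helmut Schmidt could be modelled as Item("q2516", labels={"de": "Helmut Schmidt"}) with a claim holding Statement("p107", "q215627"), which follows the main type table earlier on this page.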

See also

References / Einzelnachweise / Références

  1. Entitätencodierung: Vergaberichtlinien - Kurzliste. (in German)
  2. GND Ontology - Namespace Document, version 2012-06-30 (in English).