Wikidata:WikiProject PCC Wikidata Pilot/Smithsonian Libraries

From Wikidata
Jump to navigation Jump to search
 Welcome Participants Projects Properties Resources 


Aim and Scope[edit]

The Smithsonian Libraries PCC Wikidata Pilot Project intended to connect library data with Wikidata and facilitate the creation of linked data focusing on the organization, collections, and research staff of the Smithsonian. This project has come to an end but individual projects have grown from this learning experience.

History of Sub-Projects[edit]

  • African Ethnic Groups
    • Items will be created, or, if already existing, enhanced, for the African ethnic group names currently used in local subject headings by the Warren M. Robbins Library of the National Museum of African Art.
  • Art and artists' files
  • Chinese Ancestor Portraits
    • Items will be created, or, if already existing, enhanced, for names matched to the Freer and Sackler Galleries collection of Chinese Ancestors portraits.
  • Dibner Library of the History of Science and Technology Portraits
    • Items will be created, or, if already existing, enhanced, for the sitters, artists, engravers, etc. featured in the Dibner Library of the History of Science and Technology's Scientific Identity website, to be supplemented later by additional portraits which had been omitted from the website for various reasons (not digitized at the time, acquired later, etc.).
  • Smithsonian Research Online
    • Items will be created to facilitate scholarly discovery of publication citations and full online editions for works by researchers and scholars affiliated with the Smithsonian, drawing from the Smithsonian Research Online database.
    • Smithsonian museums and research units (subproject)
    • Smithsonian Profiles (subproject)
      • Items will be created, or, if already existing, enhanced, for notable curatorial and research staff from the Smithsonian Profiles website.

Project Year-end report (Oct 2020-Sep 2021)[edit]

African ethnic group[edit]

Artists files[edit]

Chinese ancestors portraits[edit]

Dibner scientists portraits[edit]

Smithsonian Research Online[edit]

Timeline[edit]

Phase I: Data preps

Identifying and collection data sets for respective sub-projects, reviewing and selecting core and extended Wikidata properties for individual project needs. Organizing data and designing workflow via OpenRefine to confirm name entity's presence or absence in Wikidata.

Project name Date Status
African Ethnic Groups October-December 2020 Updates
Art and artists' files Spring-Fall 2020 Updates
Chinese Ancestors Portraits January 2020 -December 2020 Updates
Dibner Library of the History of Science and Technology Portraits October-December 2020 Updates
Smithsonian Research Online Updates

Phase II:

Creating new items in Wikidata, augmenting descriptions to existing Wikidata items based on the project needs.

Project name Date Status
African Ethnic Groups Updates
Art and artists' files Spring 2021- Updates
Chinese Ancestors Portraits Spring 2021- Updates
Dibner Library of the History of Science and Technology Portraits Spring 2021- Updates
Smithsonian Research Online

Updates

Phase III: Extracting Wikidata claims (statements), importing into local Wikidata instance (a Wikibase installation)

Phase IV: Wikibase federation collaboration

Contributors[edit]

  • Heidy Berthoud (Berthoudh), Head of Resource Description, Smithsonian Libraries and Archives

Additional team members are listed in respective sub-project pages.

Questions on Data Modeling via User Stories and WDQS[edit]

There were series of data modeling issues regarding organizational name changes

1) "Label" reflects the organization's "Official names", past and present.

2) Indication of date of name changes

3) Explicit and implicit relationships among units (museums and subcategories of art museum, postal museum, history museum)

4) Impact on queries and data model refinements

5) Reciprocal constraints

The Data Modeling via User Stories and WDQS examples intend to showcase various data models for the Smithsonian five sub-projects and its progressive data modeling process

Property Organization[edit]

Clarification Notes[edit]

Understanding the difference of "instance of" and "subclass of"[edit]

An "instance of (P31)" refers to a SPECIFIC thing - e.g. Seabiscuit is a SPECIFIC instance of a horse. The "Secretariat" entry has "instance of (P31)" for horse and has a property, "animal breed (P4743)" for Thoroughbred. Whereas "subclass of (P279)" is about hierarchies of concepts - like ‘horse is a subclass of mammal (Q7377)’.

Core Properties[edit]

These are the properties that all data contributed to Wikidata from Smithsonian projects must contain.

Property Value Note
Label text string Most clear, recognizable name for the individual or entity
Description text string brief description as specified by the subproject's rules
Alias text string other form(s) of name in use
instance of (P31) human (Q5),
organization (Q43229)
put yours here
sex or gender (P21) male (Q6581097)
female (Q6581072)
for human (Q5) when available
one or more significant property like
occupation (P106),
position held (P39),
field of work (P101)

astronomer (Q11063),
Emperor of China (Q268218),
mathematics (Q395)
Need to identify terms, but necessary for people because of Wikidata's rules about notoriety
one or more identifiers Each project should have at least an external identifier linking it to a Smithsonian record

All Properties Related to People[edit]

Property Usage Notes African Ethnic Groups Artists' Files Chinese Portraits Dibner Portraits Research Online
instance of (P31) core (ethnic group (Q41710)) core (human (Q5)) core (human (Q5)) core (human (Q5))
(sitter & artist),
core (artworks)
work of art (Q838948)
core (human (Q5))(person),
core organization (Q43229), etc (org)
occupation (P106) core extended core (sitter & artist) core
date of birth (P569) core core core (sitter & artist) extended
date of death (P570) core core core (sitter & artist) extended
sex or gender (P21) core core core (sitter & artist) extended
family name (P734) extended core (sitter & artist) nice
notable work (P800) extended extended (sitter & artist) extended
award received (P166) extended extended (sitter) extended
position held (P39) extended extended (sitter) core
place of birth (P19) extended extended (sitter & artist) extended
academic degree (P512) extended extended (sitter) extended (qualifier for educated at (P69))
start time (P580) extended extended (sitter; qualifier for educated at (P69)) extended (qualifier for educated at (P69), director / manager (P1037))
end time (P582) extended extended (sitter; qualifier for educated at (P69)) extended (qualifier for educated at (P69))
country of citizenship (P27) extended core extended (sitter & artist)
name in native language (P1559) extended extended (sitter & artist) nice
given name (P735) core (sitter & artist) nice
ISNI (P213) core (sitter) core (person, org)
VIAF ID (P214) core (sitter & artist) extended (org)
place of death (P20) extended extended extended
educated at (P69) extended (sitter & artist) extended
spouse (P26) extended nice
part of (P361) Use when ethnic group is itself a member of a larger ethnic group; make sure to create reciprocal "has part" relationship at entry for the larger group. core extended (artist corporate bodies) core
has part (P527) core extended (artist corporate bodies) extended (org)
described by source (P1343) core extended
described at URL (P973) core extended
location (P276) core core (artworks)
Library of Congress authority ID (P244) core core (sitter)
Smithsonian resource ID (P7851)
religion (P140) extended extended
floruit (P1317) extended extended
has works in the collection (P6379) extended extended (artist)
image (P18) extended extended (sitter)
birth name (P1477) extended extended (sitter & artist)
languages spoken, written or signed (P1412) extended extended (sitter)
student of (P1066) extended extended (sitter)
student (P802) extended extended (sitter)
field of work (P101) extended (sitter & artist) core
inception (P571) core (artworks) core (org)
affiliation (P1416) extended (sitter)
alternate names (P4970) extended (sitter & artist)
ORCID iD (P496) core
LinkedIn personal profile ID (P6634) nice
employer (P108) core
participant (P710) nice
native label (P1705) core
colonial name (proposed) core
colonized by (proposed) core
official language (P37) extended
different from (P1889) extended
population (P1082) extended
pseudonym (P742) core
family (P53) noble house NOT family name core
father (P22) extended extended (sitter & artist)
mother (P25) extended
sibling (P3373) extended
child (P40) extended extended (sitter & artist)
cause of death (P509) extended
manner of death (P1196) extended extended (sitter & artist)
place of burial (P119) extended extended (sitter & artist)
image of grave (P1442) extended
native language (P103) extended extended (sitter & artist)
temple name (P1785) extended
posthumous name (P1786) extended
type of kinship (P1039) extended
replaces (P1365) extended
replaced by (P1366) extended
follows (P155) extended
followed by (P156) extended
noble title (P97) extended
seal image (P158) extended
owner of (P1830) extended
residence (P551) extended extended (sitter & artist)
official residence (P263) extended
Eight Banner Register (P470) extended
ethnic group (P172) extended
military rank (P410) extended
series ordinal (P1545) extended
time period (P2348) extended
subclass of (P279) extended
era name (P6902) extended
depicted by (P1299) extended
reference URL (P854) extended extended
retrieved date (P813) extended
transliteration (P2440) extended
Möllendorff transliteration (P5139) extended
Pinyin transliteration (P1721) extended
Commons category (P373) extended extended
country (P17) extended (org)
located in the administrative territorial entity (P131) core (org)
movement (P135) extended (sitter & artist)
genre (P136) extended (artist, artworks)
creator (P170) core (artworks)
depicts (P180) extended (artworks)
materials used (P186) extended (artworks)
Union List of Artist Names ID (P245) core (artist)
member of (P463) extended (sitter & artist) nice
country of origin (P495) extended (artworks)
point in time (P585) extended (qualifier for educated at (P69))
parent organization (P749) core (org)
academic major (P812) extended (qualifier for educated at (P69))
official website (P856) core (org)
main subject (P921) core (artworks)
work location (P937) extended (sitter & artist)
doctoral thesis (P1026) extended (qualifier for educated at (P69))
director / manager (P1037) extended (org)
official name (P1448) nice (people), extended (org)
title (P1476) core (artworks)
for work (P1686) extended (qualifier for award received (P166))
FAST ID (P2163) core (sitter)
GRID ID (P2427) extended (org)
Crossref funder ID (P3153) extended (org)
on focus list of Wikimedia Project (P5008) core (sitter)
street address (P6375) extended (org)
curriculum vitae URL (P8214) nice

Properties Related to Exhibitions[edit]

  • Exhibition catalogs get a separate Q number from the Q number for the exhibition itself
  • Good model Wikidata item for a museum/art exhibition is the one for the Armory Show Q688909
Property Value Usage Note
instance of (P31) exhibition (Q464980) or art exhibition (Q667276) or temporary exhibition (Q29023906) required
image (P18) optional, if available
country (P17) required
located in the administrative territorial entity (P131) required
coordinate location (P625) optional, if available
start time (P580) required
end time (P582) required
organizer (P664) required
location (P276) required
main subject (P921) required
catalog (P972) optional, if available
movement (P135) optional, if available
participant (P710) optional, if available
genre (P136) optional, if available
curator (P1640) optional, if available
title (P1476) required
Commons category (P373) optional, if available

Properties of Organizations/Units/Departments[edit]

Property Value Usage Note Modeling questions
instance of (P31) some values from q that are required required structure of organizations e.g. museums and subclass of art museums and postal museum and history museums
parent organization (P749) Smithsonian Institution (Q131626) part of the core required part of and parent organization
parent organization (P749) give the Q for the level between this unit and Smithsonian Institution overall required if applicable
official name (P1448) required - or not required ? (have we decided what the source is of the official names?) date ranges of name changes?
GRID ID (P2427) optional
ISNI ID (P213) optional