Wikidata:WikiProject Source MetaData

From Wikidata
Jump to: navigation, search
Translate this page; This page contains changes which are not marked for translation.

Other languages:
Deutsch • ‎English • ‎español • ‎français • ‎Bahasa Indonesia • ‎українська

WikiProject Source Metadata

The word Metadata in Wikidata Morse code.svg
Metadata is a love note to the future (8071729256) (cropped).jpg
WikiCite 2016, Berlin, May 25-26, 2016.

The aim of of the WikiProject Source Metadata is:

  • to act as a hub for work in Wikidata involving citation data and bibliographic data.
  • to define a set of properties that can be used by citations, infoboxes, and Wikisource.
  • to map and import all relevant metadata that currently is spread across Commons, Wikipedia, and Wikisource.
  • establish methods to interact with this metadata from different projects.
  • to create a large open bibliographic database within Wikidata.
  • to reveal, build, and maintain community stakeholdership for the inclusion and management of source metadata in wikidata.

There have been various proposals over the years for similar projects (see meta:WikiCite for details). Now that Wikidata is here, we can make it happen.

Timeline (from 1952 till present) of Wikidata items with publication date (P577) and with main subject (P921) being set to Zika virus (Q202864) and/ or Zika fever (Q8071861), as per this Wikidata list.

Properties[edit]

See this subpage for more details.

Projects[edit]

Examples[edit]

Here are some examples of the kind of entry this project could produce. (Note -- please replace with better samples if you know of them! -Pete F (talk) 17:23, 24 June 2015 (UTC))

«ГОСТ» examples:

Tasks[edit]

  • Clean up communication media (manifestation) and work trees
    • Restructure as appropriate
    • Fix items which should not appear in tree since they are actually instances, not subclasses of the work.
      • For instance, almost all of the items in the typeface tree, are instances, not subclass. Remove the property "subclass of" typeface and replace with "instance of" typeface. There are also similar issues in religious texts and others.
  • Annotate this table of standard citation source types (i.e. CSL, BibTex) with the appropriate Wikidata item.
    • If no item corresponds, create a new one.
    • Add a "subclass of" property pointing to an item from either the work or manifestation (communication media) trees.
  • Discuss and collaborate with the community to bring in more participation and input to relevant source metadata
  • Improve quality, coverage, and analysis of existing tools used to manage source metadata, identifiers, references, citations, etc.
    • Wikidata:Wikiproject_Source_MetaData/Tool_audit
      • Especially of interest are ways users and communities currently manage information that may ultimately be moved to tools and workflows that leverage Wikidata. Analyses of tools and protocols may thus imply potential equivalent replacements, or new tools, which should strive to improve the overall experience for these users and communities.

Possible Data Collaborators[edit]

ContentMine presentation, Wikimania 2014. Wikiwish: "An Open Bibliography of science, updated daily" (the first bulletpoint at 27:30)
Citing as a public service: presentation by User:DarTar at the 2015 Wikipedia Science Conference pitching Wikidata as an open bibliographic and citation data repository.

Some possible Data Collaborators have expressed interest on working on source metadata in Wikidata: others might usefully be approached.

OCLC, which runs WorldCat, is very keen on collaborating with Wikidata; User:Maximiliankleinoclc wrote a letter about the possibilities.

ContentMine has some excellent open software tools, which we could use to let Wikidata answer queries like "List all the review papers ever written on malaria vaccines", "List all the articles that mention Lygodactylus williamsii", "List every paper ever written by John Tuzo Wilson" and "List all the papers cited in Wikipedia articles that have been retracted". They listed "An Open Bibliography of science, updated daily" as a "wikiwish" at Wikimania 2014, apparently unaware that this project has been started at a slightly earlier workshop.

PLOS has an API for RichCitations, which contains metadata on all PLOS papers up through late 2014. Rich Citations is a novel structured format to express each citation as a data element, and it includes a set of useful, additional terms specific to scholarly literature that enable research about the knowledge web citations create. It also includes a display feature much like Reference Tooltips, but linked to a database (which is open licensed), so it can update metainformation. They presented at Wikimania 2014 and are keen to collaborate and share their results with us.

Zotero is interested in the idea of a proofread metadata source. Some Zotero users currently upload to cloud storage; we might build tools to let them upload here, instead. CiteseerX has a large open-licensed database of article metadata, and might want to set up an exchange, but have not responded to e-mails.

The Cochrane Collaboration is developing an API to its metadata (they were contacted about this project in July 2014, so this use case may have helped shape the API). They produce large amounts of non-conventional metadata on works they review, and on works they produce, both of which Wikimedians quote.

Institutional repositories are also increasingly interested in open APIs and linked databases, and seem generally receptive to this project. The university-run academic search engine BASE aggregates and normalizes these repositories and makes its data collection available for non-commercial purposes.

Resources[edit]

Contact[edit]

Participants[edit]

The participants listed below can be notified using the following template in discussions:

{{Ping project|Source MetaData}}

Historical discussions[edit]

There have been historical discussions about Wikidata hosting information about the sources of data.

See also[edit]