Wikidata:SourceMD/instructions

From Wikidata
Jump to navigation Jump to search

Typical use[edit]

Overview[edit]

  1. get 1 or more identifiers for a publication: a DOI, PMID, or PMC ID
  2. go to tools.wmflabs.org/sourcemd/
  3. put the identifiers 1 per line
  4. run
  5. check output in SourceMD if you wish, then proceed to go to QuickStatements
  6. run in Wikidata:QuickStatements
  7. done!
  8. check output in various Wikidata records if you wish
  9. address problems with further Wikidata editing if any identified

Collect media identifiers[edit]

SourceMD accepts input in these forms:

Traditional citation styles based on paper publishing may not list these identifiers. Citation systems for digital publishing may show them. Often the publication itself will list the media identifier.

Put the identifiers into the input box[edit]

List one identifier per line. Use only one identifier per publication. When in doubt, use the PubMed Central ID (PMCID).

100 SourceMD input field.png

SourceMD stages information for review[edit]

SourceMD takes the source identifier and returns source metadata to edit
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata


SourceMD collects information from off-wiki databases and formats it for inclusion into Wikidata.

The user can edit the text which SourceMD presents. Typically there is no reason to change anything.

SourceMD will provide different information from different identifiers. In publishing academic papers the following sequence of events happens:

  1. Publishers report to CrossRef that they have media to publish
  2. CrossRef assigns a DOI to the media and registers it in their database
  3. About one day later, PubMed checks to see if the media is in their index of medical publications. If it is, then they copy the Crossref data, get the doi, and assign a PMID to the work.
  4. About a week later, PubMed Central shares a free-to-read copy of the publication only if they have an agreement to publish it. If they publish, then they take the DOI and the PMID, and also they assign a PMC ID.

What this means for Wikidata is that if possible share the PMC ID. In this case Wikidata gets the PMC ID, the PMID, and the doi. Anyone sharing the PMID also gets the doi. Anyone sharing the doi only gets the doi.

Transfer data from SourceMD to QuickStatements[edit]

screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata


screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Run QuickStatements[edit]

screenshot of MediaWiki taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata


screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Consider output of QuickStatements[edit]

screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Special cases[edit]

Merge records[edit]

identify multiple Wikidata items for one publication[edit]

By error Wikidata may have more than one item for the same media. Correct this error by merging the items.

This error can happen with SourceMD by one person processing one set of identifiers, like a DOI, then another person processing another possible identifier, like a PMID. The tool could create different items.

Use the merge function[edit]

screenshot taken of Wikidata edit log for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Verify the merge[edit]

screenshot taken of Wikidata edit log for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot taken of Wikidata edit log for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot taken of Wikidata for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Changing the SourceMD formatting[edit]

screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Property applied in error[edit]

screenshot taken of Wikidata for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata

Duplicated field[edit]

screenshot taken of Wikidata for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot taken of Wikidata for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata



Other[edit]

screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata


screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata




screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata
screenshot of SourceMD tool taken for use in documentation of the SourceMD tool for generating structured data for citations in Wikidata