Wikidata:WikiFactMine/Legacy dictionaries

From Wikidata
Jump to: navigation, search
WikiFactMine terrible logo.svg
WikiFactMine schematic as of June 2017

WikiFactMine is a ContentMine project to add referenced scientific facts to Wikidata. This page concerns the dictionaries in use to mid-2017.

Dictionaries[edit]

The dictionaries in use to August 2017 were of varying size and provenance. They are those given below as JSON, and they dated from 2016. See Github page for more.

Name URL JSON matched to Wikidata, fully YesYYesY or partially YesY Comment
cochrane https://github.com/ContentMine/dictionaries/blob/master/xml/cochrane.xml
https://github.com/ContentMine/dictionaries/blob/master/json/cochrane.json
YesY short list of terms that may be of interest to or about Cochrane Collaboration (Q1105202).
disease https://github.com/ContentMine/dictionaries/blob/master/xml/disease.xml
https://github.com/ContentMine/dictionaries/blob/master/json/disease.json
YesY "list of diseases, origin currently unknown perhaps wikidata"
endangered https://github.com/ContentMine/dictionaries/blob/master/json/endangered.json YesY 14.5 MB
epidemic https://github.com/ContentMine/dictionaries/blob/master/xml/epidemic.xml
https://github.com/ContentMine/dictionaries/blob/master/json/epidemic.json
YesY "very short list relating to epidemics"
funders https://github.com/ContentMine/dictionaries/blob/master/xml/funders.xml
https://github.com/ContentMine/dictionaries/blob/master/json/funders.json
YesY 1.5 Mb, "list of funders provided by CrossRef"
hgnc https://github.com/ContentMine/dictionaries/blob/master/xml/hgnc.xml
https://github.com/ContentMine/dictionaries/blob/master/json/hgnc.json
YesY 2.7 Mb, "list of human genes perhaps from NIH?"
inn https://github.com/ContentMine/dictionaries/blob/master/xml/inn.xml
https://github.com/ContentMine/dictionaries/blob/master/json/inn.json
YesY 234 KB, "list of generic drug names from ChEBI"
insecticides https://github.com/ContentMine/dictionaries/blob/master/json/insecticides.json YesY 41.9 KB
jax https://github.com/ContentMine/dictionaries/blob/master/xml/jax.xml
https://github.com/ContentMine/dictionaries/blob/master/json/jax.json
YesY 3.43 MB, "list of mouse genes ~ synbio - list of synthetic biology terms, handwritten"
taxdumpGenus https://github.com/ContentMine/dictionaries/blob/master/xml/taxdumpGenus.xml N/A 4.14 MB, list of taxonomic genera, source unknown
tropicalVirus https://github.com/ContentMine/dictionaries/blob/master/xml/tropicalVirus.xml
https://github.com/ContentMine/dictionaries/blob/master/json/tropicalVirus.json
YesY 672 Bytes, list of tropical viruses, handwritten
wikidatacountry https://github.com/ContentMine/dictionaries/blob/master/json/wikidatacountry.json YesYYesY 43.1 KB
wikidatagenus https://github.com/ContentMine/dictionaries/blob/master/json/wikidatagenus.json YesYYesY 44.4 MB

Fellowships dictionaries[edit]

Some other dictionaries were created for ContentMine fellowship projects. See for example Willighagen, Lars Gerard, Species co-occurrences from EuPMC articles related to pines