Wikidata talk:WikiProject Medicine

From Wikidata
Jump to: navigation, search



We should run this update on a few languages. The EMA drug if appears to be in 21 languages. Doc James (talk · contribs · email) (if I write on your page reply on mine) 00:56, 2 June 2013 (UTC)

We basically always have 2 options: Create an item that doesn't exist on Wikipedia, but is noteworthy on its own or wait for the datatype Multilingual text to be rolled out. Both can be translated in all languages. The item having the advantage that it can hold additional statements about e.g. a drug. I think you already said that we should use non-proprietary names. We could create them as items and put all the copyrighted sales names into a multilingual text field. --Tobias1984 (talk) 08:40, 2 June 2013 (UTC)
So the INN name is fairly consistent across languages. There are many brand names sometimes in a single language with these being determined by country and manufacturer.
We should have multilingual links for the EMA. I am still not 100% clear how Wikidata works. Will need to spend some time. Doc James (talk · contribs · email) (if I write on your page reply on mine) 13:43, 4 June 2013 (UTC)
I think everybody has a steep learning curve with Wikidata ;). I added the babel-template to your page which will enable you to view more languages and edit them (You can also switch between languages in the drop-down in the top-right of your screen). Each of the items (Q with number) can have labels and descriptions in each language. If you switch languages then items, statements and multi-lingual strings will be translated in your viewing language. Strings stay the same and numbers are displayed in the local number formatting.
We can make a language dependent link to EMA as soon as multi-lingual string becomes available as a datatype. Until that time we can just continue mapping the diseases and drug infobox. If those are finished that is going to take a huge workload of all language Wikipedias because we will generate the infobox centrally in every language and there will be no outdated numbers anymore :) --Tobias1984 (talk) 14:03, 4 June 2013 (UTC)
That sounds amazing. Blue Rasberry (talk) 15:53, 4 June 2013 (UTC)


Anatomy will be a harder subject (especially for me as a non-physician). It will definitely require much more human input than gathering strings from a database. We can for example say that the hypothalamus (Q164386) is a subclass of (P279) of diencephalon (Q192419) which is a subclass of (P279) of brain (Q1073). The brain could be defined as a subclass of (P279) of the nervous system (Q9404) and an instance of (P31) of organ (Q712378). It is up to us to find a good (and well sourced) classification for all of these things. Ideally we have to create as little properties as possible. Maybe we can tackle one anatomical subject first and find a good approach on how to handle it. --Tobias1984 (talk) 16:24, 4 June 2013 (UTC)

For the examples you give above, part of (P361) would seem to be more appropriate to me than 'subclass of'. The Terminologia Anatomica might be a good starting point, which I think is what the anatomical navigation templates on the English wikipedia are based on. Arcadian did a lot of work on that, he might be able to help further. --Wouterstomp (talk) 10:51, 5 June 2013 (UTC)
Sounds good. If the data is structured enough in the infobox we could make a bot request to transfer the data. If we can get an expert on board that would be great too. --Tobias1984 (talk) 11:15, 5 June 2013 (UTC)
Hello, is there any news about human anatomy data? I tried to get all data about human body (Q23852) through a traversal query (see here) but it seems not possible. --Floatingpurr (talk) 09:40, 20 September 2017 (UTC)
@Floatingpurr: Hello! Although not so huge change from 4 years ago, but there are advances. 2 properties are implemented, TA98 (P1323) and TA98 Latin term (P3982). You can get "list" of human anatomical parts by following code. Structural data is not yet implemented! --Was a bee (talk) 04:07, 23 September 2017 (UTC)
# Get human anatomical part list
SELECT ?item ?itemLabel ?TA98 ?TA98_Latin WHERE {
  ?item wdt:P1323 ?TA98 .
  ?item wdt:P3982 ?TA98_Latin .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
order by (?item)
Try it!
@Was a bee: Woah! That's a great solution for getting human anatomy. The structural data will be the icing on the cake! Thanks a lot : ) Floatingpurr (talk) 10:36, 25 September 2017 (UTC)

Reviewing properties[edit]

We currently have a lot of properties proposed that need about 3 support votes so they can be created. I know it is a little tedious especially for string properties that are anyway included in the infobox, but this process makes sure that we don't create too many or poorly defined properties. So if you have some time follow the links on the project page and look at the different proposals. --Tobias1984 (talk) 09:42, 6 June 2013 (UTC)

Added support for all the no-brainers. I was wondering about how properties such as route of administration would be handled: with certain allowed values, will it give some sort of drop-down menu that you can choose from? Or will it result in a mix of various descriptions of the same thing, e.g. intravenous, intravenous infusion, IV, intravenous catheter, venous, infusion, catheter, injection. And should it list the most usual routes or 'official' routes, or all possible routes (e.g. you could administer a drug such as morphine in lots of different ways). --Wouterstomp (talk) 10:11, 6 June 2013 (UTC)
A drop-down menu is planned. Currently you can enter anything but we can set up a bot to report weird entries (Its really easy to do: Template:Constraint). Probably the most commons ways are better than "any-thinkable-way". But we can also create a qualifier which distinguished between "most often" and "sometimes". We can add the things we work out to the documentation of the property, so people will know how to use it. And thank you for reviewing ;) --Tobias1984 (talk) 10:34, 6 June 2013 (UTC)

Question regarding organisation[edit]

Given that malaria (Q12156) is an instance of tropical disease (Q1345671) which is a subclass of disease (Q12136), should malaria also be an instance of disease? Also instance of and subclass of are often used interchangeably, some diseases are listed as instances and others as subclasses, what is correct here? --Wouterstomp (talk) 09:57, 6 June 2013 (UTC)

I think it is enough to add one instance of statement because malaria being a disease is already implied from the statement that malaria is a tropical disease and tropical diseases is a subclass of diaseses. I think that instance of is more correct. Either way we should make a guideline on the project page so we do everything consistently. --Tobias1984 (talk) 10:18, 6 June 2013 (UTC)
Some guidelines would definitely be helpful. Especially for borderline cases such as skin cancer (Q192102), which would be regarded by laymen as a disease (instance of) and by doctors as a (sub)class of diseases. Perhaps it could even be both in this case. --WS (talk) 10:47, 6 June 2013 (UTC)
I think we should definitely go with the professional opinion of the field. Ideally we can find some good sources and just follow what they have worked out. How about skin cancer (Q192102) subclass of disease and instance of cancer? --Tobias1984 (talk) 10:55, 6 June 2013 (UTC)

There is some discussion about this in a more general context over here: Wikidata:Requests for comment/How to classify items: lots of specific type properties or a few generic ones? --WS (talk) 13:10, 6 June 2013 (UTC)

I think that the organisation of diseases is not done well. As indicated above, there are diseases that are both instances of disease (Q12136) and subclasses of disease (Q12136), according to the accepted definitions of instance and subclass. Although this is not a modelling error per se, it does complicate the use of diseases. I was looking around and was unable to find even a simple statement of principles on what a disease is and how they should be organized. Is there such a statement, and I'm just unable to find it? Peter F. Patel-Schneider (talk) 13:11, 19 October 2015 (UTC)
@Peter F. Patel-Schneider: You have certainly found the right page to talk about the issue. The problem is that many people use instance of (P31) the wrong way and even remove correct subclass of (P279) statements. Some items have a history of unintentional back and forth editing of these 2 properties. In general the ontology should be built using subclass of (P279). But we are still far away from having good coverage of the existing disease ontologies. --Tobias1984 (talk) 15:59, 19 October 2015 (UTC)
@Tobias1984: What is the right way? I was unable to find a clear description of just how diseases are to be modelled. Peter F. Patel-Schneider (talk) 16:09, 19 October 2015 (UTC)
@Peter F. Patel-Schneider: We don't have any hard guidelines yet, but these two things should help (but feel free to ask more). Wikidata does not build 1 ontology but multiple ontologies. Different sources might classify diseases differently, so we need to add multiple statements to subclass of (P279). One branch of this ontology tree could be supported by 10 sources another one just by one. In total our ontology branches are built using millions of sources, because no single source could ever cover every piece of knowledge in the universe. - As a rough guideline you can look at which is a good first source to add if you add a p279 statement. You can also look at the disease items in the Reasonator. That tool pulls in information from related items and shows the ontology in the box called "Classification": --Tobias1984 (talk) 16:15, 19 October 2015 (UTC)
@Tobias1984: That's not what I was asking for. What I was trying to find out is how are diseases to be modelled in Wikidata? How is a particular disease to be related to malaria (Q12156)? What extra information is required for diseases? Peter F. Patel-Schneider (talk) 17:16, 19 October 2015 (UTC)
@Peter F. Patel-Schneider: According to disease ontology malaria (Q12156) is a subclass of (P279) of no label (Q18555201). That statement is already included in the item. But you might find sources that say something different. The other properties you can use in a similar way. For example if you press the "add" button below the last statement it will suggest you which properties don't have statements yet. There is an algorithm that compares items with similar statements and knows which are missing (for example an item about a person missing a birthdate). When I press that button for malaria (Q12156) I see that for example a statement with the ICD-9 code is missing. Feel free to keep asking until I say something that makes sense :) --Tobias1984 (talk) 17:40, 19 October 2015 (UTC)
@Tobias1984: I guess I was not specific enough. What I was looking for was how to set up instance and subclass links for diseases. For example, malaria has three such links. Are all three needed? What implications do these links have? What such links are needed for new diseases? (The reason that I ask is that I am interested in how Wikidata does metamodelling in general, and diseases appear to be a good exemplar. However, I am having trouble finding out how metamodelling is supposed to work for diseases.) Peter F. Patel-Schneider (talk) 17:53, 19 October 2015 (UTC)
@Peter F. Patel-Schneider: You could also start a discussion at Wikidata:WikiProject Ontology for some more expert advice on the ontology. - I don't think that malaria should have instance of (P31) statements because a disease is not an instance in the sense of a notable person being an instance of persons in general. Multiple statements in p279 are fine because they are different branches according to different sources. And that is a core concept of Wikidata. Is is a multi-ontology project. --Tobias1984 (talk) 19:30, 19 October 2015 (UTC)
@Tobias1984: There are two ways to be multi-ontologistic. One way is to have multiple domains, like diseases and colors. The other way is to have several ontologies of diseases. The first generally does not cause problems, although it is useful to have a common modelling methodology. The second can easily cause problems if the two ontologies are not correctly inter-related (or maybe correctly not inter-related). It appears to me that there are two ontologies of disease in Wikidata, one imported and one not, and they do not interact well at all, causing in particular the concept disease to have as instances both disease categories (populated) and particular diseases of individual entities (generally or completely unpopulated). To add to the confusion, there does not appear to be any discussion of the situation. Peter F. Patel-Schneider (talk) 13:41, 20 October 2015 (UTC)

──────────────────────────────────────────────────────────────────────────────────────────────────── @Peter F. Patel-Schneider: Wikidata is still a pretty young project, so not everything has been discussed in detail. There is so much to sort out that some areas have gotten little attention. - There are certainly problems with having several ontologies for diaseases. But that is the way Wikidata is built. The community is already developing a lot of tools, and will need to built more tools that will take into account this approach. - If you want to work on the whole disease ontology, be prepared that it will take a lot of work and discussions. Currently there are 9591 items in the subclass-tree of disease ([12136][][279]). --Tobias1984 (talk) 18:37, 20 October 2015 (UTC)

@Tobias1984: Sure, it is going to be work to come up with a better organization of diseases. However, how can I start the work without knowing why the current organization is how it is? Who can tell me how higher-level classes are supposed to work in general in Wikidata? I was hoping that this was recorded somewhere, and then hoping that someone who knew would respond. Peter F. Patel-Schneider (talk) 20:22, 20 October 2015 (UTC)
@Peter F. Patel-Schneider: As Tobias said, the current organization is how it is because semi-anonymous volunteers designed it to be so with the same context which you are finding. I encourage you to continue to ask questions and seek answers. One might say that project knowledge is transmitted through the Internet social custom called "lurk moar".
If you want to ask theoretical questions about the project, consider posting to Wikidata:Project chat. I see in the Wikidata-l mailing list that you already met some of the leading thinkers in the project and that you have read through An Ambitious Wikidata Tutorial. You seem like an insightful person - I expect that your intuition about the development of Wikidata is probably correct, whatever you might be thinking. In the mailing list you seem to have personal introductions to individuals with whom you might talk. I am not sure what else to provide you. What more would you request? Blue Rasberry (talk) 14:10, 21 October 2015 (UTC)
@Blue Rasberry: OK, I'll try the chat. I guess that I'm not finding a basic statement of modelling principles because there just isn't one. Peter F. Patel-Schneider (talk) 14:55, 21 October 2015 (UTC)

Route of administration[edit]

Is there a website that we can use as a source for most of the route of administration claims? Maybe even structured enough to do a bot-import? --Tobias1984 (talk) 07:37, 18 June 2013 (UTC)

Possibly Drugbank? It seems pretty comprehensive, even listing a soap for topical administration of morphine. --WS (talk) 15:32, 18 June 2013 (UTC)
You could try Drugs at FDA. See [2] for an example. Remember (talk) 16:56, 16 July 2013 (UTC)

Generating data about medicine across languages[edit]

I want to know how many medicine related articles are in all languages of Wikipedia. We have information for English here [3] which comes from the WP:MED templates on talk pages. Can we use the interlanguage links of these articles to determine the number in other languages? I realize that this would only be a rough estimate.

Additionally I am interested in having WPMED tags added to the talk pages of medical related articles in other languages. I see this as a first step to generating sums of page views for medicine like we have in En in other languages. Thoughts? Doc James (talk · contribs · email) (if I write on your page reply on mine) 10:43, 5 July 2013 (UTC)

Those sound like some interesting statistics. The current problem is that we need properties in the items that we can query. In the case of the diseases-list, Byrial queried all the items that have a MeSH code assigned. So our first priority is getting the properties of the different medical infoboxes ready (diseases infobox is almost done). --Tobias1984 (talk) 21:33, 5 July 2013 (UTC)

Copied from the project chat: --Tobias1984 (talk) 16:22, 8 July 2013 (UTC)

Analytics (lightning Kraken demo) - Andrew Otto (remote) / Evan Rosen - 3 minutes. WMF Metrics and activities meetings/2012-12-06

There is a project for better page view statistics, with the ominous name "Kraken" (Datenkraken?). See:

  • meta:Glossary#K, Kraken: the upcoming data services platform that the Wikimedia Foundation's Analytics team is working on. It will allow interested persons to query data to answer their questions about Wikimedia projects and users.
  • mw:Analytics/Kraken and mw:Analytics/Kraken/Blurbs
  • commons:File:Report_on_requirements_for_usage_and_reuse_statistics_for_GLAM_content.pdf (June 2013) "there is clearly a need for GLAM-related statistics, for research purposes, but mainly to enable GLAMs to use Wikimedia Projects as a mature distribution channel for their collections. (...) The next steps are up to the analytics team of the Wikimedia Foundation to start gathering the information required (using Kraken) and work on a page (using Limn) to display that information. The GLAM toolset project will continually and regularly liaise with the Wikimedia Foundation to ensure this development is prioritised".

I am quite worried about this kraken-data. When german Wikipedia prominently linked to page view statistics on every Wikipedia page, this resulted in SEO spammers using this tool to determin popular target articles for spamming their links on WP. Apparently now the GLAM community (composed of well-funded institutions with PR departments) is pushing for these analytics to be built by WMF (funded by donations for Wikipedia). Before that, the ISP partners for Wikipedia Zero (and WMF marketing) were asking for special page views statistics, including Saudi Telecom ("No, we never talked about censorship"). And while the toolserver had a privacy policy, that is not the case with kraken, afaik. Pandora box, anyone? --Atlasowa (talk) 13:02, 8 July 2013 (UTC)

There is also this. But currently limited to 500 results.[486] --Tobias1984 (talk) 07:45, 9 July 2013 (UTC)
We already list the 1500 most viewed medical articles here and it is updated monthly. [4] We have been doing this for years. The more viewed pages are also the more watched. So do not understand the issue? Doc James (talk · contribs · email) (if I write on your page reply on mine) 19:36, 9 July 2013 (UTC)

bot maintenance of drug and disease items[edit]

Hello all, I would like to start plans to write a bot that will maintain all drug and disease items, keeping in sync with source databases. Realizing there are many other people here (including bot owners) who are interested in that data, I want to be sure we're not stepping on any toes. By way of brief introduction, I run a biology research group and one of our main projects has been to maintain the ~10,000 gene/protein infoboxes on Wikipedia. These infoboxes are bot-updated in near-real-time with source databases so that all the data shown stays current. (See w:Portal:Gene Wiki and w:User:ProteinBoxBot for more info.) From that existing effort, we're trying to move in two directions. First, we want to move all that gene/protein data from Wikipedia templates to Wikidata -- that effort is being coordinated over at WD:MBTF. Second, we want to expand to disease and drug infoboxes, and we've done some very early prototyping at w:User:ProteinBoxBot/Phase_3. Anyway, this is another natural community to interact with, so we want to make sure we're coordinating with everyone else in this space.

Just in terms of first actions, I will add a column on the property tables shown on Wikidata:Medicine task force for "Data source". The goal is to identify as few resources as possible that contain the listed mappings and annotations in some structured format. We'd certainly welcome help compiling these data sources. And obviously, any other comments are welcome too! Cheers, Andrew Su (talk) 18:59, 16 July 2013 (UTC)

Hey Andrew! Thank you for your update. I don't think this project has a dedicated bot-operator yet, so your bot would be more than welcome. You could add the bot to the participants list so that people know with whom to coordinate their bot-task operations. --Tobias1984 (talk) 20:44, 16 July 2013 (UTC)
Done! Cheers, Andrew Su (talk) 22:22, 17 July 2013 (UTC)

Just a quick update on bot tasks. Kompakt is going to import the ICD-9 and ICD-10 codes for us. Does anyone know which source we should use for ICD-9? Please post at Wikidata:Bot_requests#ICD_9_.26_ICD_10 --Tobias1984 (talk) 08:45, 17 July 2013 (UTC)

Cool! Though I think long-term, we want to use content directly from some authoritative source (like Human Disease Ontology) rather than importing from Wikipedia infoboxes. (Longer term, that's something that our bot can do.) As for a source for ICD-9 codes, you could use one of these files (linked from [5]). Cheers, Andrew Su (talk) 22:22, 17 July 2013 (UTC)

links between genes/proteins, diseases, and drugs[edit]

I just started a discussion at WT:MBTF on how to add links between genes/proteins, diseases, and drugs. Input from this community would obviously be welcome and appreciated! Cheers, Andrew Su (talk)

French infobox[edit]

Hi everyone.

I don't really understand what this project involves, and if our work will have an impact on it.

To sum up the situation, on the french Wikipedia we are currently discussing to change, add or remove some parameters of the infobox disease. One of the contributors warned us about this project on Wikidata which we hadn't heard of.

So now we're wondering if our current work will have an impact on this projet, and if we have to take special precaution (we didn't really understood what was the point of this projet).

Thanks for your answers. --Woozz un problème? 09:28, 25 July 2013 (UTC)

Hi Woozz! I read the discussion you linked. My french is only good enough for reading, so I'm writing in English. Basically this project just tries to store and centralize different pieces of information. Those pieces can be used by all Wikipedias using special templates. It is important to mention that that doesn't mean that all Wikipedias have to have the same Infobox layout. We should try to gather the data for the new infobox here, so it can be used by other languages too. I looked at your example and we just have to propose the properties to store that information. I already proposed some of them here: Wikidata:Property_proposal/Term#medical_discipline. It would be helpful if a few people from the French project would look at how Wikidata works. I can help out with all the questions you have, so just ask them here. --Tobias1984 (talk) 14:19, 25 July 2013 (UTC)
Hi Woozz, coincidentally, almost the same discussion is going on on the English Wikipedia at wikipedia:en:Wikipedia talk:WikiProject Medicine#Infoboxes_-_any_consensus_for_changes.3F. If any parameters added are common to the both language wikipedias, all data added can be shared between the two languages with the help of wikidata. --WS (talk) 14:39, 25 July 2013 (UTC)

ICD 9 and 10[edit]

User:Kompakt has imported a few thousand ICD-9 and ICD-10 codes for us. The constraint violations are also updated and can be checked. --Tobias1984 (talk) 09:38, 4 August 2013 (UTC)

drug-drug interaction[edit]

We need to have a quick discussion on how we will use qualifiers for the property significant drug interaction (P769). We have to find a few qualifiers that should fulfill the following criteria:

  • Semantic
  • General enough to have wide applicability (e.g. a property "color" is better than "bird color" or "car color")
  • Specific enough to describe detailed interactions
  • Work for the other interactions discussed here: link between genes, proteins and drugs
  • Potentially work all over Wikidata

My rough idea for warfarin (Q407431) would look something like this:

  • increases chance of = bleeding (item datatype)
  • decreases chance of = absorption (item datatype)
  • occurs in number of cases = 30 % (this is an example of a numeric qualifier that could state statistical data)

But it is obviously more complicated than that. Best way as usual is to think of what kind of queries we would like to have answered. Somebody could query Wikidata on the above example: "What are drugs that influence warfarin (Q407431) and increase the likelihood of bleeding" and the database would return ticlopidine (Q420571). --Tobias1984 (talk) 19:38, 6 August 2013 (UTC)

Re: Tobias1984 I agree and think that the work we have done on a semantic model for drug safety statements can apply here. Specifically, I think the following qualifiers may be useful:
  • PharmacodynamicImpact - Information on the pharmacodynamic impact of a drug-drug interaction.
  • drug-toxicity-risk-increased - The drug-drug interaction is associated with an increased risk of toxicity.
  • drug-toxicity-risk-decreased - The drug-drug interaction is associated with an decreased risk of toxicity.
  • drug-efficacy-increased-from-baseline - The drug-drug interaction is associated with an increase in the efficacy of the drug.
  • drug-efficacy-decreased-from-baseline - The drug-drug interaction is associated with a decrease in the efficacy of the drug
  • influences-drug-response - The drug-drug interaction influences drug response
  • not-important - The drug-drug interaction is not associated with a clinically relevant pharmacodynamic effect
  • PharmacokineticImpact - Information on the pharmacokinetic impact of a drug-drug interaction.
  • absorption-increase - The drug-drug interaction is associated with an increase in absorption of the drug.
  • absorption-decrease - The drug-drug interaction is associated with a decrease in absorption of the drug.
  • distribution-increase - The drug-drug interaction is associated with a increase in distribution of the drug
  • distribution-decrease - The drug-drug interaction is associated with a decrease in distribution of the drug.
  • metabolism-increase - The drug-drug interaction is associated with a increase in metabolism of the drug
  • metabolism-decrease - The drug-drug interaction is associated with a decrease in metabolism of the drug.
  • excretion-increase - The drug-drug interaction is associated with a increase in excretion of the drug
  • excretion-decrease - The drug-drug interaction is associated with a decrease in excretion of the drug
  • not-important - The drug-drug interaction is not associated any clinically relevant pharmacokinetic with respect to the drug. "

So, for warfarin (Q407431) it would look something like this:

  • PharmacokineticImpact = absorption-decrease

Other qualifiers to begin with might come from the simple model found at [Meds] and could include mechanism, related drugs, and options (i.e., therapeutic options). We would need to discuss evidence grading separately because there are two kinds of evidence; the evidence that the interaction exists, and the evidence for patient harm/benefit. The definition of significant drug interaction (P769) assumes that the evidence for the existence of the interaction is sufficient to report publicaly. The latter evidence axis is much more challenging.Boycer (talk) 17:44, 9 August 2013 (UTC)

I like your approach too. Just a small concern is that we should put increase and decrease in the qualifiers. Otherwise we have to create two items for every item we would like to link to. In your example we have an item for "absorption" but no items for "absorption-increase" and "absorption-decrease". I would favor (your example):
  • PharmacokineticImpact decreases = absorption
What is your opinion on that? --Tobias1984 (talk) 15:10, 11 August 2013 (UTC)

Parasites and diseases they cause[edit]

We have a few parasites that have icd and other codes assigned to them. Wuchereria bancrofti (Q311109) for example causes 90 % of the no label (Q14514796) cases ( Should we assign the icd code to the parasite, both or just the disease? --Tobias1984 (talk) 15:04, 7 August 2013 (UTC)

I would think just the disease. In this specific case, it has been imported from the English Wikipedia, where it is listed in the same article because there is no separate article for the disease itself. --WS (talk) 07:03, 9 August 2013 (UTC)

Hong Kong flu vs H3N2[edit]

Could you look at the page Hong Kong flu (Q1069785) please? In most of the languages it is the Hong Kong flu (1968-1969) and there are two languages with the virus H3N2 (causing the Hongkong flu). I don't want to delete them as I have no idea about the content of these Japonese and Armenian articles; for those the correct place would be influenza A virus subtype H3N2 (Q13399926). I already corrected the Hungarian and the French one. --Hkoala (talk) 20:13, 5 January 2014 (UTC)

Thanks for noticing. I moved the two H3N2 articles to influenza A virus subtype H3N2 (Q13399926). From the machine translation they both looked like they are about the virus. --Tobias1984 (talk) 22:51, 5 January 2014 (UTC)


I just checked Ebola virus disease (Q51993) and it still has many missing statements. It is actually a good item to test, if we can fully describe a disease yet, with our existing properties. Also: our properties are almost not used by the community. Any ideas how this could be improved? -Tobias1984 (talk) 19:32, 2 August 2014 (UTC)

Medical Wikidata, Citation MetaData[edit]

There is now Wikidata:WikiProject Source MetaData, and Daniel Mietchen is starting off with malaria-related scientific articles. Is good source metadata of broad interest to the medical data community on Wikipedia? If so, is there a suitable place to ask them for input (for example, what metadata fields would medical editors like?)?

I've also been talking to the Cochrane Collaboration about sharing their database with Wikidata, and they are interested but understandably worried about labour costs. I'm told that the IdeaLab and Wikimedia Deutschland are both possible sources of funding for a data interoperability internship or some such; do you know of any other good routes? HLHJ (talk) 17:12, 11 August 2014 (UTC)

@HLHJ: What kind of metadata would Cochrane provide? I think that at the moment Wikidata and d:Wikidata:WikiProject Medicine should concentrate on alleviating medical editors from maintenance activities so they can concentrate on content. Primarily that includes interwiki-links and identifiers to medical catalogues. Possible next steps would be the centralization of categories (a ontology that is currently build in 270 different languages, wasting a lot of time). We are slowly moving towards this goal: The Drugbank-ID stored on Wikidata is now used in some Russian-Wiki templates (d:Property talk:P715). So if Cochrane can do one thing, they should try to make all medical identifiers freely available in a machine-readable form. --Tobias1984 (talk) 17:02, 14 August 2014 (UTC)
Since they seem to have databases, I can't imagine much is in non-machine-readable form. Risk of bias assessments of articles were mentioned, and other raw review data. With such metadata, when I found a relevant article, I could ask the database whether this article had been incorporated into any systematic reviews. I could ask it for a list of all the articles which had been incorporated into systematic reviews that also incorporated this article. I could ask for the proportion of those studies that were double-blinded, or had trial registrations listed. I could ask for the links to the articles and their registrations. I could plot how long the studies lasted, or how many people they involved... etc., automatically. Would this be useful enough to medical editors to justify the effort? HLHJ (talk) 19:03, 14 August 2014 (UTC)
@HLHJ:: I copied the discussion here so other data contributors can look at it. I do think that this data would be useful. Queries for studies with e.g. "more than 500 participants" would be a powerful tool to find the study that might not be answerable by a text search. -Tobias1984 (talk) 08:40, 16 August 2014 (UTC)

Intra language links[edit]

We are creating medical content for K'ichi that only exists in incubator. Is their a way to add this content to the intra language links? We have 4 translated articles as listed at the bottom here [6] Doc James (talk · contribs · email) (if I write on your page reply on mine) 10:09, 26 August 2014 (UTC)

@Jmh649: Incubator site-links have not been implemented yet, but are planned I think. It is probably not high on the priority-list because of the small amount of pages. Tobias1984 (talk) 10:34, 26 August 2014 (UTC)

Data aquisition disease infobox[edit]

Just putting this here, so it can go into the talk page archive. This was the initial acquisition of the diseases infobox. These identifiers are now stored on Wikidata. -Tobias1984 (talk) 12:15, 27 August 2014 (UTC)

Task Bot(s) # of claims Progress
Template:Infobox disease (Q6436840) - 5842 transclusions  ?
MeSH ID (P486) User:KLBot2
User:SamoaBot Task 25
922 15%?
ICD-9 (P493) User:Kompakt-bot 3650 99 %? ✓ Done
ICD-10 (P494) User:KLBot2
3914 99 %? ✓ Done
DiseasesDB (P557) User:KLBot2
2679 99%? ✓ Done
ICD-O (P563) User:Kompakt-bot 321 99% ✓ Done
OMIM ID (P492) User:KLBot2
1456 99%? ✓ Done
MedlinePlus ID (P604) User:KLBot2
1395 99%? ✓ Done
eMedicine (P673) User:Kompakt-bot 1897 99? ✓ Done
GeneReviews ID (P668) User:Kompakt-bot 147 99? ✓ Done

Disease Ontology import[edit]

As a member of the ongoing Gene Wiki project, I am are contemplating how to bring structured gene-disease relationships into wikidata. The thinking right now is to a) bring in the human gene concepts (see ProteinBoxBot's work), b) bring in the human disease ontology , and c) add claims that connect them based on repositories such as OMIM . Anyone here in project medicine have thoughts about that? --Genewiki123 (talk) 17:35, 10 September 2014 (UTC)

Genewiki123 (talkcontribslogs) It is a perennial proposal. It is my opinion that if you drew up a proposal then it would be like to get community support. User:Klortho might be able to tell you something about past proposals, as could User:Emw. My advice would be that if you did this, because of the magnitude of the project, consider making a proposal which includes a layman explanation of what you are doing, what sources you have to back your claims, and why the outcome matters. I would not recommend doing this unless you get consensus that the sources you use to back your claims are excellent, and that consensus should probably include people who have no idea what this project means, because I predict that this kind of project would get more scrutiny from odd observers than any typical gene database. Blue Rasberry (talk) 20:11, 11 September 2014 (UTC)
Genewiki123, Bluerasberry, I recommend reading The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration and Relations in Biomedical Ontologies by Barry Smith et al., which are major papers in biomedical ontology and describe key principles of OBO ontologies (of which Disease Ontology is one).
One of the main purposes of the Disease Ontology (DO) is as a class hierarchy for disease. When DO makes claims like "breast cancer is_a thoracic cancer" (as it does at, it is stating "breast cancer subclass of (P279) thoracic cancer". You can see this by opening (warning: big) and seeing how all uses of "is_a" are replaced by rdfs:subClassOf. P279 is mapped to rdfs:subClassOf and exists as such in the Wikidata OWL exports (see wikidata-taxonomy.nt.gz, which you can explore in Protege).
A hierarchy for disease already exists on Wikidata: As you're likely aware, there are many overlapping disease hierarchies from different authorities. Wikidata can theoretically support multiple hierarchies in a given domain like "disease", but we should designate one as preferred to make things tractable, at least for humans.
I'll follow up here this weekend with more comments. But I think I broadly support Genewiki123's proposal. Emw (talk) 13:00, 12 September 2014 (UTC)
Thanks for the advice, will definitely try to get something formal written out and posted here before taking any further steps. In terms of the existing disease hierarchy, my thinking is to leave it as it is, map DO terms to it where possible and then create new entities when there is something missing. A key modeling question for me is whether to use the subclass property to assemble a polyhierarchy (because it would contain both the existing structure, the DO structure, and potentially many others (e.g. MeSH)). I think, with the help of B.S. disciples like @EMW here, we could do this in a way that is ontologically correct and useful, but that would be a lot of challenging modeling work and I worry about how it would impact generic tools for viewing class hierarchies. It might actually be better attempt to leave a single wikidata-specific disease subclass hierarchy in place and then enhance the data with the unchanged hierarchies from external sources where these were modeled with source-specific properties (DO_subclass_of, MeSH_narrower_than, etc.) ??? --Genewiki123 (talk) 18:10, 15 September 2014 (UTC)

Looking at the hierarchy as it is right now we would like to propose a streamlined set of disease categories, with the idea that a disease could be a cancer as well as a disease of a certain body system. This upper level would include disease of each body system and e.g. cancer, infectious disease, metabolism disease, mental health disease, genetic disease and syndrome. And we would like to discuss the possibility of a separate category for physical disorders. emitraka lschriml

I just tried to find out what the licensing of the DO is. According to doi:10.1093/nar/gkr972, it is "the [sic!] Creative Commons license", which is not precise enough for most purposes, but sufficient to inform us that importing into Wikidata is not possible. --Daniel Mietchen (talk) 23:39, 31 October 2014 (UTC)
emitraka, lschriml, Daniel raises a salient point. Could you make the official licensing of Disease Ontology more precise? Unless Disease Ontology is in the public domain -- i.e. under CC0 -- it technically cannot be imported into Wikidata. More on CC0 here: Emw (talk) 16:50, 29 November 2014 (UTC)
Daniel Mietchen, Emw, sorry about us not being clearer on the issue beforehand. The official licensing of DO is under Creative Commons Attribution 3.0 Unported. Emitraka (talk) 22:01, 1 December 2014 (UTC)

subclass of disease: lo-fi DO import problematic[edit]

I recently noticed a batch addition of 'subclass of (P279) disease (Q12136)' statements to items about diseases. For example, see this revision of Alzheimer's disease: It has a few problems:

  • Low fidelity provenance. The statements have references like "stated in: Disease Ontology release 2014-11-14", but the corresponding Disease Ontology (DO) item for the disease does not directly state "subclass of disease". The Disease Ontology item for Alzheimer's disease states "subclass of tauopathy" and "subclass of dementia".
If we want to import an ontology, then we should import it. Do a Wikidata keyword search on the object of the ontology's subclass of (/ is a) statement, get its Q number, and make the precise statement the ontology makes. Compare DO's 'Xrefs' to Wikidata's identifier properties if disambiguation becomes an issue. Create items as needed, though this should be rare. We should not use references like "stated in: $ontology" to support claims that are only distant transitive entailments of what the imported ontology actually states.
  • Redundancy with a large set of existing claims. Most of these diseases had already been classified with more granular claims via subclass of (P279), and many of those claims are precisely what is stated in Disease Ontology. For example, per links above, DO states "Alzheimer's disease subclass of dementia", and this statement was already in the Wikidata item on Alzheimer's. The appropriate thing to do in such a case is to add a reference to the pre-existing claim when it matches the imported statement -- rather than adding a new, redundant claim.

Can we revert these problematic "subclass of disease" statements added by ProteinBoxBot? currently reports 4450 "subclass of disease"; that number should ideally be much lower. I support faithfully importing subclass of statements from Disease Ontology, but the batch edits made around November 24, 2014 are problematic and need to be fixed. Genewiki123, Bluerasberry, Daniel Mietchen, what do you think? Emw (talk) 16:37, 29 November 2014 (UTC)

I reverted these "subclass of" claims. Andrawaag (talk) 22:59, 29 November 2014 (UTC)
Thank you Andra (and for all your work). now lists 62 direct "subclass of disease" claims, which seems about right. Emw (talk) 00:59, 30 November 2014 (UTC)
Andrawaag (talkcontribslogs), thanks for responding to the criticism of your bot. Emw (talkcontribslogs), thanks for raising the issue. I agree with Emw when he says "If we want to import an ontology, then we should import it", and as I understand, in this case there was some partial import of some ontology which was not well established. If that was the case, and there was dispute about its validity, then I am glad to see the uploader withdraw the claims due to lack of explanation of the need for these edits to be in Wikidata. Blue Rasberry (talk) 16:05, 2 December 2014 (UTC)
Bluerasberry, it turns out that Andra was just being consistent with how genes were very broadly classified (e.g. APOE subclass of gene). General genes, specific diseases examines those different approaches to classification. Emw (talk) 01:16, 3 December 2014 (UTC)

Adding disease properties[edit]

We're developing a Disease Box in the same vein as the Protein Box. So here we would like to propose and initiate discussion to some properties we think are needed.

Possible properties:

→ to add to Disease:
symptoms: add list of symptoms from SYMP
symptom: [page is currently not populated]
pathogen transmission process: add terms from TRANS
add anatomical location, use UBERON; link out to specific anatomy terms or as an alternative pull UBERON into Wikidata
Add UBERON ID as property of Anatomy
add property Phenotype: populate from HPO
add Inheritance: genetic, monogenic, polygenic, autosomal, recessive, dominant, X-linked
add Orphanet ID
add prevalence
add ICD10 codes
add disease categories/upper level term; suggest upper level term e.g. cancer, genetic disease, metabolic disease, mental health disease etc.
add synonyms

For each of the ontologies mentioned above we propose to pull them into Wikidata. emitraka lschriml

Are any of these already being covered in Wikidata? Blue Rasberry (talk) 19:49, 19 September 2014 (UTC)
@emitraka, lschriml: Orphanet ID (P1550) is done. Did you have time to familiarize yourself with Wikidata? Do you need help with the property proposals? -Tobias1984 (talk) 17:02, 6 October 2014 (UTC)
Thank you! I'm finding my way around Wikidata, slowly but surely. Emitraka (talk) 17:17, 6 October 2014 (UTC)
@emitraka, lschriml, Emw, Andrew Su, Genewiki123: Just created UBERON ID (P1554). -Tobias1984 (talk) 06:35, 7 October 2014 (UTC)

Summary of relevant existing properties[edit]

Emitraka, Lschriml, welcome to Wikidata! It's exciting to have you both here -- I'm a fan of your work with the Disease Ontology. Several of the properties you propose exist:

Property Talk page All usages Creation discussion
symptoms (P780) Property_talk:P780 Wikidata:Property_proposal/Archive/12#P780
pathogen transmission process (P1060) Property_talk:P1060 Wikidata:Property_proposal/Archive/18#P1060
prevalence (P1193) Property_talk:P1193 Wikidata:Property_proposal/Archive/20#P1193
mode of inheritance (P1199) Property_talk:P1199 Wikidata:Property_proposal/Archive/19#P1199
ICD-10 (P494) Property_talk:P494 Wikidata:Property_proposal/Archive/7#P494 (and #P493)

As you can see, most of the above properties are rarely used at the moment, simply because they're little known.

Many other properties of interest:

We do not currently have a property for Uberon ID or Orphanet ID. Identifier properties like that virtually always pass property proposal without issue. I'd certainly be interested in hearing more about importing Uberon (presumably via subclass of); it seems like it could be a good candidate for a reference domain ontology for anatomy on Wikidata.

Note that you can search properties by prepending "P:" to your query in the search box at upper right in the Wikidata UI (e.g. P:icd).

Disease categories can be added with subclass of (P279). That property is mapped to rdfs:subClassOf and thus also has the semantics of BFO's is_a. Help:Modeling_causes#Malaria also seems pertinent. Synonyms can be added as aliases. Aliases have a nice side-effect of getting picked up by Wikidata's search. The Gene Wiki project cleverly set enabled searching by Entrez Gene IDs and UniProt IDs by setting those values as aliases in genes like RELN (Q414043) and proteins no label (Q13569356).

Hope this helps. You can ping me or other users by mentioning them like [[User:Emw|Emw]]. Cheers, Emw (talk) 06:07, 20 September 2014 (UTC)

Adding disease properties - amended[edit]

Emw thank you very much for the information. It looks like most of what we need already exists. What we still need, and would like to move forward with proposals for them, are the following properties:

Any comments will be greatly appreciated Emitraka (talk) 19:33, 30 September 2014 (UTC)

@Emitraka: I requested one of the properties here. You can request more properties in the same way: Wikidata:Property_proposal/Natural_science#Orpha.net_ID. -Tobias1984 (talk) 06:29, 1 October 2014 (UTC)
You can also check and make changes to the project's list of properties: Wikidata:WikiProject Medicine/Properties. Tobias1984 (talk) 06:32, 1 October 2014 (UTC)
Emitraka, Tobias1984, others, has phenotype and phenotype of are interesting potential properties. I think some explanation of the formal differences and relations between disease, phenotype and symptom would be helpful.
From what I can tell from Clinical Diagnostics in Human Genetics with Semantic Similarity Searches in Ontologies, Integrating phenotype ontologies across multiple species, The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data and browsing around HPO's Phenomizer, HPO seeks to link what it calls "phenotypic abnormalities" (synonym: "organ abnormalities", also called "clinical features" in aforementioned papers) with diseases -- more precisely, human genetic disorders that have a Mendelian mode of inheritance. For example, one might say "22q13 deletion syndrome (Q1926345) has phenotype 2-3 toe syndactyly.
One possible issue I see with HPO is that it classifies some things that do not have a genetic basis as phenotypic abnormalities. For example, "Abdominal pain" (HP:0002027) is transitively set to satisfy the statement "abdominal pain subclass of phenotypic abnormality". By the standard reading of subclass of, that would mean every instance of abdominal pain is also an instance of phenotypic abnormality. Clearly, that statement is not consistent with how the term "phenotypic" is used throughout the literature to mean an observable trait (aka characteristic, property or quality) that arises because of the genotype of the bearer (i.e. the organism) the quality inheres in. An example of abdominal pain that is not a phenotypic abnormality is abdominal pain caused by extremely spicy food.
Emw Though there is clearly a focus on disorders with obvious genetic components, I don't think the HPO folks or the larger community would consider phenotype to be solely related to genetics. Phenotype is the combination of genetics and environment . It makes sense to include things like abdominal pain because they are indeed such a combination. Even if we only cared about the genetics of disease, we would still want to include observables like abdominal pain or short stature etc. in our records. The determination of the relationships between environmental factors and genetic factors could and often would come after the recording of the phenotype. -- 18:57, 2 October 2014 (UTC)
When would be has phenotype be preferable to symptoms (P780) (i.e. has symptom), and vice versa?
Great question - and very hard to answer.. I would like to hear Peter Robinson answer it. One clear case would be phenotypes that were not necessarily related to diseases. For example blue eyes, freckles, etc. Has Symptom seems like sub-property of Has Phenotype that constrains the relation such that the items in its Domain are classified as diseases. -- 18:57, 2 October 2014 (UTC)
It would also be good to discuss why a potential has phenotype property would be needed distinct of the generic causation properties has cause (P828). (Note also related properties has immediate cause (P1478), has contributing factor (P1479) and their inverses cause of (P1542), immediate cause of (P1536), contributing factor of (P1537), and Help:Modeling causes.) Consider:
Option A: has effect (i.e. cause of)
2-3 toe syndactyly
22q13 deletion syndrome (Q1926345)
22q13 deletion (Qx)
Option B: has symptom
2-3 toe syndactyly
22q13 deletion syndrome (Q1926345)
22q13 deletion (Qx)
Option C: has phenotype
2-3 toe syndactyly
22q13 deletion syndrome (Q1926345)
22q13 deletion (Qx)
Which of the above options is preferable?
It seems to depend on what items you want to use the property on. If you are linking specific genetic variations to a phenotype that we know they cause (or play a role in causing), then option a makes sense. If you are linking a genetic event to a phenotype that seems to co-occur.. then option C. If you are linking a disease to a symptom/phenotype than B.
None of these questions are show-stoppers or blocking issues in my opinion, but they are probably worth contemplating to avoid confusion down the road. Emw (talk) 13:00, 1 October 2014 (UTC)

WDWP:Medicine mentioned in WM blog[edit] -Tobias1984 (talk) 15:11, 24 October 2014 (UTC)

Add UMLS concept id property?[edit]

What do you think of adding a property to link items to UMLS concept identifiers? The UMLS (Unified Medical Language System) provides a large-scale 'metathesaurus' that attempts to organize and inter-relate hundreds of biomedical terminologies. . To integrate ontologies, they produce a single 'concept identifier' and then link it up to the equivalent concepts in each of the ontologies that they import. This is very useful for large-scale data integration projects that often need to map from one vocabulary to another. Having umls concept ids could really help disambiguate wikidata items as we go forward. --Genewiki123 (talk) 19:34, 13 November 2014 (UTC)

Genewiki123, sounds good to me. Emw (talk) 15:04, 29 November 2014 (UTC)
Sounds good to me too. --Daniel Mietchen (talk) 03:38, 30 November 2014 (UTC)

Hyphens instead of dashes -- an appeal for simplicity[edit]

Many medical subjects have compound names, like "Creutzfeldt-Jakob disease" or "case-control study". Newspaper style guides often recommend using en dashes (–) to connect the first part of such disease names, as does English Wikipedia per MOS:DASH. However, hyphens (-) are used instead of dashes by almost all medical journals, medical institutions, non-English Wikipedias, editors, developers, and readers.

Dashes in labels cause more problems than they solve. If a user is trying to search a page for a disease by name, then pasting "Creutzfeldt-Jakob disease" into a web browser's Find bar (Control F or Command F) won't work if "Creutzfeldt" and "Jakob" are connected by a dash. If an editor searches for "Charcot-Marie-Tooth disease" they will currently be brought to Q18553896 instead of Q1052687, the former of which was created in mistake because the developer (understandably!) did not account for dashes. Search or label-dependent functionality in third party software will also likely encounter problems if we persist using dashes.

To address those problems, I propose we make it policy to always use hyphens and never dashes in labels that have compound words or ranges. This would simplify matters and reduce bugs by aligning our labels to those overwhelmingly used by medical and scientific journals and institutions, non-English Wikipedias, and our various types of users. What do you think? Emw (talk) 18:43, 6 December 2014 (UTC)

Doc James
Daniel Mietchen
Andrew Su
Projekt ANA
Pavel Dušek
Was a bee
Chris Mungall
Dr. Abhijeet Safai
Pictogram voting comment.svg Notified participants of WikiProject Medicine

Andrew Su
Marc Robinson-Rechavi
Pierre Lindenbaum
Michael Kuhn
Dan Bolser
Timo Willemsen
Salvatore Loguercio
Daniel Mietchen
Ben Moore
Alex Bateman
Vojtěch Dostál
Andra Waagmeester
Elvira Mitraka
David Bikard
Dan Lawson
Francesco Sirocco
Konrad U. Förstner (talk)
Chris Mungall (talk)
Kristina Hettne
Karima Rafes
Finn Årup Nielsen
Jasper Koehorst
Till Sauerwein
Amos Bairoch
Was a bee
Muhammad Elhossary
Pictogram voting comment.svg Notified participants of WikiProject Molecular biology

I'm not a native speaker, but I don't think that replacing dashes with hyphens could cause any problems. The label is anyway just a way for us to handle the database. If somebody sees the need for a typographic correct output, we should probably create a property for that. --Tobias1984 (talk) 10:02, 8 December 2014 (UTC)

Launch of WikiProject Wikidata for research[edit]

Hi, this is to let you know that we've launched WikiProject Wikidata for research in order to stimulate a closer interaction between Wikidata and research, both on a technical and a community level. As a first activity, we are drafting a research proposal on the matter (cf. blog post). It would be great if you would see room for interaction! Thanks, --Daniel Mietchen (talk) 01:34, 9 December 2014 (UTC)

Classifying surgical procedures[edit]

There is a discussion at Talk:Q15636253 about how to organize surgical procedures into a concept hierarchy, and how to label it. It touches on ICD-10-PCS and what constitutes reasonable medical terminology. Input from members of this WikiProject would be very welcome! Emw (talk) 17:27, 11 January 2015 (UTC)

Doc James
Daniel Mietchen
Andrew Su
Projekt ANA
Pavel Dušek
Was a bee
Chris Mungall
Dr. Abhijeet Safai
Pictogram voting comment.svg Notified participants of WikiProject Medicine

ICD-10-PCS copyright[edit]

Is ICD-10-PCS in the public domain? This question arises from an informative comment:

If the ICD can be used then obviously that is a classification system with a huge amount of international support and I would like to use it. At Wikimania 2012 the WHO sent two representatives to talk to Wikipedians. As I understood at the time, they only allowed the newer ICD systems to be used with licensing agreements, as these are non-free coding systems. We considered becoming more organized to ask that the ICD-11 be freely licensed, as described at en:Wikipedia:WikiProject Medicine/ICD11. What do you know about the circumstances under which these coding systems can be used freely?

While ICD-10 is made available by the World Health Organization (WHO), ICD-10-PCS is made available by the Centers for Medicare and Medicaid Services (CMS), part of the Department of Health and Human Services (HHS) of the US federal government. Per the page 1 footnote in 2015 Development of the ICD-10 Procedure Coding System, the system is developed through funding from federal government contracts (Nos. 90-1138, 91-22300, 500-95-0005, HHSM-500-2004-00011C and HHSM-500-2009-000555-C) by 3M Health Information Systems.

I have found no copyright notice in the resources at 2015 ICD-10 PCS and GEMs. Works by the US federal government are generally not copyrighted and thus in the public domain, but the situation is murkier for works produced by government contractors. The HHS Grants Policy Statement seems to be an authoritative statement on copyrightability of US government works funded by HHS contracts, but I see no particular mention of this work there.

The copyright status of ICD-10-PCS is unclear. If it is in the public domain, then it seems like a good option to consider using as the preferred subclass of (P279) hierarchy for medical and surgical procedures. Such use might also demonstrate to medical organizations like WHO, etc. that putting works like ICD-10 would be practically useful and in the public interest. Emw (talk) 23:48, 17 January 2015 (UTC)


Could we set up archiving on this page? I'm not sure how that's done on Wikidata but it'd be quite useful to keep only the recent and relevant threads here. --LT910001 (talk) 03:09, 29 March 2015 (UTC)

Moving ICD codes and ATC codes from navboxes to Wikidata[edit]

We recently had a similar process with Anatomy articles - identifiers for templates were moved to Wikidata. This significantly enhanced their readability. Could we do something similar for medicine templates?

An example template with the codes is here: en:Template:Respiratory pathology.

As stated previously the reason for doing this is that we can preserve the identifiers but increase the navigational value. Users do not use the numbers to navigate within the navboxes, and they visually clutter the title.

Thoughts? --LT910001 (talk) 03:11, 29 March 2015 (UTC)

I have previously proposed this here: Wikidata:Bot_requests#Move_all_template_ICD9_and_ICD10_references_to_wikidata but do not understand the reply, which I think is referring to articles rather than templates with duplicated ICD9 values (?). --LT910001 (talk) 03:14, 29 March 2015 (UTC)
@LT910001: I also do not understand the reply.
Can you more explicitly point out the codes in the template you presented as an example? Looking at that page I do not see what was changed. Blue Rasberry (talk) 18:43, 31 March 2015 (UTC)
Thanks for the ping, Bluerasberry (talkcontribslogs), I often forget to check wikidata. That is my point. The codes are still there ("J 460..."). They make the template name harder to read and don't add navigational value. We can move all those codes to wikidata and then remove them from the template headings. We recently did this with anatomy templates and I feel this improved their readability about 100% as now they are a lot less intimidating to casual readers. --LT910001 (talk) 22:10, 2 April 2015 (UTC)
@LT910001: I am still not sure what is being proposed. The reader-facing head of that template is currently "Pathology of respiratory system (J, 460–519), respiratory diseases". If you are proposing to change it to "Pathology of respiratory system, respiratory diseases" and to move the links to "J, 460–519" somewhere other than the title line then that could be an improvement, as obviously those codes are not intended for most readers, even though they do back structure of the template and ought not be removed entirely. Can you show an example anatomy template that was like this before, but in which someone has moved the links to these coding systems from the title to elsewhere and incorporated some connection with Wikidata? Blue Rasberry (talk) 13:36, 6 April 2015 (UTC)
That is exactly what I am proposing ("move the links to "J, 460–519" somewhere other than the title "). Readers will benefit by having a clearer title.
You ask for an example page. As stated at the beginning of this thread, en:Template:Respiratory pathology is an example of a template where this has happened. You can use the 'history' ability to see the difference. You can click 'wikidata' to see the related data. This has has in fact already happened on every anatomy template on wikipedia. --LT910001 (talk) 06:07, 12 April 2015 (UTC)

Infobox disease - update required...[edit]

There was a number of changes from DSM-IV to DSM-5 in names of diseases, in their definitions, new diseases emerged... at the same time Infobox disease remained unchanged. I think the Infobox requires modifications i.e. adding DSM-5 codes... --Pwlps (talk) 06:36, 3 May 2015 (UTC)

@Pwlps: Hi! Including the newest classifications would be a goal of this Wikiproject. Currently we have quite a high workload curating the existing data and trying to catch up to all the data that is spread the different languages of Wikipedia. If you would like to work on DSM-5, I could help you getting started. --Tobias1984 (talk) 10:04, 3 May 2015 (UTC)
@Tobias1984: Thank you for your offer but I have zero knowledge on modifying templates and I'm far more effective as an editor of polish WikiMedicine Project articles. However I think due to DSM-5 changes updateing Infobox disease is rather necessary... --Pwlps (talk) 06:26, 4 May 2015 (UTC)
@Pwlps: I am ignorant of how things work here also but I often look at infoboxes and want to know how I could add information here that would update content in Wikipedias of different languages. The best thing that I can say is that even though I know almost nothing, I would look at any proposal here with others, but I also am learning slowly. Blue Rasberry (talk) 19:35, 4 May 2015 (UTC)
@Pwlps, Bluerasberry: I requested the inclusion here: Wikidata:Property_proposal/Natural_science#DMS_V_.28DSM_5.29 --Tobias1984 (talk) 08:04, 6 May 2015 (UTC)

No MeSH term property?[edit]

Correspondence between MeSH data and Wikidata peoperties (from [1]).

We have already two MeSH (Medical Subject Headings) related properties.

But it seems that we don't have "MeSH term" property yet (described as "MeSH Heading" in the figure at right). In English Wikipeida, "MeSH term" is generally reffered as "MeshName". If we search at PubMed, this "MeSH term" data is needed (instruction video, search result example). So I suppose we need proposing and create property for MeSH terms. What do you think? Thanks. --Was a bee (talk) 04:56, 7 June 2015 (UTC)

@Was a bee: That search-video seems really practical. A few of my questions: Do we want to rebuild a system that already works well and is public? Can we get to comparable search results using the existing propoerties? Are we even allowed to copy the MeSH terms (It seems to go beyond storing identifiers). --Tobias1984 (talk) 14:01, 7 June 2015 (UTC)
@Tobias1984: Thank you for questions. I think my explanation was not good :p MeSH terms are already widely used. So there are no actual changes for this. For example, article en:Dengue fever has the line"MeshName = Dengue" in infobox. This parameter "Dengue" is MeSH term. MeSH term is short nouns like "Dengue", not a definition text about Dengue. Difference is, as same as other similar data (e.g. MeSH Code (P672), MeSH ID (P486)), simply saved in various wikis in distributed manner (old style) or saved at one place (at Wikidata). As far as I searched, PubMed (and most of MeSH related website) is not afford another style of input. --Was a bee (talk) 15:47, 7 June 2015 (UTC)
I worry about copyright infringement. How much of the MeSH system can we copyright without encroaching on the parts which are not allowed to be copied? If we can import MeSH terms then I think that would be useful, because it gives a simple human readable explanation of what the other identifiers are. If it is allowed to copy these terms then I support the creation of whatever is necessary to include the information here. Blue Rasberry (talk) 13:34, 10 June 2015 (UTC)


This is a list of things we would like to save as meta-data for medical publications. Suggestions for mappings to Wikidata are welcome (just add them in the same row). You can also add more suggestions for meta-data.--Tobias1984 (talk) 15:23, 4 October 2015 (UTC)

* Addresses
* Autobiography
* Bibliography
* Biography
* Books and Documents
* Case Reports
* Classical Article
* Clinical Conference
* Clinical Trial
* Clinical Trial, Phase I
* Clinical Trial, Phase II
* Clinical Trial, Phase III
* Clinical Trial, Phase IV
* Comment
* Comparative Study
* Congresses
* Consensus Development Conference
* Consensus Development Conference, NIH
* Controlled Clinical Trial
* Corrected and Republished Article
* Dataset
* Dictionary
* Directory
* Duplicate Publication
* Editorial
* Electronic Supplementary Materials
* English Abstract
* Evaluation Studies
* Festschrift
* Government Publications
* Guideline
* Historical Article
* Interactive Tutorial
* Interview
* Introductory Journal Article
* Journal Article
* Lectures
* Legal Cases
* Legislation
* Letter
* Meta-Analysis
* Multicenter Study
* News
* Newspaper Article
* Observational Study
* Overall
* Patient Education Handout
* Periodical Index
* Personal Narratives
* Portraits
* Practice Guideline
* Pragmatic Clinical Trial
* Published Erratum
* Randomized Controlled Trial
* Research Support, American Recovery and Reinvestment Act
* Research Support, N.I.H., Extramural
* Research Support, N.I.H., Intramural
* Research Support, Non-U.S. Gov't
* Research Support, U.S. Gov't, Non-P.H.S.
* Research Support, U.S. Gov't, P.H.S.
* Research Support, U.S. Government
* Retracted Publication
* Retraction of Publication
* Review
* Scientific Integrity Review
* Systematic Reviews
* Technical Report
* Twin Study
* Validation Studies
* Video-Audio Media
* Webcasts

Drug prices[edit]

I did some tests on test.wikidata to see how we could store information about drug prices. I think that we should create separate items for drug-packagings. For the example of Isoniazid we would have this structue:

  • Item for the pharmaceutical substance (the molecule, or mixture)
  • Item for 100 mg tablets
  • Item for 100 mg bottles
  • I am not sure if we should do separate items for different manufacturers would be a good idea.
  • Item for 500 mg tablets
  • ...

An example for a packaging is here:

The statements include the defined daily dose and the dosage (amount of active ingredient). The property prize then takes statements qualfified with year, bulk-packaging-size, and the country or region where the prize was quoted. For bottles or injections we would need another qualifier for the volume of the liquid.

We should spend some time thinking about how to model drug prices. I would also still like to include descriptions of tablets, but those might be dependent on the manufacturer and that means creating subitems, for the dosage items listed above. --Tobias1984 (talk) 09:04, 5 October 2015 (UTC)

@Tobias1984: I created en:WP:Prices to collect discussion on this. If we had the kind of data you describe, then I would like to see it in Wikidata. Blue Rasberry (talk) 17:08, 5 October 2015 (UTC)
@Bluerasberry: Thanks for gathering all those dicussions and the very eloquent summary of the problem. User:Doc James is interested in having this data on Wikidata. There would be a lot of pontential for querying, time-series-data and putting the data on maps (e.g. The source would be the reports linked on this page ( I don't think WP:PRIMARY is a problem in this case, because the reports are one step removed from the pharmaceutical companies. Let's see if we can get this through the review: Wikidata:Property_proposal/Generic#price. --Tobias1984 (talk) 17:45, 5 October 2015 (UTC)
I have emailed the ERC to see if they will release it under an open license. Doc James (talk · contribs · email) (if I write on your page reply on mine) 13:13, 6 October 2015 (UTC)

Item needing checking thread[edit]

related items[edit]

I have created a sub-page for cleaning items that seems related to the same subject. Ske (talk) 13:41, 28 October 2015 (UTC) :

/related items

Wikimania 2016[edit]

Only this week left for comments: Wikidata:Wikimania 2016 (Thank you for translating this message). --Tobias1984 (talk) 11:58, 25 November 2015 (UTC)

How significant is significant?[edit]

Regarding significant drug interaction (P769), what is the level of significance required for a drug interaction to be considered significant? That the interaction would be significantly hazardous for the patient's health, or that it would just lead to side effects? I ask because fluvoxamine (Q409236) is known to interact with caffeine (Q60235) by slowing the rate at which it's metabolized. This significance isn't recorded in Wikidata, even though I am pretty sure you could find it in the appropriate medical literature (it's by no means an obscure interaction). Was it excluded because the relatively benign effects of the interaction do not rise to the level of "significant," or is it just missing from Wikidata? Harej (talk) 05:26, 5 December 2015 (UTC)


How do I merge Q21797739 and Q1499629? Doc James (talk · contribs · email) (if I write on your page reply on mine) 01:46, 21 December 2015 (UTC)

@Doc James: Do they both have the same ICD code? Ther merge button was moved to the 'More' tab next to the search field. --Tobias1984 (talk) 18:25, 26 April 2016 (UTC)
Will need to look. Travelling right now. Doc James (talk · contribs · email) (if I write on your page reply on mine) 18:26, 26 April 2016 (UTC)
Joint swelling is just the lay term for joint arthrosis. Doc James (talk · contribs · email) (if I write on your page reply on mine) 02:37, 14 July 2016 (UTC)

Storing ICD9 and 10 codes from EN medical templates[edit]

I've made a proposal for a bot to do this here: [7], with a view to storing them here and removing them ultimately from template titles, which only serves to make them less readable to readers. The current proposal is to store the related ICD codes in Wikidata; once this is done, a separate proposal will be made on Wikipedia itself. Please comment. --LT910001 (talk) 22:15, 10 February 2016 (UTC)

Please comment on my Individual Engagement Grant talk page about my proposal for Guided Checklist for Health Topic Experts[edit]

Hello everyone,

I created a new Individual Engagement grant to try and fix a problem. m:Grants:IdeaLab/Effective Engagement with Health Topic Experts using Guided Checklists

From my work with Cochrane as a Wikipedian in Residence and my observations of other attempts to engage health topic experts in editing, I've come to the conclusion that the quality of the contributions of new health topic expert recruits does not match their level of expertise and effort the we as Wikimedians put into training new contributors. So, I decided to create a new project to develop a Guided Checklist that would assist a health topic expert in assessing the quality of a health articles on Wikipedia, and then guide their contributions toward making edits to correct the lack of quality.

My individual engagement grant would involve interviewing health topic experts and active medical editors, as well as a community consultation on Wikipedia English WikiProject Med. Additionally, because health topics are interrelated on Wikipedias, Commons, and Wikidata, I'm inviting people who are active in WikiProject Medicine on Wikidata to commment and participate. Please add yourself as a volunteer if you would like to participate. Or leave suggestions on the talk page on Meta. Or endorse if you support the idea. Going forward, I'll keep this project updated on the proposal. Sydney Poore/FloNight♥♥♥♥ 23:25, 15 April 2016 (UTC)

Queries and Translations[edit]

I now started a page to collect some showcase and maintenance queries (Wikidata:WikiProject Medicine/Queries). I also looked at some queries that could highlight gaps in our data. This query for example shows 12000 items that are related to medical topics and shows 460 missing translations into English (Query). For French it is around 5800 missing labels (fr-query). And that order of magnitude seems to be the case for most other languages. --Tobias1984 (talk) 20:12, 3 July 2016 (UTC)

antibacterial drug (Q24153252) and antibiotic (Q12187)[edit]


@Tobias1984 and I discussed whether these two items should be merged or not.

From User_talk:Tobias1984:

Hi Tobias,

I think antibacterial drug (Q24153252) should be merged with antibiotic (Q12187), but there are many links to antibacterial drug (Q24153252). Excuse me, but could you please tell me how it can be resolved? Thank you, --Okkn (talk) 15:13, 16 August 2016 (UTC)

It might be a highly controlled vocabulary, but I also fail to see the distinction. @Putmantime: might have a better overview over the topic? Otherwise we should maybe discuss it at Wikiproject Medicine so more people see the discussion. --Tobias1984 (talk) 17:40, 16 August 2016 (UTC)
@Okkn: Forgot to ping. --Tobias1984 (talk) 17:40, 16 August 2016 (UTC)
antibacterial drug (Q24153252) has only one external identifier and its web page [8] is citing en:Antibiotics... This means antibacterial drug (Q24153252) == antibiotic (Q12187), doesn't it?
Anyway, where can we discuss about this topic at Wikiproject Medicine? --Okkn (talk) 19:59, 16 August 2016 (UTC)
@Okkn: You can just copy the thread to this talk page: Wikidata talk:WikiProject Medicine. --Tobias1984 (talk) 20:54, 16 August 2016 (UTC)

The following points shall be taken into account:

Please let us know your opinion. Thanks, --Okkn (talk) 10:09, 17 August 2016 (UTC)

From the top of my head:

  • Antibiotics includes all antibacterials, but also antimycotics and some antiprotozoal agents
  • Not all antibiotics are useful as drugs — though they may still be useful in laboratory settings. This makes them distinct from antiseptics (I don't know if they have an item atm) — which in turn would be used on surfaces etc. It might not be as straight-forward as one first assumes.

Which means: Antibacterials ≠ antibacterial drugs
A solution would be to have:

  • antibacterial agent
    • antibacterial drug
  • antimycotic agent
    • antimycotic drug

etc., with all being subtypes to antibiotics Thoughts? Too complex? CFCF (talk) 12:29, 21 August 2016 (UTC) CFCF (talk) 12:29, 21 August 2016 (UTC)

As an extra note antibiotic is roughly equivalent to antimicrobial under some definitions, but under others antibiotic = antibacterial. It may be useful to get a listing of what definitions exist. CFCF (talk) 12:32, 21 August 2016 (UTC)
I agree with you on the whole. But irrespective of the definition of "antibiotic", what en:antibiotics refers to may be "antibacterial agents", and we don't have to have an item whose label is "antibiotic". So how about this plan?
  1. Rename the label of antibiotic (Q12187) to "antibacterial agent". (Sitelinks remain on it.)
  2. Create a "subclass of" tree as follows (suggested by MeSH):
--Okkn (talk) 21:30, 21 August 2016 (UTC)
In English Wikipedia these terms have also been confused. People discussed this in en:Talk:Antibiotics/Archive_1#This_article_should_be_named_.22Antibiotics.22. I think that @Okkn:'s tree, if taken from MeSH, should be used. The tree has the advantage of being supported by MeSH and matching what most English speakers would understand. At the same time, it has the disadvantage of the term "antibiotic" having an etymological meaning of "against life", which makes people think it should be a term for something kills bacteria, viruses, and a range of things. Despite the etymology, the term "antibiotics" refers to bacteria, and "antimicrobial" is the general term for a drug or agent that kills a range of microbes. Blue Rasberry (talk) 19:15, 22 August 2016 (UTC)
Antibiotic is used to refer to antifungals and antiprotozoal medicines both in clinical practice and in the literature, without being "wrong" — there are simply two different definitions. English speakers seem to tend towards a stricter definition, but even in English it isn't entirely clear which is most common. However if MeSH uses this definition I think that is a good argument for us doing so as well. After all MeSH is very authoritative, but maybe we should look at which definition SNOMED-CT uses?
Does really MeSH place non-drug agents under pharmaceutical drug? Is there any way in which we can categorize some agents as not drugs, while drugs as both agents and drugs?
How does Wikidata support the type of ambiguity surrounding antibiotics & antimicrobials? Is there some way to make that they are sometimes referred to synonymously, where for example some languages make a distinction while others don't? CFCF (talk) 11:10, 25 August 2016 (UTC)
Sorry for misinforming you. There is no MeSH term that is corresponding to pharmaceutical drug (Q12140). Only "Chemicals and Drugs Category" exists in MeSH. You can check the tree around "Anti-Infective Agents" here (lower part of the page). Also please make sure that "antimicrobial agents" is a alias (Entry Term) of "Anti-Infective Agents"(MeSH term), and "antibiotics" is that of "Anti-Bacterial Agents"(MeSH term). --Okkn (talk) 18:06, 25 August 2016 (UTC)

This is further complicated by use of the term biocide, which carries with it a few different definitions as well. CFCF (talk) 05:55, 6 September 2016 (UTC)

The meaning of physiological condition (Q7189713)[edit]

Now health problem (Q2057971) is a subclass of physiological condition (Q7189713) and physiological condition (Q7189713) refers to any state or condition of body, organ or cell on Wikidata, but the meaning of en:Physiological condition is the opposite of "artificial laboratory condition" or "pathological state". (This definition of "physiological condition" is the same as that of Japanese "生理的状態".)

In order to prevent any misunderstanding and confusion, how about renaming "physiological condition" to "physical condition" and creating a new item which corresponds to en:Physiological condition?

--Okkn (talk) 08:41, 22 August 2016 (UTC)

I agree there needs to be some form of renaming, but "physical condition" doesn't cut it seeing as a "psychiatric condition" can be a "pathological condition" without being physical. Unfortunately I don't have any suggestions for a better classification either, "health condition" doesn't seem right to me. CFCF (talk) 11:13, 25 August 2016 (UTC)
The current state is completely wrong. A pathological condition is per definition not physiological. CFCF (talk) 11:14, 25 August 2016 (UTC)
Hmm. I didn't consider a mental condition. We may be able to say "mental or physical condition", instead of "physical condition", but it is a little long... Does "biological condition" make sense in English? Or can we just simply use "condition" as a physical/mental condition?
By the way, is the meaning of the English word "health" (health (Q12147)) containing not only healthy (good) conditions, but also bad conditions? A "disease" can be a "health"? (Bad health?) If that is so, I think health (Q12147) is much the same as "physical/mental condition". --Okkn (talk) 16:55, 25 August 2016 (UTC)
I disagree with the source wikipedia page about physiological not excluding lab conditions I commented on the talk page. I also disagree with the statement that pathological conditions are not physiological; there are conditions that are both physiological and pathological. Having said that, I'm not sure "physiological" is a useful qualifier. "biological condition" can potentially be very generic, including molecular properties as well as properties of populations. To some readers it may be exclusive of some medical conditions where the biological basis is unclear. Why not just go for a very generic conception of condition/state/attribute, and focus on more specific groupings, rather getting stuck on upper ontology issues that IMHO are not very useful Cmungall (talk) 05:25, 13 September 2016 (UTC)
My suggestion has been to make Q7189713 a generic conception of condition/state/attribute of organisms. This is because health problem (Q2057971), which is a superclass of disease (Q12136) and symptom (Q169872), has already been a subclass of Q7189713.
By the way, if there is a condition that is both physiological and pathological, I think we just make it have property subclass of (P279) with both physiological and pathological condition classes. --Okkn (talk) 16:50, 13 September 2016 (UTC)

Hackathon around infectious diseases and climate change, Vienna, Nov 3-5[edit]

A hackathon is being organized for November 3-5 in conjunction with the International Meeting on Emerging Diseases and Surveillance in Vienna. Its focus is on bringing together data about infectious disease outbreaks and climate change. I am in contact with the hackathon organizers, and we are exploring how Wikimedia-focused activities (particularly around Wikidata) might fit in there. If that resonates with you, please let me know. --Daniel Mietchen (talk) 23:08, 25 August 2016 (UTC)

Data model for trials[edit]

We now have Identifier (P3098) but not yet many items that use it, so I think it's a good time to think about which statements/claims/references to add by default or under specific circumstances. I suggest to use Study of GLS-5700 in Dengue Virus Seropositive Adults (Q26762063) as a showcase item, perhaps together with items about trials of a different nature.

Doc James
Daniel Mietchen
Andrew Su
Projekt ANA
Pavel Dušek
Was a bee
Chris Mungall
Dr. Abhijeet Safai
Pictogram voting comment.svg Notified participants of WikiProject Medicine. --Daniel Mietchen (talk) 20:26, 2 September 2016 (UTC)

@Daniel Mietchen: I do not feel strongly about which one might be a model, but I worked on PARAMOUNT trial (Q17148583) as an interesting case. I got a CC license to the informed consent document for that trail and so far as I know, it is the only freely licensed consent document in existence. I cannot speak to how interesting that trial itself might be, but I developed the Wikipedia article for it to the extent that I could. Blue Rasberry (talk) 20:38, 2 September 2016 (UTC)
We probably need a number of showcase items, not just one. In any case, I have started Wikidata:WikiProject Medicine/Data models/Trials to get things going. --Daniel Mietchen (talk) 02:03, 4 September 2016 (UTC)

syndrome (Q179630) and no label (Q18971517)[edit]

What's the difference between syndrome (Q179630) and no label (Q18971517)? Why is no label (Q18971517) a Wikimedia permanent duplicated page (Q21286738)? --Okkn (talk) 06:32, 4 September 2016 (UTC)

No idea either. @Genderforschung, Andrawaag, Sebotic: any thoughts? --Daniel Mietchen (talk) 00:42, 6 October 2016 (UTC)
@Daniel Mietchen, Okkn, Genderforschung: I don't see a reason why not to merge them, especially because the claim on one that it is a 'physiological condition' is not references at all. Furthermore, merging would benefit integration of Disease Ontology into Wikidata and association with Wikipedia articles. Sebotic (talk) 07:39, 6 October 2016 (UTC)

Defined daily dose / Price[edit]

Defined daily dose is a property used by WHO to compare medications. This ref gives DDD.[15] for a bunch of medications. Can we add the DDD by bot from this site to Wikidata with this site as the reference? Maybe we could pull the pricing info aswell? The prices listed generally represent the wholesale price in the developing world. We could add both in the same bot run. Doc James (talk · contribs · email) (if I write on your page reply on mine) 09:40, 6 September 2016 (UTC)

We also have [16]. I am not sure which will be easier to pull from. Doc James (talk · contribs · email) (if I write on your page reply on mine) 09:49, 6 September 2016 (UTC)
Proposed here Doc James (talk · contribs · email) (if I write on your page reply on mine) 13:33, 16 September 2017 (UTC)

How do I mark that in some people Rectus abdominis muscle is innervated by the Sixth intercostal nerve while in another it isn't[edit]

I take information about innervation from Anatomy and Human Movement Structure and Function SIXTH EDITION (Q27050364). It tells me that in sometimes rectus abdominis muscle (Q275150) is innervated by Sixth intercostal nerve (Q27058097) while in other cases it isn't. How do I model this? ChristianKl (talk) 13:46, 2 October 2016 (UTC)

Chart outlining which antibiotics to use for which bacterial infections[edit]

Screenshot of one version of the chart

There is an interesting discussion over at enwiki about how to represent information about which antibiotics to use for which bacterial infections, with a sideline of the discussion touching upon whether and how Wikidata could be used for such purposes. Worth a look! --Daniel Mietchen (talk) 13:25, 4 October 2016 (UTC)

Data model for vaccines[edit]

I started a discussion on that over at Talk:Q134808, i.e. the talk page of vaccine (Q134808). --Daniel Mietchen (talk) 00:36, 6 October 2016 (UTC)


With data quality being discussed here Doc James (talk · contribs · email) (if I write on your page reply on mine) 20:32, 11 December 2016 (UTC)

By the way who is going to fix all the issues created by this bot run?[17]
User:ProteinBoxBot/User:Andrew Su maybe move that parameter form "drug used for treatment" to "drug studied in" because that is what the ref supports.
20:56, 12 December 2016 (UTC)
I've asked at the bot's talk page. Jytdog (talk) 21:45, 12 December 2016 (UTC)

Keep an eye on the top 1000 medical articles[edit]

Would someone be so kind as to watchlist the 1,000 top viewed medical articles on EN Wikipedia here? This sort of stuff is popping up[18]. Doc James (talk · contribs · email) (if I write on your page reply on mine) 02:57, 23 January 2017 (UTC)

Property suggestion for systematic review register[edit]

I would like to inform you about property suggestion I made, Wikidata:Property proposal/PROSPERO, that is relevant for medicine. It is an identifier in an online database of systematic reviews. — Finn Årup Nielsen (fnielsen) (talk) 20:27, 12 February 2017 (UTC)

Origin and insertion[edit]

Hello! I'm working in eu:Txantiloi:Anatomia_infotaula and I wonder which are the correct "origin" and "insertion" properties for muscles and bones so we can get this data automagically. It seems that we can only use connects with (P2789) for articulations, but we can't insert the data of where biceps brachii (Q201363) is inserted. Could you help me with this? -Theklan (talk) 12:05, 7 March 2017 (UTC)

We have muscle insertion (P3491) and muscle origin (P3490). ChristianKl (talk) 12:48, 7 March 2017 (UTC)
Thanks ChristianKl (talkcontribslogs)! Maybe they must be added in Wikidata:WikiProject_Medicine/Properties and in the medical template! And worse... I had them in the template! Shame! Now... I wonder if they can be used also in bones. I mean, triceps surae (Q431282) muscle insertion (P3491) > calcaneus (Q13075). So can we add just muscle insertion (P3491) > triceps surae (Q431282) in calcaneus (Q13075)? -Theklan (talk) 20:02, 7 March 2017 (UTC)
I added them to the list. I think there should be a way to display the inverse of "muscle insertion" in a template without adding the data in both direction. Unfortunately I'm not skilled at writing the templates and knowing the formatting code. ChristianKl (talk) 05:37, 8 March 2017 (UTC)

World Health Organization list of essential medicines[edit]

The Wikipedia community has had conversations about collaborating with the World Health Organization since at least 2012, when the WHO sent some people to Wikimania. It is an important relationship and also a model for collaboration between Wikipedia and other health organizations. Recently the World Health Organization applied a Wikipedia-compatible copyright license to their World Health Organization Model List of Essential Medicines (Q37155). This is a list of about 400 medicines and Wikipedia's coverage of each of these drugs is important, as is the precedent of getting the copyright to a list and managing it effectively.

Does anyone have ideas for what sort of property would be appropriate for indicating that a particular drug is on the "WHO essential" list? I was thinking of something like "member of list". Something that is sort of close is award received (P166), but this is not an award, and more of a designation in a particular version of a list. Different countries have their own essential medicine lists so it would be nice to have a property which be used in various situations. Thoughts from anyone? Blue Rasberry (talk) 18:20, 13 June 2017 (UTC)

catalog (P972) seems to me to be the most fitting. Afterwards there's collection (P195). ChristianKl (talk) 18:24, 13 June 2017 (UTC) S
@ChristianKl: I used catalog at metformin (Q19484). How does that look to you? Thanks for the suggestion. Blue Rasberry (talk) 14:27, 14 June 2017 (UTC)
I think this example looks well. I'm not sure about cases where different countries have different essential medicine lists. For that case it might be better to have items like "WHO essential medicine list for Ethopia" that lists the drugs it contains with has part (P527). ChristianKl (talk) 14:41, 14 June 2017 (UTC)

Restriction Enzymes[edit]

I proposed new properties to describe the restriction enzymes, about three months ago. Would you please comment on here? Thanks, --Okkn (talk) 17:47, 15 June 2017 (UTC)

Soliciting suggestions of new data sources[edit]

Dear all, we on the Gene Wiki / ProteinBoxBot team are doing some planning and prioritization of future biomedical data sets to load, and we'd like to solicit suggestions from the broader Wikidata community. Historically, the scope of our bot loading effort has revolved around genes, proteins, drugs, diseases, and microbes. And more recently we've also helped related groups load data on genetic variants and pathways. We would welcome suggestions of either other related entity types that should be systematically loaded, or data sources that describe relationships between these entity types. Obviously, availability of a high-quality, CC0-licensed data source is essential. Please let us know if you have any suggestions. (Cross posting to WD:MB, WD:MED, and Wikidata:WikiProject_Chemistry.) Best, Andrew Su (talk) 20:03, 23 June 2017 (UTC)

The nutrient content of various types of foods that's stored in the USDA Food Composition Databases provides interesting data. ChristianKl (talk) 22:57, 23 June 2017 (UTC)
@ChristianKl: USDA Food Composition sounds promising. (Incendentally, reminds me of But it would be most compelling if those records were linked with other data sources (and for our grant proposal, ideally if those data sources drifted in the biomedical direction). Are you aware of any such sources? Best, Andrew Su (talk) 17:12, 26 June 2017 (UTC)
There are two links that come to mind. On the one hand, there are RDI and RDA values. Historically those values changed and EU and US authorities publish different values.
From a global health perspective, it could be very useful to have data about vitamin content of different foods, RDI and RDA in Wikidata as that allows people in third world countries to access that information.
On the other hand, you have taxons. Wikidata already organizes taxons in a phylogenetic tree. This information could be useful for determining where in the history of when plant mutations happened, that drastically changed the Vitamin concentration of the plant. There might also be data that links different genes to different plants. ChristianKl (talk) 18:24, 26 June 2017 (UTC)
TA98 data would be interesting to import . Their website says the data is free but they are not explicit about the license. It might be worth to ask them. ChristianKl (talk) 19:01, 24 June 2017 (UTC)

Infection versus disease[edit]

In ontological terms, is a human papillomavirus infection (Q184627) an infection or an infectious disease or both? The identifiers used seem to treat the item as a disease, yet it marked as an instance of an infection. Furthermore, a separate papillomavirus infectious disease (Q18966672) item was created, and is referred to by a number of other items. Lowren160 (talk) 18:01, 6 August 2017 (UTC)

Incidence and prevalence[edit]

We have properties for this already:



We have some incidences for 2005 and 2015 here

And we have some prevalences for 2005 and 2015 here and

Doc James (talk · contribs · email) (if I write on your page reply on mine) 07:30, 31 August 2017 (UTC)

New Properties for Anatomical Structures[edit]

Hello everyone,

I have proposed new proparties to describe the relationship between anatomical structures:

At the moment, for example, we can't create links between pericardium (Q193302) and heart (Q1072) or joint (Q9644) and bone (Q265868). Would you please support us to create these essential properties? I would appreciate your cooperation. --Okkn (talk) 10:15, 16 September 2017 (UTC)


I know LOINC is used in at least a couple medical infoboxes (diagnostic and medical intervention) on English Wikipedia, so I was surprised when I couldn't find a property for it. Is there a reason it's not included, or has it just not been proposed yet? —ShelleyAdams (talk) 01:17, 24 September 2017 (UTC)

Created this proposal yesterday. —ShelleyAdams (talk) 13:48, 27 September 2017 (UTC)

Change Detector & Diseases[edit]

We have been working on a change detector which monitors items and generates a report of all changes over a certain time period. The change detector is very generic and can be run on any set of items over any time period. As a prototype, it has been running on a schedule now generating a monthly report of all changes to all disease items. It is running under a Jenkins system located here. You can see the latest report under the "Last Successful Artifacts" link (or for October). This is a work in progress. You can post suggestions here or on our issue tracker.

There are three tabs. The first sheet ("changes") lists everything. The "changes_filtered" lists the same information minus edits made by ProteinBoxBot and KrBot, and the "labels" sheet lists label, description, or alias changes. Each row represents one change on one statement on one item. The change can either be "ADD" or "REMOVE" (the change_type column). The qid/qid_label columns are the item the change was made to (the subject), the pid/pid_label is the property/predicate, the value/value_label is the value/object. If the statement had a reference, its in the ref_str column. Just a note, I didn't parse changes in reference statements, it is just pulling the reference if the value of the statement changes.

There has been lots of activity on diseases in the past month! There were 253 edits (not including ProteinBoxBot or KrBot) from 42 unique users, the most active being Netha Hussain, Jmarchn, Andreasmperu, Diptanshu Das, JhealdBatch. There were statements edited from 38 different properties. However, almost none of the statements contain any references. It is very important to add references so the data can be checked, updated, queried and reused effectively. Please add references. Thanks! Gstupp (talk) 21:09, 3 October 2017 (UTC)

Drug categories[edit]

Hello all, I modeled drug/chemical compound categories for 3 drugs (chlorhexidine (Q15646788), givinostat (Q426257), ansamitomicin p-3 (Q27110111)) (subclass of (P279), or 'is a' OBO edge type). All of these categories are either attached to classes in the ChEBI and/or NCI thesaurus ontologies. I post this here to get your ideas/feedback on it. Thanks, Sebotic (talk) 20:26, 9 October 2017 (UTC)