User talk:ArthurPSmith

From Wikidata
Jump to: navigation, search
Logo of Wikidata

Welcome to Wikidata, ArthurPSmith!

Wikidata is a free knowledge base that you can edit! It can be read and edited by humans and machines alike and you can go to any item page now and add to this ever-growing database!

Need some help getting started? Here are some pages you can familiarize yourself with:

  • Introduction – An introduction to the project.
  • Wikidata tours – Interactive tutorials to show you how Wikidata works.
  • Community portal – The portal for community members.
  • User options – including the 'Babel' extension, to set your language preferences.
  • Contents – The main help page for editing and using the site.
  • Project chat – Discussions about the project.
  • Tools – A collection of user-developed tools to allow for easier completion of some tasks.

Please remember to sign your messages on talk pages by typing four tildes (~~~~); this will automatically insert your username and the date.

If you have any questions, please ask me on my talk page. If you want to try out editing, you can use the sandbox to try. Once again, welcome, and I hope you quickly feel comfortable here, and become an active editor for Wikidata.

Best regards! Liuxinyu970226 (talk) 09:07, 25 August 2015 (UTC)

Pywikibot for half-life claims[edit]

Hi! I am just writing a tutorial script on how to add quantities and units to Wikidata. It would probably be easy to adapt the script to add the NNDC values. Will post it a little later today to Wikidata:Pywikibot - Python 3 Tutorial. --Tobias1984 (talk) 14:52, 14 October 2015 (UTC)

I posted the example, but there might still be a problem with the pywikibot-api (Wikidata:Contact_the_development_team#Pywikibot:_Float_numbers). Will add more text and explanations to the example tomorrow. --Tobias1984 (talk) 21:23, 14 October 2015 (UTC)
Thanks for that example - I had been working on the other end (automated extraction of the data from the nndc page) so this will tie it together nicely. Does this pywikibot version require Python 3 or would it work in 2.7 also? ArthurPSmith (talk) 13:24, 15 October 2015 (UTC)
I ran the script against pywikibot core (See: Wikidata:Pywikibot - Python 3 Tutorial/Setting up Shop). According to this page it should work with Python 2.7, but I think you need to import from __future__ import print_function, unicode_literals in the first line of the script so you don't have to change the print-functions and don't need to put u"something" in front of every string. --Tobias1984 (talk) 14:13, 15 October 2015 (UTC)
But I think we should wait for the (Wikidata:Contact_the_development_team#Pywikibot:_Float_numbers) issue to be resolved. It causes some really human-unfriendly numbers and diffs. --Tobias1984 (talk) 16:58, 15 October 2015 (UTC)

Noble gas[edit]

You reverted my edit on Q1307. You are right, but the czech translation is wrong. How could I edit czech translation of noble gas in this section?

Hi @Dvorapa: - looks like you figured out what to do, it's just a matter of editing the language labels on the target item, group 18 (Q19609). ArthurPSmith (talk) 01:54, 27 November 2015 (UTC)
Yeah, I figured it out finally. --Dvorapa (talk) 21:43, 5 December 2015 (UTC)

Property creator[edit]

You've just been granted the right of Property Creator. Good luck with it. And I hope you will use it wisely. New properties can be created at Special:NewProperty. Most important step is to choose the right datatype as it can't be altered afterwards. If you have any questions feel free to as another property creator or an admin with experience on creating properties. Mbch331 (talk) 21:27, 26 November 2015 (UTC)

Thanks! I really wasn't sure if I had enough experience for this, so I'm grateful for the trust. I will definitely be very cautious to start with while I'm still learning how things work here! ArthurPSmith (talk) 02:05, 27 November 2015 (UTC)

GRID mappings[edit]


Missed the completion of adding GRID ids to wikidata and you'd asked me a question which was then cleared out I think, sorry about that. Great to see it added!

Yeah, mapping things by name is a real pain, it's part of why we built a tool for mapping unstructured text to ids. There's some good examples of issues here:

All the added data to the db is found or checked manually, so we might be able to add some but it'll depend on time/speed/cost. A list of potential things to add (e.g. things marked as universities in wikidata) would be a great start.

Although not all the metadata is filled in (wikipedia URLs, ISNI, etc) we've focused on the institutes which have the most scientific output as much as we can, so hopefully the top 1/4 with more automatically mappable IDs should represent the most 'important' ones to link up.

I'm always up for more of a chat about this, if you drop me a line at and we'll see how we can help out.

Ok, I'll probably be in touch! I have started adding the grid id's for institutions in your data with wikipedia references via the Quick Statements tool. Note there's a few wikipedia entries in your data that don't look right: some have an additional '#' linking to a portion of the wikipedia page (not helpful for identifying!) - for example 'Nobel Foundation'. And at least one ('American Medical Association') has an extra space at the end of the wikipedia string link. This is from the 12-14-2015 dataset. ArthurPSmith (talk) 17:08, 6 January 2016 (UTC)
Great! I'll have a look at the wikipedia page urls with anchors in. Thanks for the note about trailing spaces, it's an issue that pops up in some of the fields occasionally, I'll look at getting that fixed for the next release. If you have any lists of problems or possible issues then we can turn that into review tasks to check the data is right. For things without a wikipedia url I think there might be some more we can help with on our side rather than doing some fuzzy matching on names. IanCalvertDsci (talk) 10:59, 8 January 2016 (UTC)
By the way I completed the import from the latest dump based on wikipedia URL's - almost 10,500 entries. I'm looking at the ISNI values now. Those are also a little inconsistent - most are in standard ISNI format (4 groups of 4 digits separated by spaces) but a few of them are missing the spaces, some are missing leading zeros (I assume) and some have some extra space characters. So I need to normalize a little before doing a search. ArthurPSmith (talk) 16:46, 8 January 2016 (UTC)
Ha! I did a bunch of work on ISNI, then thought to run the query to see how many organizations actually have ISNI entries in wikidata: see http:// - only 245! So it will actually not help much with matching. Oh well. It could help with a bit of cross-checking between the two cases though. Also I'm starting to look at constraint issues - there are a bunch of duplicates in GRID! Will email you. ArthurPSmith (talk) 21:53, 8 January 2016 (UTC)
oh - actually there were a few more, see this query - - including subclasses just for university there are 440. Still it seems to be a small fraction of the actual assigned ISNI's for organizations. The vast majority of the ones in your list don't seem to match (by ISNI) anything currently in wikidata. ArthurPSmith (talk) 22:02, 8 January 2016 (UTC)

is a list of (P360) for categories[edit]

Thank you for belatedly returning to that RfC.

However, having thought about it since, given that there was (some) opposition to using is a list of (P360) on categories, perhaps a neater solution might be instead to introduce a parallel property to be specifically for categories, eg "category contains", but using the same syntax.

That way people could query specifically either for categories or lists with this kind of specification, without having to then filter their results as to whether the corresponding item was a category or a list.

What do you think? Jheald (talk) 18:58, 15 April 2016 (UTC)

Hi Jheald - A separate property would be ok with me, but we'd have to fix a lot of existing uses. I don't understand your example use case though - if you were doing a query to find all items that had the property is a list of (P360) xxx item with yyyy qualifier, wouldn't you want to see both categories and lists? If it was really important to limit to one or the other type you can always do that with an additional instance of (P31) criterion. As far as the label on the property being confusing in the case of categories, maybe we should add a "wikidata usage instructions" statement to explain what it means in the category case? ArthurPSmith (talk) 19:39, 15 April 2016 (UTC)


exact match (P2888) is ready. --Tobias1984 (talk) 19:30, 5 June 2016 (UTC)

Pywikibot badge[edit]

I moved the badges to the general user namespace. So it is {{User Pywikibot advanced}}. --Tobias1984 (talk) 16:38, 11 July 2016 (UTC)

Thanks! ArthurPSmith (talk) 18:08, 11 July 2016 (UTC)


Hi Arthur, you might be interested in the issue raised here. Thanks. --Gikü (talk) 17:38, 16 August 2016 (UTC)

Thanks, I left a note on the property proposer's talk page. ArthurPSmith (talk) 18:36, 16 August 2016 (UTC)


Hello ArthurPSmith, thank you for creating this property. I think it should only be used as a qualifier for academic degree (P512). Could you please add that to the property properties? Jonathan Groß (talk) 16:04, 1 November 2016 (UTC)

  • @Jonathan Groß: - thanks, I hadn't remembered that it was planned to be used only as a qualifier. I adjusted the instance of (P31) statement; if there are other changes you think needed you should be able to make them yourself now that the property exists. ArthurPSmith (talk) 18:05, 1 November 2016 (UTC)

wikidata-externalid-url tool[edit]

Hi! It seems like there is an issue regarding the wikidata-externalid-url tool and the property IMDb ID (P345). See example. Regards —Cocu— t 20:13, 15 November 2016 (UTC)

  • Thanks @Cocu: investigating right now. I know there was some maintenance going on on the tools server but it looks like it's been down for over a day now. ArthurPSmith (talk) 20:27, 15 November 2016 (UTC)
  • Huh, I thought they would restart tools automatically - I guess not. It's back up now. I'll have to pay more attention to those downtime messages I guess! ArthurPSmith (talk) 20:29, 15 November 2016 (UTC)

Enciclopedia Treccani[edit]

Hello ArthurPSmith. I stumbled upon Enciclopedia Treccani ID (P3365) today. How is it different from Dizionario Biografico degli Italiani (P1986)? Jonathan Groß (talk) 10:38, 24 November 2016 (UTC)

As I understand it from the property proposal discussion the dictionary is a part of the encyclopedia. ArthurPSmith (talk) 12:10, 24 November 2016 (UTC)

That seems right. I'm still wondering how the two properties should be used, respectively. Jonathan Groß (talk) 09:09, 26 November 2016 (UTC)

Share your experience and feedback as a Wikimedian in this global survey[edit]

Periodic table[edit]

Are you the maintainer of It looks like someone has broken it. :( - Nikki (talk) 15:26, 14 January 2017 (UTC)

@Nikki: - actually Ricordisamoa is the maintainer, but I've helped and know a little about it. But it looks ok to me right now, what did you see that was broken? ArthurPSmith (talk) 23:29, 14 January 2017 (UTC)
Oh - somebody messed around with magnesium (Q660). Can you figure out the problem from the history? ArthurPSmith (talk) 23:32, 14 January 2017 (UTC)
@Nikki: Please see Wikidata talk:WikiProject Chemistry#Elements and periods --Ricordisamoa 23:33, 14 January 2017 (UTC)
@Nikki, Ricordisamoa: well for now I reinstated the subclass of (P279) relationship and the ptable app looks fine again. I guess if there is a firm decision to switch to "part of" we should adjust the app... ArthurPSmith (talk) 13:26, 17 January 2017 (UTC)
The query could be changed to support both subclass of (P279) and part of (P361), but I'd prefer the ontology, of which the Chemistry WikiProject shall be the authoritative source, to stay consistent :) --Ricordisamoa 01:46, 18 January 2017 (UTC)

Defining formula[edit]

Thank you for your valuable feedback on my edits regarding the defining formula. The reason why I am adding those formulae to Wikidata is that it is a part of my master's thesis in which I add mathematical formulae to Wikidata which are extracted from English Wikipedia and then based on these formulae I build a Question and answer system. This system will allow users to ask mathematical questions in English and Hindi, and based on those questions, the system will retrieve the related mathematical formula from Wikidata by parsing the natural language input and allow users to get a calculated result for their desired values. You guessed it correctly that I extracted the first formula from all the mathematics related articles on Wikipedia which is obviously not the most accurate way of getting correct results. However, the results are accurate enough to be uploaded to Wikidata. Since I was working on my thesis in the Wikimedia office in Berlin for one month, I showed the results to them and the community was satisfied with the results and gave me the permission to upload them into Wikidata. Furthermore, this sub-task of the whole project is very important because I need these formulae to be present in Wikidata so that they can be used by the question and answer system. I will try my best to eliminate most of the false results like muon lifetime and hope that my contribution to Wikidata can be more useful for further advancements. I will once again have a look at the formulae list to do the necessary amendments and then I will continue with the uploading.

@Yash nagar: you might want to filter your items based on their instance of (P31) and subclass of (P279) relationships. For example, nothing that is an instance or subclass of physical object (Q223557) should have a "defining formula". ArthurPSmith (talk) 15:10, 17 January 2017 (UTC)
@ArthurPSmith: Thank you very much for the suggestion and a good example to start with. I will definitely do the changes and I apologize for the errors I made earlier.

Importing identifiers from GRID[edit]


Congratulations for your work of aligning GRID with Wikidata, it looks very useful! I have noticed that GRID contains a lot of other identifiers, such as ISNI ids. It seems to me that many of these ids have not been imported in Wikidata. Is this a deliberate choice or due to a lack of time / resources? Do we need to wait for GRID to be entirely matched before that? It seems fairly easy to do with QuickStatements but I wanted to check with you before that. I was thinking about importing the following (only for items that currently have a GRID id):

What do you think? − Pintoch (talk) 22:51, 30 January 2017 (UTC)

@Pintoch: Go ahead! I was planning on getting to that especially now that GRID is itself provided under CC-0, but it would be great to have help in getting this data imported! If you can, please add references to the GRID release you are using for the information. ArthurPSmith (talk) 15:42, 31 January 2017 (UTC)
Okay, my QuickStatements are ready (with references), I will perform them after a few checks. − Pintoch (talk) 20:02, 1 February 2017 (UTC)

Thanks for adding all these statements to so many institutions, it really helped! The constraint violation report for Ringgold identifiers just got a lot smaller. − Pintoch (talk) 16:37, 3 March 2017 (UTC)

Ah, it was my attempt to whittle down the constraint violations list for GRID itself - glad to help! ArthurPSmith (talk) 16:42, 3 March 2017 (UTC)

Please tell your bot to stop adding GRIDs of subsidiaries to parent companies (like Microsoft (Q2283), BASF (Q9401)), as we already agreed on and revert those incorrect adds. Thanks. --Jklamo (talk) 22:31, 18 May 2017 (UTC)

It's not running now, I was just running it this afternoon based on some analysis run over the last few weeks. However, I don't think these additions were incorrect - they were based on the URL's and wikipedia links provided by GRID - GRID itself is pointing to these wikidata QID's in one way or another. I think GRID has gone a bit overboard with these separate id's for each country a company has some presence in, and it would be better to merge the GRID id's. If that's not happening then these duplicates are still a good source for cases where we do want to split them up into separate items (if that's what we want). ArthurPSmith (talk) 23:22, 18 May 2017 (UTC)
@Jklamo: I am happy to do the splitting. I have a script for that. I doubt we can convince GRID to merge these identifiers, as this is consistent with their policy. I think it is still useful to have items for each national subsidiary, because there are quite a few claims that are specific to the local subsidiary (headquarters location, website sometimes, identifier in a national company database, leadership, inception date…) − Pintoch (talk) 17:11, 19 May 2017 (UTC)
Thanks Pintoch  ! ArthurPSmith (talk) 17:40, 19 May 2017 (UTC)
By the way, still no sign of their May release - maybe we have to wait until June to get the reported duplicates sorted out? ArthurPSmith (talk) 17:42, 19 May 2017 (UTC)
GRID is not always correct with the URL to wikis, just do not relay on them. Approach having different identifier for different entities is absolutely correct, problem is just Wikidata sometime do not have items for subsidiaries. Adding GRID indentifiers of subsidiaries to parent company item is simply wrong, they are not matching and only one correct identifier is obscured by these.--Jklamo (talk) 18:33, 20 May 2017 (UTC)
@Jklamo: I agree with you that we do not want these ids to end up on the same item. But the good thing is that we have constraint violation reports for that, and the fix can be automated. You can see what the process looks like on GlaxoSmithKline (Q212322) (history). APSbot added a bunch of GRID ID (P2427) in December 2016, and I moved them to separate items last month. Ideally that could be done directly by APSbot but creating items with bots can be a bit dangerous sometimes. Putting the ids in Wikidata is already useful: otherwise, we do not even know they exist. − Pintoch (talk) 11:47, 21 May 2017 (UTC)

Your feedback matters: Final reminder to take the global Wikimedia survey[edit]

(Sorry to write in Engilsh)

Formatting URLs[edit]

Hello there. I think I need your help to have UltraSignup runner ID (P3585) work, possibly through this. Can you have a look? Thierry Caro (talk) 14:13, 6 March 2017 (UTC)

Happy to help - what should a working URL look like, do you have an example? ArthurPSmith (talk) 16:40, 6 March 2017 (UTC)
@Thierry Caro: - just pinging you in case you didn't see my query, do you have more details on what's needed? ArthurPSmith (talk) 15:04, 7 March 2017 (UTC)
Thank you. A working URL for Ruby Muir (Q20801602) is Unfortunately, with formatter URL (P1630) set to$1 and Muir's ID set to fname=Ruby&lname=Muir, as they should be, the URL is automatically transformed into, which does not lead anywhere. Thierry Caro (talk) 15:10, 7 March 2017 (UTC)
Ah, ok, URL encoding issues. Ok, I'll look into this today! ArthurPSmith (talk) 15:18, 7 March 2017 (UTC)
@Thierry Caro: it's working now! ArthurPSmith (talk) 18:33, 7 March 2017 (UTC)
Awesome. Unfortunately it still does not work in fr:Modèle:Bases sport, but whatever! At least it works here! I may have another request of the same type then. Europeana Fashion creator ID (P3482) works fine with one-word IDs like Carven for carven (Q2940704) but it fails to provide a correct link when you have to use something like Academy of Art University for Academy of Art University (Q2822225). Spaces are automatically converted into something like, but should not. Thierry Caro (talk) 18:56, 7 March 2017 (UTC)
The problem was the '#' character, which never gets passed to the externalid-url service. You have to encode that too as %23 - I just updated the formatter URL for Europeana Fashion creator ID (P3482) and it should work now. ArthurPSmith (talk) 19:53, 7 March 2017 (UTC)
Perfect. Thank you very much for your diligence. Thierry Caro (talk) 20:19, 7 March 2017 (UTC)

WikiProject Universities[edit]

Logo for the Wikidata Wikiproject Universities


There are a few of us who seem to be interested in improving the coverage of academic institutions in Wikidata, so I have started the WikiProject Universities to coordinate our efforts. I'd like to make a sort of dashboard of the progress on various tasks, which would help us find out what is a good use of our time. (For instance by avoiding to add manually some stuff that can be imported from some database.) Feel free to join!

Cheers − Pintoch (talk) 08:49, 7 March 2017 (UTC)

Thanks, looks great, I have a couple of ideas... ArthurPSmith (talk) 15:04, 7 March 2017 (UTC)

Wikidata ExternalId URL[edit]

Please respond to phab:T160205 otherwise I'll go ahead and request the removal of this tool. Also pinging @Jura1, Pigsonthewing, Lydia Pintscher (WMDE): —Dispenser (talk) 18:49, 21 March 2017 (UTC)

@Dispenser: I've responded, however I'm not sure what you are asking for. Privacy issues for this were discussed previously on project chat. Tool labs hides all incoming IP addresses via a proxy. ArthurPSmith (talk) 18:57, 21 March 2017 (UTC)
The privacy policy does not allow WMF domain to third-party domain redirects. You can add an interstitial (like Special:GoToInterwiki/google:) or host on a non-WMF server. Dispenser (talk) 19:47, 21 March 2017 (UTC)
Step through it for me, I'm not following. What is the "third-party domain redirect" here? ArthurPSmith (talk) 19:51, 21 March 2017 (UTC)
Actually I just commented on the Phab ticket, I think I see what you are saying. Please continue discussion there. ArthurPSmith (talk) 20:00, 21 March 2017 (UTC)


What do you think of the proposal ? I think it's worthwhile to create it but it seems stalled and given the votes that are there, I think it would be good if at least another person supports it before creation. ChristianKl (talk) 21:51, 11 May 2017 (UTC)

@ChristianKl: I've looked at it but I really have no knowledge of medicine or anatomy or the quality of the reference in question, nor have I really looked at how this has been done in related cases, so I don't feel qualified to comment, sorry! ArthurPSmith (talk) 14:29, 12 May 2017 (UTC)