Wikidata:Requests for permissions/Bot/MuBot
From Wikidata
Jump to navigation
Jump to search
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Not done, lack of progress. Please resubmit in case you are working on it again. Vogone (talk) 15:45, 25 March 2015 (UTC)[reply]
MuBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Mushroom (talk • contribs • logs)
Task/s: Add IMDb ID (P345) to items having occupation (P106): actor (Q33999), or film actor (Q10800557), or voice actor (Q2405480).
Function details: The bot will:
- run on items from this list (currently 34654)
- search through linked Wikipedia articles for an {{imdb}} template
- extract the ID from the template and add it to the item
- add instance of (P31): human (Q5) when missing
- ignore items that already have an IMDb ID (P345)
- ignore items with date of birth (P569) < 1800 or date of death (P570) < 1880
- ignore items that are marked as twins (Q14756018), duo (Q10648343), or double act (Q1141470)
The bot made a dry run on the first 100 items with no errors. Mushroom (talk) 18:39, 24 April 2014 (UTC)[reply]
- Support but:
- please check as many Wikipedia articles as possible, since most IMDb-type templates are not very reliable: my bot made lots of mistakes some time ago;
- make sure that the item is actually about a human (Q5), even if the item does not have instance of (P31) yet: this can be easily checked on Wikipedia or on external databases;
- strive to add the most reliable and free sources you can find;
- make as few edits as possible on Wikidata (also adding several statements at once), either by calling the wbeditentity API directly, or by using an advanced framework.
- --Ricordisamoa 14:29, 25 April 2014 (UTC)[reply]
- @Ricordisamoa: Thanks for the support and suggestions. I have modified the bot to always look at multiple Wikipedias and added a few checks to make sure the item is about a human (Q5). The number of edits shouldn't be a problem since I'm using item.editEntity() from pywikibot core. What about the sources though? Right now the bot just adds imported from Wikimedia project (P143): [language] Wikipedia. Is there a reliable external database I can query and use as source? Mushroom (talk) 14:12, 26 April 2014 (UTC)[reply]
- The Echo extension requires the ping being inserted in the same edit as the signature to work :-) pywikibot/core's editEntity allows setting arbitrary data (even sources) but you would have to build the JSON data manually, so I don't make it a requirement for now; data in the RDF/XML (Q48940) format by Integrated Authority File (Q36578) under Creative Commons CC0 License (Q6938433) are probably a good source for instance of (P31)human (Q5). --Ricordisamoa 01:58, 28 April 2014 (UTC)[reply]
- @Ricordisamoa: Oh I see, that's why it wasn't working :D Ok I will use the GND for instance of (P31)human (Q5). What I was wondering, though, is: should I find a proper source for statement IMDb ID (P345)ID or is imported from Wikimedia project (P143) enough? Because unfortunately I can't find any authority control source linking to IMDb (except for Freebase, but I don't think it's reliable). Mushroom (talk) 10:40, 4 May 2014 (UTC)[reply]
- I found many articles in fr:Catégorie:Actrice suisse that had fr:Modèle:Imdb nom, but no IMDb ID (P345) on Wikidata. Rather than checking as many languages as possible, focusing on categories and templates of a given language might improve results. --- Jura 06:55, 17 May 2014 (UTC)[reply]
- Thanks Jura, unfortunately I have been very busy so I had to stop working on the bot. I think I will follow your suggestion and limit its scope for now, so it will be easier to manage. Mushroom (talk) 17:06, 18 May 2014 (UTC)[reply]
- @Bene*, Vogone, Legoktm, Ymblanter: Any 'crat to comment?--GZWDer (talk) 10:52, 11 August 2014 (UTC)[reply]
- What is the situation with the bot right now?--Ymblanter (talk) 07:08, 14 August 2014 (UTC)[reply]
- Unfortunately I have not had time to update and operate the bot because I am too busy in real life, so please consider this request suspended for now. Hopefully I will have more time in a few months. Mushroom (talk) 09:39, 16 August 2014 (UTC)[reply]
- What is the situation with the bot right now?--Ymblanter (talk) 07:08, 14 August 2014 (UTC)[reply]