Wikidata:Requests for permissions/Bot/AliciaFagervingWMSE-bot 15
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved --Ymblanter (talk) 13:08, 25 November 2017 (UTC)
AliciaFagervingWMSE-bot 15
AliciaFagervingWMSE-bot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Alicia Fagerving (WMSE) (talk • contribs • logs)
Task/s: The function of the bot is to import data about immovable cultural heritage to Wikidata as part of Wikimedia Sverige's Connected Open Heritage Project.
This request is for data about cultural heritage monuments in Serbia from the Wiki Loves Monuments Database.
Code: The bot uses Python and the Pywikibot framework. The code is available on GitHub: Framework, Specific table processing script
Function details:
The bot processes data from the Wiki Loves Monuments Database, in this case the rs (Serbia) table, which covers about 2,300 items. They will all use cultural heritage monument in Serbia ID (P4245) as an identifier.
Below you can see:
- The mappings: Wikidata:WikiProject WLM/Mapping tables/rs_(sr)
- An example of how the data is translated into Wikidata properties: Wikidata:WikiProject WLM/Mapping tables/rs_(sr)/preview
- Matched items which will be updated: Wikidata:WikiProject_WLM/Mapping_tables/rs_(sr)/matches
Some test edits have been made: Q16085401, Q3280273, Q20434834, Q42841509, Q42841499.
Ping @André Costa (WMSE): -- Alicia Fagerving (WMSE) (talk) 07:38, 10 November 2017 (UTC)
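As a rough illustration of the translation step described in the function details, the sketch below turns one table row into a label and a P4245 identifier statement. It is not the bot's actual code: the row keys `id` and `name`, the helper `row_to_item_data`, and the sample values are all hypothetical, and the real column names are defined in the mapping tables linked above.

```python
# Illustrative sketch only. The row keys ("id", "name") and the sample
# values are hypothetical, not the real columns of the WLM database.
HERITAGE_ID_PROP = "P4245"  # cultural heritage monument in Serbia ID


def row_to_item_data(row, label_lang="sr"):
    """Translate one monument row into a plain description of an item."""
    monument_id = row["id"].strip()
    if not monument_id:
        # Without an identifier the row cannot be matched or imported.
        raise ValueError("row has no monument identifier")
    return {
        "labels": {label_lang: row["name"].strip()},
        "claims": {HERITAGE_ID_PROP: [monument_id]},
    }


# Made-up example row, for demonstration only.
example = row_to_item_data({"id": " SK 123 ", "name": "Пример"})
```

The label language is a parameter here so that a different code (for example the "sr-ec" suggested in the discussion) could be passed in without changing the function.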
- The language code for labels should probably be "sr-ec" instead of just "sr" [1]. Please add a more specific P31 than "Q2065736". --- Jura 07:56, 10 November 2017 (UTC)
- @Jura1: There is no information in the tables which would allow for the determination of a more specific instance of (P31). Note that cultural property (Q2065736) is only used in items that don't already have an instance of (P31) value. /André Costa (WMSE) (talk) 12:09, 10 November 2017 (UTC)
- An automated translation of the label gave me "archeological site". Q839954 would be sufficient. --- Jura 12:30, 10 November 2017 (UTC)
- @André Costa (WMSE): @Alicia Fagerving (WMSE): Any reaction to this? --Ymblanter (talk) 11:50, 19 November 2017 (UTC)
- @Jura1: Sorry, I missed your reply. Relying on free-text matching of the labels for P31 is something we tried before, with quite bad results (even with the support of someone speaking the language). The problem was too many false positives (e.g. labels may include names of people or places which, like "Newcastle", look like a type description), which were hard to detect and fix afterwards. When there has been a "type" column in the tables, we have done such matching with much more successful results. /André (logged out for holidays) 19:05, 19 November 2017 (UTC)
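The P31 rule discussed in this thread can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the bot's implementation: `choose_p31`, `TYPE_COLUMN_MAP`, and the `type` row key are hypothetical names, and the rs table is said above to have no such column, so in this case the function would always fall back to the generic value.

```python
INSTANCE_OF = "P31"
CULTURAL_PROPERTY = "Q2065736"    # generic fallback named in the discussion
ARCHAEOLOGICAL_SITE = "Q839954"   # more specific value suggested above

# Hypothetical mapping from a dedicated "type" column to specific items.
TYPE_COLUMN_MAP = {"archaeological site": ARCHAEOLOGICAL_SITE}


def choose_p31(existing_claims, row):
    """Return the P31 value to add, or None if the item already has one."""
    if existing_claims.get(INSTANCE_OF):
        # Never override or duplicate an existing instance-of statement.
        return None
    # Only a structured "type" column is consulted; the free-text label
    # is deliberately ignored to avoid false positives.
    type_value = row.get("type", "").strip().lower()
    return TYPE_COLUMN_MAP.get(type_value, CULTURAL_PROPERTY)
```

Because the label is never inspected, a row whose name merely resembles a type description (the "Newcastle" case) still receives only the generic fallback.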