Wikidata:Requests for permissions/Bot/MidleadingBot 3
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Not done @Midleading: This request seems to be abandoned, please reopen it if that is not the case. Thanks. Mike Peel (talk) 20:40, 21 July 2020 (UTC)[reply]
MidleadingBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Midleading (talk • contribs • logs)
Task/s:This bot will import courtesy name (P1782)、temple name (P1785)、posthumous name (P1786) and art name (P1787) from Wikipedia articles. These properties are basic properties of historical Chinese people yet their usage is extremely low. Almost everbody should have courtesy name (P1782) set.
Code:The code is not public because it must be reprogrammed to adapt to varied composition of Wikipedia articles before each run.
Function details:It may import the data from anywhere in a Wikipedia article using regular expression. The preprocessing is manual and usually begins with a Wikipedia articles export, followed by manual checking, pattern matching and testing, and the final data will be imported and is sourced to the Wikipedia article it imported from.
It may also import from Wikisource text imported from Wikimedia project (P143)Chinese Wikisource (Q19822573). --Midleading (talk) 14:50, 9 December 2019 (UTC)[reply]
- You may found CBDB as another possible source to import.--GZWDer (talk) 13:18, 10 December 2019 (UTC)[reply]
- I found people already imported the information from CBDB as alias, but they just create new items instead of checking for existing items linked to Wikipedia. I imported about 2000 courtesy name (P1782) values, and they contributed to nearly 2000 duplicated items found by matching imports from Wikipedia and CBDB. I merged about 1000 of them without sex or gender (P21) using this bot and those with sex or gender (P21) will be merged later, due to Wikidata Query Service lag followed by incident of QuickStatements. Also I don't trust CBDB so much as they create duplicated records with so little information to link them to a Wikidata item, and the effort to link them is better to be used to import from trusted text in Wikisource.--Midleading (talk) 14:59, 10 December 2019 (UTC)[reply]