Wikidata:Requests for permissions/Bot/MsynBot 9
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved--Ymblanter (talk) 20:11, 9 November 2022 (UTC)[reply]
MsynBot 9[edit]
MsynBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: MisterSynergy (talk • contribs • logs)
Task/s: manage exisiting GND ID (P227) identifier claims
Code: at Github; will be tweaked where necessary, then run weekly in the msynbot
tool on Toolforge
Function details: Wikidata currently has 1.35 million GND ID (P227) claims. Since there are some oddities about this identifier, some of these claims need a special treatment. There is a relatively active community involved in maintaining P227 values in good shape, but some tasks can easily be automated in order to save time. I propose to do in particular with my bot:
- Tn-type entries in GND: set to deprecated rank, add corresponding qualifier (these are technically not part of GND anymore)
- Redirects in GND: set to deprecated rank, add corresponding qualifier, add redirect target as separate claim with normal rank if still missing
- There are a couple of other maintenance jobs involving P227 claims, such as ranking adjustment, and detection of invalid/inexistent entries. While those are not really P227-specific, I aim to fix them regularly since I have all the necessary data available.
- Besides WDQS, the bot would rely on input from User:Wurgl (as the maintainer of the persondata tool in German Wikipedia), and public SPARQL query endpoint at ZBW, managed by User:Jneubert.
- The code is not new, I have occasionally run it already under my regular account.
- Other users potentially interested here are @Kolja21, Epìdosis, Emu (and maybe others…)
—MisterSynergy (talk) 09:40, 2 September 2022 (UTC)[reply]
- Strong support and many thanks! --Epìdosis 09:59, 2 September 2022 (UTC)[reply]
- Support It would help if the bot deletes the Tn (de:Hilfe:GND#Personen) if an item has already a valid GND. Otherwise the duplicate check (Wikidata:Database reports/Constraint violations/P227) takes more time. --Kolja21 (talk) 14:12, 2 September 2022 (UTC)[reply]
- +1 on Kolja, of course. --Epìdosis 16:01, 5 September 2022 (UTC)[reply]
- Support Dankeschön! --Emu (talk) 14:57, 2 September 2022 (UTC)[reply]
- Support ~ MdaNoman (talk) 19:39, 5 September 2022 (UTC)[reply]
- Please show some test edits.--Ymblanter (talk) 16:19, 6 September 2022 (UTC)[reply]
- @MisterSynergy:. Thanks. Mike Peel (talk) 18:47, 24 September 2022 (UTC)[reply]
@Ymblanter, Epìdosis, Kolja21, Emu, MdaNoman, Mike Peel: finally an update. The (experimental) ZBW SPARQL endpoint for GND turned out to be too unreliable, so I had to change significant parts of the code. Now this is finally done and I have made some test edits: [1]. It shows:
- 5 cases/10 edits for Tn-identifiers to be deprecated and equipped with a reason for deprecated rank (P2241) qualifier
- 5 cases/14 edits for redirecting identifiers to be deprecated, equipped with a reason for deprecated rank (P2241) qualifier, and their redirect target added to the data item if missing
- 1 case/2 edits for a non-redirecting identifier that was incorrectly marked as a redirect
- 4 cases/4 edits for oddly-ranked identifiers (lower "preferred rank" to "normal rank" when no other "normal rank" identifiers are found in this item)
The most recent version of the source code is currently only on Toolforge, but it will eventually find its way to Github; the formerly linked code at PAWS is outdated. —MisterSynergy (talk) 08:12, 9 November 2022 (UTC)[reply]
- Re Supporting.- MdaNoman (talk) 08:26, 9 November 2022 (UTC)[reply]
- again Support, of course! --Epìdosis 08:36, 9 November 2022 (UTC)[reply]
- The source code is now available at Github, and no longer in my PAWS account. —MisterSynergy (talk) 09:35, 9 November 2022 (UTC)[reply]
- again Support, thanks for your great work. --Kolja21 (talk) 15:51, 9 November 2022 (UTC)[reply]