Wikidata:Requests for permissions/Bot/PintochBot 2
From Wikidata
Jump to navigation
Jump to search
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Withdrawn − Pintoch (talk) 10:12, 19 July 2018 (UTC)[reply]
PintochBot 2[edit]
PintochBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Pintoch (talk • contribs • logs)
Task/s: remove square brackets at the boundaries of labels of scientific articles
Function details: Cleans up square brackets inserted by a bogus import of scientific articles. From a query by User:Daniel Mietchen
Example edits: Special:Contributions/PintochBot
− Pintoch (talk) 21:31, 18 May 2018 (UTC)[reply]
- Feedback
- Could you insert a statement about the language of the article as well? Currently these brackets are the indication that the articles aren't written in English.
--- Jura 11:03, 22 May 2018 (UTC)[reply]- Not sure where this information would be available? @Daniel Mietchen:, any idea? − Pintoch (talk) 11:16, 22 May 2018 (UTC)[reply]
- We discussed it at Wikidata:Requests_for_permissions/Bot/maria_research_bot.
--- Jura 11:22, 22 May 2018 (UTC)[reply]- I have not checked the discussion shared by Jura, but checking the PubMed identifiers of some scientific articles I found below the title "[Article in...]", let me give you examples: Diagnosis of para-influenza infection and the state of humoral immunity to para-influenza virus in children (Q40323731), "[Article in Russian]"; Retinal detachment and argon laser prophylactic treatment (author's transl) (Q33211252), "[Article in French]". Could not that information be automatically extracted from the PubMed identifier? I mean in this case at least, I couldn't check many other cases, but most of them are from PubMed. Regards, Ivanhercaz (Talk) 16:39, 22 May 2018 (UTC)[reply]
- I checked the above discussion, which does not contain a pointer to how to get this information from PubMed. I also checked the documentation, where they mention a 'LANG' filter, which may indeed do the trick. Otherwise, we may have to scrape that from the HTML, where they provide a
<div class="lang"> [Article in XYZ]</div>
tag. --Daniel Mietchen (talk) 07:29, 23 May 2018 (UTC)[reply] - Well, as mentioned in the discussion, we know that it's not English. In any case, this can be cast into a structured statement.
--- Jura 07:39, 23 May 2018 (UTC)[reply]