Wikidata:Bot requests/Archive/2021/11

From Wikidata
Jump to navigation Jump to search


request to fix "artwok" (2021-11-05)

Request date: 5 November 2021, by: Jura1

Task description
Discussion
Request process

Accepted by (Ammarpad (talk) 12:47, 10 November 2021 (UTC)) and under process
Task completed (09:57, 14 November 2021 (UTC))

I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. Ammarpad (talk) 09:57, 14 November 2021 (UTC)

request to import data for "Cheung Chau Piu Sik Parade" (2021-05-06)

Request date: 6 May 2021, by: Hkbulibdmss

Link to discussions justifying the request
Task description

https://www.wikidata.org/wiki/Wikidata:Dataset_Imports/Cheung_Chau_Piu_Sik_Parade

Please help to import the dataset. The URL of a spreadsheet is : https://docs.google.com/spreadsheets/d/1iUVrHNsXVmn94IygtZYj0-foeUg9yvdOcwQ_V-CQbto/edit?usp=sharing

Licence of data to import (if relevant)
Discussion

@Hkbulibdmss: I checked the Google spreadsheet and it seems to be a database of media files (photos, digital documents, videos). Thus, it may not be eligible for Wikidata. If the files are released under a Creative Commons license, you may be able to import them into Wikimedia Commons instead.Vojtěch Dostál (talk) 06:32, 26 November 2021 (UTC)

@Vojtěch Dostál Thank you for your suggestions. Best. Hkbulibdmss (talk) 07:16, 26 November 2021 (UTC)
Request process
I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. Vojtěch Dostál (talk) 07:47, 26 November 2021 (UTC)

request to remove inverses with P1830 for paintings (2021-11-23)

Request date: 23 November 2021, by: Jura1

Link to discussions justifying the request
Task description

The statements with owner of (P1830) duplicate the information already added with owned by (P127) and can be removed. --- Jura 11:36, 23 November 2021 (UTC)


Discussion


Request process

request to replace qualifiers in GND ID (2021-06-07)

Request date: 7 June 2021, by: Kolja21

Link to discussions justifying the request
Task description

Please replace in GND ID object named as (P1932) with subject named as (P1810)

  1. GND ID (P227) delete qualifier object named as (P1932)
  2. import name of object from GND with qualifier subject named as (P1810)
  3. add retrieved (P813)

Scope: 5.161 qualifiers object named as (P1932), see Wikidata:Database reports/Constraint violations/P227#Properties statistics.

 Comment (in German): Man könnte hinzufügen, dass man über die OpenRefine Reconciliation oder über https://d-nb.info/gnd/100045642/about/lds.ttl (gndo:preferredNameForThePerson) recht einfach und schnell die aktuelle Version abfragen kann. (User:Emu)

Example
Discussion
Request process

Accepted by (Ammarpad (talk) 14:01, 10 June 2021 (UTC)) and under process

@Ammarpad Are you still working on this? Vojtěch Dostál (talk) 20:32, 25 November 2021 (UTC)

request to cleanup DOI only items (2021-07-04)

Request date: 4 July 2021, by: Jura1

Task description

Items like Q57554778 consist mainly of DOI: the DOI is repeated as title and label.


@Daniel Mietchen: who created some or all of them. @Trilotat: who mentioned some on Wikidata:Request_a_query#Items_with_DOI_(P356)_that_start_with_10.1023/A:_without_a_Label_or_a_title_(P1476). --- Jura 13:24, 4 July 2021 (UTC)

@Jura1: To be precise, I was looking for items without a label, but I had seen this and did some research. A web search for any of the "DOI as title" DOIs will find that they are all or almost all noted in ResearchGate publication ID (P5875) items associated with Entomologia Experimentalis et Applicata (Q15753202) journal. These items are published in (P1433) CrossRef Listing of Deleted DOIs (Q53952674).
  • Q57554778 is 10.1023/A:1003902321787 and that DOI is mentioned in ResearchGate publication ID (P5875) 226608108. That researchgate item mentions the title and article details as Q107413498.
  • I added the deleted DOI to that matched item as deprecated (as withdrawn identifier value).
  • They should be merged, but I didn't as I thought it might confuse this bot request.
In the future, I think we can add the new DOI to the bad items and then rerun SourceMD as I did with Q57030816, right? Trilotat (talk) 14:54, 4 July 2021 (UTC)
List of items: User:Jura1/DOI as label. It was done using regexp 10\..+/ for title (P1476) values. — Ivan A. Krestinin (talk) 20:15, 21 July 2021 (UTC)
@Jura1 It seems that all the items listed in Ivan's query have defunct DOIs... Am I right? What would be the correct course of action there? Vojtěch Dostál (talk) 20:35, 25 November 2021 (UTC)
Request process

request to find references for novalue statements in "spouse" (P26) (2021-10-31)

Request date: 31 October 2021, by: Jura1

Task description
This is less so for novalue statements. Sample: Q12325#P26 for James Buchanan (Q12325).
Maybe there is a way to reference them with one or the other source.
The proposed task is to find a suitable source and added references to such statements. --- Jura 12:57, 31 October 2021 (UTC)
Discussion
  • Apparently Wikitree has a flag "no more marriages" for this (according to User:Lesko987a), but it's generally not filled. I think that for readers this is visible by the absence of spouse unknown (even if the person has no spouse). --- Jura 12:03, 27 November 2021 (UTC)
Request process

(1) Query to find them:

SELECT DISTINCT
  ?item ?itemLabel ?itemDescription
WHERE
{
  ?st a wdno:P26 .
  ?item p:P26 ?st .
  OPTIONAL { ?st prov:wasDerivedFrom ?source . 
            FILTER NOT EXISTS { ?source pr:P143 [] } 
           } 
  FILTER(!bound(?source)) 
  ?item wdt:P31 wd:Q5 ; wdt:P570 ?d . 
  FILTER ( YEAR(?d) > 1600 ) 
  FILTER NOT EXISTS { ?item wdt:P106 wd:Q1469535 }
  FILTER NOT EXISTS { ?item wdt:P26 []  }  
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
LIMIT 100

Try it!

(2) Currently 549 items on 12:57, 31 October 2021 (UTC)

request to mirror Wikipedia page moves: enwiki and/or others (2021-11-14)

Request date: 14 November 2021, by: Jura1

Link to discussions justifying the request
Task description

Follow page moves from a wiki (e.g. enwiki) and update sitelinks on Wikidata. See Project chat discussion above. Apparently a bot already does that for dewiki.

Discussion
Request process

request to remove ±1 (in width and height) from paintings (2021-11-15)

Request date: 16 November 2021, by: Jura1

Link to discussions justifying the request
Task description

There are some 10000 items that still have legacy ±1 in values. I don't think any of these correspond to actual information. This request is to remove it from height (P2048) and width (P2049)} statements for items with instance of (P31) = painting (Q3305213) --- Jura 09:35, 16 November 2021 (UTC)


Discussion

When those items were created, the pywikibot-framework forced this to have a value. I also recently figured out that this requirement was dropped (and that is good!). I will pick this one up in the coming days/week, unless someone else already fixed it... Edoderoo (talk) 12:13, 17 November 2021 (UTC)

Just noting that this occurs on many museum objects that are not paintings as well. - PKM (talk) 21:13, 18 November 2021 (UTC)
It's still present in many fields .. don't hesitate to formulate cleanup requests .. --- Jura 21:17, 18 November 2021 (UTC)
We can not clean up just all of them without additional manual research, often the values are rather useless, but in many cases they were put there on purpose, and a bot can't see the difference between the two. Edoderoo (talk) 21:56, 19 November 2021 (UTC)
@Edoderoo: is your comment about paintings or about other fields? If it's for paintings (this request), could you provide some samples? --- Jura 11:33, 23 November 2021 (UTC)
For paintings I do not see an issue, as the measure of a frame will be pretty precisely measured, even when it is in mm. For all other items we first need to check ... maybe you can fix those too, but not all of them. Edoderoo (talk) 14:37, 23 November 2021 (UTC)
It's probably worth doing them one type at a time. My current focus is paintings. --- Jura 11:55, 27 November 2021 (UTC)
Request process

request to import the rest of Nomenclature for Museum Cataloging (P7749) (2021-11-18)

Request date: 18 November 2021, by: Vladimir Alexiev

Link to discussions justifying the request
Task description

See discussion. In brief:

  • Import the remaining 9.2k entries from this thesaurus
  • While linking intelligently into the WD class hierarchy and adding qualifiers "of" or "use"
  • Need programming & algorithmic knowledge, and ideally a bit of NLP
Licence of data to import (if relevant)

Open Data, see https://www.nomenclature.info/droitauteur-copyright.app?lang=en

Discussion

 Support as one of the editors who has done work manually matching this catalog. I have several comments:

  • Nomenclature is a bilingual database. The labels should be imported in both English and French. Some items have Canadian French and Canadian English labels as well; these should be added as aliases if practical. This is one reason why importing directly from MnM is not my preferred solution.
  • Nomeclature includes "non-preferred terms" (see example). These can be imported as aliases if practical.
  • The Getty Art & Architecture link (in "other references to this object" on the user interface) can be used to prevent duplicate entries. If the AAT ID does not exist in Wikidata, it should be added to the new item. If the AAT ID does exist in Wikidata, the Nomenclature ID can be added to the existing item.
  • Nomenclature has some blank subclasses (example) which need to be skipped.
  • Nomenclature has a hierarchy of sports equipment by sport which is not like anything in Wikidata, where it seems the standard is to use sport (P641) to qualify a type of equipment - see the list of direct subclasses of sports equipment. We need to decide if we want to import all of these classes or not.

- PKM (talk) 21:08, 18 November 2021 (UTC)

Thanks for the support and the excellent suggestion! We'll be publishing NOM as RDF entities in SKOS/SKOSXL soon, until then it's available as big RDF dumps, and I can make any tabular export desired (eg with the 4 languages dispatched to separate columns). Pinging @Crowjane7: who's one of the main editors --Vladimir Alexiev (talk) 06:37, 23 November 2021 (UTC)
@Vladimir Alexiev: I hadn't realized you were involved with Nomenclature on the tech side! :-) PKM (talk) 23:01, 27 November 2021 (UTC)
Request process

Adminbot deleting non-notable Semantic Scholar authors

Request date: 26 November 2021, by: Epìdosis

Link to discussions justifying the request


Task description

Given the following query

SELECT DISTINCT ?item ?itemLabel
WHERE { 
  ?item wdt:P4012 ?sesc .
  ?item wikibase:identifiers 1 .
  MINUS { ?other ?id ?item } .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
ORDER BY ?itemLabel
Try it!

delete all these items (5874 as of now).

All these items are based uniquely on Semantic Scholar author ID (P4012) and have no incoming links; SemanticScholar itself is not sufficient for implying notability; moreover, there are strange cases of SemanticScholar IDs redirected to different names (e.g. Q64864826 "Aleksey Buzykaev" has https://www.semanticscholar.org/author/104224927 which now redirects to https://www.semanticscholar.org/author/W.-Buttinger/4134655 "W. Buttinger"), which could lead to future conflations here on Wikidata, and other IDs are inexistent (e.g. Q64855725 links to inexistent https://www.semanticscholar.org/author/95221647). --Epìdosis 11:32, 26 November 2021 (UTC)

Discussion
  • I tried to figure out who created them and found some created by QuickStatementsBot [1] without any indication of the user who requested it. Similarly by Reinheitsgebot (without any indication of a MxM catalogue). Further by IPs who didn't add additional statements .. so some cleanup would probably help. --- Jura 15:50, 27 November 2021 (UTC)
  • https://w.wiki/4Td4 shows when they were created. Seems to be mostly June and Sept 2019. --- Jura 16:09, 27 November 2021 (UTC)
Request process

request to automate marking preferred_rank for full dates. (2021-05-28)

Request date: 28 May 2021, by: Richard Arthur Norton (1958- )

Task description

We have year only dates and full dates for date_of_birth and date_of_death. See for instance Eliot Blackwelder (Q16785350). We need to mark the full date as "preferred rank" and add in the reason_for_preferred_rank=most complete record (Q105749746). The problem is when we have two dates of equal rank, both display in infoboxes. --RAN (talk) 04:45, 28 May 2021 (UTC)

Discussion
Request process

@Richard Arthur Norton (1958- ): What about references though? What if the less complete date has a reference and the other does not? Should we still do this? I might be able to find time to do this. BrokenSegue (talk) 05:21, 28 May 2021 (UTC)

I guess in the case where the two dates disagree we should not perform the update. BrokenSegue (talk) 05:22, 28 May 2021 (UTC)
That would be great, I haven't seen the bot in action yet, I am still plugging away by hand as I come across them. --RAN (talk) 20:20, 28 May 2021 (UTC)
No, my bot does not manipulate ranks. --Matěj Suchánek (talk) 11:52, 29 May 2021 (UTC)

@Matěj Suchánek: Are you interested in picking this task up? It does kinda overlap with the task Jura mentioned. Actually, hmm, there is some subtlety here that I can see being tricky (multiple dates with different qualifiers sometimes shouldn't be merged e.g. for start time (P580)s with a applies to part (P518)). If not I may still do it. BrokenSegue (talk) 12:40, 30 May 2021 (UTC)

Sorry, I am not right now. I guess it's easy now that we have Ranker (Q105394978), which can be driven by SPARQL. (Or maybe not that easy if the qualifier is also required, but QS can do this part.) I made up a query which can be used as basis.
What if the less complete date has a reference and the other does not? Preferred statements should always be sourced. If there is no evidence for the more precise date, it should be either removed or sourced (and then up-rank'd). --Matěj Suchánek (talk) 13:12, 30 May 2021 (UTC)
Thanks for the query; you're a SPARQL wizard. I write my bot actions self-contained in python so I don't need ranker. BrokenSegue (talk) 14:07, 30 May 2021 (UTC)
Excellent! I know there are several bots trying to fill in references for dates, but they are mostly pulling data from sources that give year-only dates. At one time I calculated that about 20% of year-only dates are off by a year because they are back calculated from the age at death in an obituary. --RAN (talk) 00:37, 1 June 2021 (UTC)
Do you know who is operating these bots? Wikibase in theory supports adding uncertainty in dates but in practice I believe the correct way to add a date with that kind of uncertainty is to use e.g. earliest date (P1319). BrokenSegue (talk) 01:31, 1 June 2021 (UTC)


  • @Vojtěch Dostál: it seems that preferred rank is also added when dates aren't in the same year. I don't think this should be done.
(I tried to find the sample that came up on Wikidata:Database reports/identical birth and death dates/1, but couldn't find it) --- Jura 11:54, 27 November 2021 (UTC)
@Jura1 Did you mean to tag @Matěj Suchánek? Vojtěch Dostál (talk) 14:57, 27 November 2021 (UTC)
No, I think it was an edit of yours, but I might be mistaken. If the request is being done by Matěj, I suppose we can close this anyways. --- Jura 15:00, 27 November 2021 (UTC)
I'm not involved in this request by working on it. --Matěj Suchánek (talk) 09:01, 28 November 2021 (UTC)

Help Bota .. (2021-07-27)

Request date: 27 July 2021, by: Takhirgeran Umar

Link to discussions justifying the request
Task description
Licence of data to import (if relevant)
Discussion
 Comment There are around 120,000 changes. --Matěj Suchánek (talk) 16:20, 6 August 2021 (UTC)
to clarify you want all items with that description replaced with that other description? Is there discussion around this? I can do it easily but no idea if this is an "Accepted" change. BrokenSegue (talk) 19:37, 15 August 2021 (UTC)
@Takhirgeran Umar Can you please answer the question? Vojtěch Dostál (talk) 15:45, 25 November 2021 (UTC)
The fact is that we have long changed the space of the name "Куцкеп" on "Кеп" (as in the dictionary) Takhirgeran Umar (talk) 17:00, 25 November 2021 (UTC)
@Takhirgeran Umar So you'd like all descriptions which have this precise string in Chechen: "куцкеп Википеди" to be replaced with this precise string: "Викимедин проектан кеп" OK? Is that so? @BrokenSegue can probably do that very easily but we need to be sure what we're doing because Google Translate isn't very useful there so we have to take your word for it. Vojtěch Dostál (talk) 20:41, 25 November 2021 (UTC)
@Vojtěch Dostál I opened a discussion. After a short time, write here. Takhirgeran Umar (talk) 20:55, 25 November 2021 (UTC)
@Vojtěch Dostál We decided that spelling will correctly "Викимеди проектан кеп" (On the forum). Takhirgeran Umar (talk) 10:15, 28 November 2021 (UTC)
Request process