Property talk:P7859

From Wikidata
Jump to navigation Jump to search

Documentation

WorldCat Identities ID (superseded)
former ID for a bibliographic page of an entity on WorldCat. Use P243 (OCLC control number) for books, and P10832 (WorldCat Entities ID) for authority control for people, places, and works
Applicable "stated in" valueWorldCat Identities (Q76630151)
Data typeExternal identifier
Template parameterTemplate:Authority control/WORLDCAT-LCCN (Q14467416), Template:WorldCat id (Q11247300)
Domain
Allowed valuesviaf-\d+|lccn-n[a-z]?[0-9\-]+|n[cps]-.+
ExampleWilliam Shakespeare (Q692)lccn-n78095332
Guillaume Caoursin (Q164470)lccn-n88603570
William Verbeck (Q76460081)np-verbeck,%20william$1861
np-verbeck,%20william
Aqua (Q622895)viaf-136244923
Buddy (Q1060353)lccn-no2012121338
Aphrodite (Q35500)lccn-no2014047558
Emmanuel (Q3891104)lccn-n86129619
Sourcehttps://www.worldcat.org/identities
External linksUse in sister projects: [ar][de][en][es][fr][he][it][ja][ko][nl][pl][pt][ru][sv][vi][zh][commons][species][wd][en.wikt][fr.wikt].
Formatter URLhttps://worldcat.org/identities/$1/
Tracking: usageCategory:Pages using Wikidata property P7859 (Q116843614)
See alsoVIAF ID (P214), Library of Congress authority ID (P244), FAST ID (P2163), OCLC control number (P243), GND ID (P227), WorldCat Registry ID (P5505), WorldCat Entities ID (P10832)
Lists
Proposal discussionProposal discussion
Current uses
Total1,873,119
Main statement1,872,424 out of 20,194,854 (9% complete)>99.9% of uses
Qualifier8<0.1% of uses
Reference687<0.1% of uses
Search for values
[create Create a translatable help page (preferably in English) for this property to be included here]
Format “viaf-\d+|lccn-n[a-z]?[0-9\-]+|n[cps]-.+: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P7859#Format, SPARQL
Distinct values: this property likely contains a value that is different from all other items. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303). Known exceptions: British Ceylon (Q918153), Dominion of Ceylon (Q2670092)
List of violations of this constraint: Database reports/Constraint violations/P7859#Unique value, SPARQL (every item), SPARQL (by value)
Scope is as main value (Q54828448), as reference (Q54828450): the property must be used by specified way only (Help)
List of violations of this constraint: Database reports/Constraint violations/P7859#Scope, hourly updated report, SPARQL
Allowed entity types are Wikibase item (Q29934200): the property may only be used on a certain entity type (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P7859#Entity types
Item “WorldCat Entities ID (P10832): Items with this property should also have “WorldCat Entities ID (P10832)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303). Known exceptions: Empire Interactive (Q2714179), Ocean Software (Q1408498), Tree Care Industry Association (Q7837535)
List of violations of this constraint: Database reports/Constraint violations/P7859#Item P10832, search, SPARQL
Single value: this property generally contains a single value. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P7859#Single value, SPARQL
Value doesn't match P214
value starting with "viaf-" doesn't match VIAF ID (P214)-value. (Help)
Violations query: SELECT ?item ?value { ?item wdt:P7859 ?value . OPTIONAL { ?item wdt:P214 ?viaf . BIND( concat("viaf-", ?viaf ) as ?viafvalue) } FILTER NOT EXISTS { ?item wdt:P7859 ?viafvalue } FILTER NOT EXISTS { ?item p:P7859 / ps:P7859 ?viafvalue } FILTER ( regex ( ?value, "^viaf-.*" ) ) } LIMIT 500
List of this constraint violations: Database reports/Complex constraint violations/P7859#Value doesn't match P214
Value doesn't match P244
value starting with "lccn-" doesn't match Library of Congress authority ID (P244)-value. Some false positives: need to replace -%d with %06d. (Help)
Violations query: SELECT ?item ?value ?lccn { ?item wdt:P7859 ?value . OPTIONAL { ?item wdt:P244 ?lccn . BIND( concat("lccn-", ?lccn ) as ?lccnvalue ) } FILTER NOT EXISTS { ?item wdt:P7859 ?lccnvalue } FILTER NOT EXISTS { ?item p:P7859 / ps:P7859 ?lccnvalue } FILTER ( regex ( ?value, "^lccn-.*" ) ) } LIMIT 500
List of this constraint violations: Database reports/Complex constraint violations/P7859#Value doesn't match P244

Conflations[edit]


type constraint[edit]

I added a new type constraint for the class event (Q1656682) because also conferences like German Librarians Day Conference 1973 (Q62033327) have a WorldCat Identities ID (superseded) (P7859). --Mfchris84 (talk) 05:45, 25 March 2020 (UTC)[reply]

novalue[edit]

Is it valid to use special novalue setting in this property? There are known cases when item has VIAF or LCCN (or both) but any WorldCatId created using those values does not work. Paweł Ziemian (talk) 21:00, 27 March 2020 (UTC)[reply]

negative type constraint[edit]

Vladimir Alexiev (talk) 11:59, 13 March 2017 (UTC) Jonathan Groß (talk) 17:52, 26 March 2017 (UTC) Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits Jneubert (talk) 13:47, 29 April 2017 (UTC) Sic19 (talk) 20:42, 12 July 2017 (UTC) Wikidelo (talk) 21:15, 8 May 2018 (UTC) ArthurPSmith (talk) 19:52, 22 August 2018 (UTC) PKM (talk) 19:40, 23 August 2018 (UTC) Ettorerizza (talk) 06:44, 8 October 2018 (UTC) Fuzheado (talk) 03:47, 19 December 2018 (UTC) Daniel Mietchen (talk) 16:30, 7 April 2019 (UTC) Iwan.Aucamp (talk) 21:48, 3 October 2019 (UTC) Epìdosis (talk) 23:49, 22 November 2019 (UTC) Sotho Tal Ker (talk) 00:52, 1 May 2020 (UTC) Bargioni (talk) 09:48, 02 May 2020 (UTC) Carlobia (talk) 14:34, 11 May 2020 (UTC) Pablo Busatto (talk) 03:22, 23 June 2020 (UTC) Matlin (talk) 10:53, 6 July 2020 (UTC) Msuicat (talk) 21:57, 27 August 2020 (UTC) Uomovariabile (talk) 10:04, 27 October 2020 (UTC) Silva Selva (talk) 17:21, 30 November 2020 (UTC) 1-Byte (talk) 15:52, 14 December 2020 (UTC) Alessandra.Moi (talk) 17:26, 16 February 2021 (UTC) CamelCaseNick (talk) 21:20, 20 February 2021 (UTC) Songceci (talk) 18:45, 24 February 2021 (UTC)]] moz (talk) 10:48, 8 March 2021 (UTC) AhavaCohen (talk) 14:41, 11 March 2021 (UTC) Kolja21 (talk) 17:37, 13 March 2021 (UTC) RShigapov (talk) 14:34, 19 September 2021 (UTC) Jason.nlw (talk) 15:15, 30 September 2021 (UTC) MasterRus21thCentury (talk) 20:22, 18 October 2021 (UTC) Newt713 (talk) 08:42, 13 March 2022 (UTC) Pierre Tribhou (talk) 08:00, 20 March 2022 (UTC) Powerek38 (talk) 17:21, 14 April 2022 (UTC) Ahatd (talk) 08:34, 4 August 2022 (UTC) JordanTimothyJames (talk) 00:54, 31 August 2022 (UTC) --Silviafanti (talk) 17:07, 14 September 2022 (UTC) Back ache (talk) 02:03, 1 November 2022 (UTC) AfricanLibrarian (talk) M.roszkowski (talk) 10:44, 4 January 2023 (UTC) Rhagfyr (talk) 19:36, 9 January 2023 (UTC) — Haseeb (talk) 13:10, 4 August 2023 (UTC) 13:26, 15 November 2023 (UTC) MrBenjo (talk) 15:20, 23 April 2024 (UTC)[reply]

Notified participants of WikiProject Authority control, also

@ديفيد عادل وهبة خليل 2, Epìdosis, Salgo60, Animalparty, Jura1, ArthurPSmith:

I noticed people use WorldCat Identities ID (superseded) (P7859) for books while they should use OCLC control number (P243). Eg see https://www.wikidata.org/w/index.php?title=Q48837289&action=history, and a full list of mistakes is below. Any takers to correct these? I fixed the first couple.

155331401	Q48837289
316763896	Q5223184
316942938	Q87140025
432822520	Q48837289
61935747	Q3236166
664802524	Q10316960
78751353	Q48837289
883743583	Q3391127
973542923	Q15595065

WorldCat Identities ID (superseded) (P7859) already has type constraint (thanks @Mfchris84:). Can someone in the know add a negative type constraint to forbid the classes allowed for OCLC control number (P243)?

I added a note in the description --Vladimir Alexiev (talk) 14:14, 10 April 2020 (UTC)[reply]

initial load and refresh[edit]

Vladimir Alexiev (talk) 11:59, 13 March 2017 (UTC) Jonathan Groß (talk) 17:52, 26 March 2017 (UTC) Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits Jneubert (talk) 13:47, 29 April 2017 (UTC) Sic19 (talk) 20:42, 12 July 2017 (UTC) Wikidelo (talk) 21:15, 8 May 2018 (UTC) ArthurPSmith (talk) 19:52, 22 August 2018 (UTC) PKM (talk) 19:40, 23 August 2018 (UTC) Ettorerizza (talk) 06:44, 8 October 2018 (UTC) Fuzheado (talk) 03:47, 19 December 2018 (UTC) Daniel Mietchen (talk) 16:30, 7 April 2019 (UTC) Iwan.Aucamp (talk) 21:48, 3 October 2019 (UTC) Epìdosis (talk) 23:49, 22 November 2019 (UTC) Sotho Tal Ker (talk) 00:52, 1 May 2020 (UTC) Bargioni (talk) 09:48, 02 May 2020 (UTC) Carlobia (talk) 14:34, 11 May 2020 (UTC) Pablo Busatto (talk) 03:22, 23 June 2020 (UTC) Matlin (talk) 10:53, 6 July 2020 (UTC) Msuicat (talk) 21:57, 27 August 2020 (UTC) Uomovariabile (talk) 10:04, 27 October 2020 (UTC) Silva Selva (talk) 17:21, 30 November 2020 (UTC) 1-Byte (talk) 15:52, 14 December 2020 (UTC) Alessandra.Moi (talk) 17:26, 16 February 2021 (UTC) CamelCaseNick (talk) 21:20, 20 February 2021 (UTC) Songceci (talk) 18:45, 24 February 2021 (UTC)]] moz (talk) 10:48, 8 March 2021 (UTC) AhavaCohen (talk) 14:41, 11 March 2021 (UTC) Kolja21 (talk) 17:37, 13 March 2021 (UTC) RShigapov (talk) 14:34, 19 September 2021 (UTC) Jason.nlw (talk) 15:15, 30 September 2021 (UTC) MasterRus21thCentury (talk) 20:22, 18 October 2021 (UTC) Newt713 (talk) 08:42, 13 March 2022 (UTC) Pierre Tribhou (talk) 08:00, 20 March 2022 (UTC) Powerek38 (talk) 17:21, 14 April 2022 (UTC) Ahatd (talk) 08:34, 4 August 2022 (UTC) JordanTimothyJames (talk) 00:54, 31 August 2022 (UTC) --Silviafanti (talk) 17:07, 14 September 2022 (UTC) Back ache (talk) 02:03, 1 November 2022 (UTC) AfricanLibrarian (talk) M.roszkowski (talk) 10:44, 4 January 2023 (UTC) Rhagfyr (talk) 19:36, 9 January 2023 (UTC) — Haseeb (talk) 13:10, 4 August 2023 (UTC) 13:26, 15 November 2023 (UTC) MrBenjo (talk) 15:20, 23 April 2024 (UTC)[reply]

Notified participants of WikiProject Authority control, also

@ديفيد عادل وهبة خليل 2, Epìdosis, Salgo60, Animalparty, Jura1, ArthurPSmith:

I finished importing 1683978 WorldCat ID from VIAF 20191104. I did it with QuickStatements batches (86 batches of 40k statements or 20k identifiers each). It took 2 months and 1 week to process them (from 4 Feb 2020 to 10 Apr 2020). 30326 (1.8%) failed to insert on this initial load because high load causes QS errors (timeouts when adding statements, then the corresponding reference also fails).

I also collected 57 errors (exceptions) from WD editor reverts of my inserts. I put them down at https://en.wikipedia.org/wiki/Wikipedia:VIAF/errors#WorldCat_Identities_errors. I will submit these to OCLC for correction.

Now I plan to do a refresh.

  • From WD I got 2072346 VIAF ids
  • From WD I got 1683967 WorldCat ids: 1096206+587684 (lccn+viaf)
  • From VIAF http://viaf.org/viaf/data/viaf-20200302-links.txt.gz (7Gb unzipped) I got 20194854 WorldCat ids: 8521769+11673083+2 (lccn+viaf+other). This is growth by 275671 (1.36%) in 4 months (compared to 20191104)
  • WD items with VIAF but no WorldCat: 377995
  • Intersect with VIAF-WorldCat and remove exceptions: 57962
  • Submitted 3 QuickStatements batches with the following number of lines
  20001 wd-identities-new-00
  20001 wd-identities-new-01
  17963 wd-identities-new-02

--Vladimir Alexiev (talk) 16:53, 10 April 2020 (UTC)[reply]

  • The refresh is pretty much finished. The 3 batches are finishing up: I've done "clear errors" to add the failed references, which may have caused some duplicate statements, see below.
  • I also reported the errors to OCLC though @Florentyna: is still adding more error reports. Cheers! --Vladimir Alexiev (talk) 09:03, 13 April 2020 (UTC)[reply]

avoiding duplicates[edit]

I got a number of reverts due to duplicates, so I'll try to fix them.

There are a number of "legitimate" duplicates due to several VIAF IDs. I wrote a query to find items that have two different statements but with the same value:

select ?x ?xLabel ?s1 ?s2 ?v {
  ?x p:P7859 ?s1,?s2
  filter(str(?s1)<str(?s2))
  ?s1 ps:P7859 ?v.
  ?s2 ps:P7859 ?v.
} limit 100

1. I've created some due to QS race conditions: item is created, reference can't be created because it still doesn't see the new item; then I "Reset errors" on the 3 batches, which creates some duplicates. I won't "reset errors" anymore, so hopefully that'll stop

2. @Gamaliel: is creating 800 duplicates through adding qualifier "quantity", see https://www.wikidata.org/wiki/Topic:Vkf0s6bt3kzm14j0. So I tried to filter out this reason but the query times out:

select ?x ?xLabel ?s1 ?s2 ?v {
  ?x p:P7859 ?s1,?s2
  filter(str(?s1)<str(?s2))
  ?s1 ps:P7859 ?v.
  ?s2 ps:P7859 ?v.
  ?s1 ps:P7859 ?v filter not exists {?s1 pq:P1114 ?q1}
  ?s2 ps:P7859 ?v filter not exists {?s2 pq:P1114 ?q2}
} limit 100
  • Will this query include the legitimate duplicates due to multiple VIAF IDs? If I can get a query that includes no false positives I can run a QS batch to eliminate them. Gamaliel (talk) 14:40, 19 April 2020 (UTC)[reply]
  • @Gamaliel: This does not return legitimate duplicates: it looks for the same item ?x having the same WorldCat ?v through two different statements. You have to process them in slices of 100 else the query times out. It would be good if you could preserve my References but that's not crucial because the form of WorldCat id shows where it came from (VIAF or LCCN). Cheers! --Vladimir Alexiev (talk) 14:08, 20 April 2020 (UTC)[reply]
select ?x ?s1 ?s2 ?v ?q1 ?q2 {
  ?x p:P7859 ?s1,?s2
  filter(str(?s1)<str(?s2))
  ?s1 ps:P7859 ?v.
  ?s2 ps:P7859 ?v.
  optional{?s1 pq:P1114 ?q1}
  optional{?s2 pq:P1114 ?q2}
} limit 100
Try it!

The "count" query on top suggests about 19k duplicates (the difference between values and items with value). @Gamaliel: Any progress? --Vladimir Alexiev (talk) 23:29, 30 April 2020 (UTC)[reply]

I'm been running another unrelated process in Open Refine for a couple of days. I didn't think it would take this long, but I'll work on this as soon as that one is finished. Gamaliel (talk) 01:53, 1 May 2020 (UTC)[reply]
@Vladimir Alexiev: Just ran a batch of 100 to test it out: https://tools.wmflabs.org/editgroups/b/QSv2T/1589562172614/ If everything looks good I will increase the number of edits with the next batch. Gamaliel (talk) 17:06, 15 May 2020 (UTC)[reply]
@Gamaliel: the query now returns just 58 results, so you're close to eliminating all these duplicates. cheers! --Vladimir Alexiev (talk) 16:57, 24 May 2020 (UTC)[reply]

Conflations[edit]

Please see Wikidata:Project_chat#Question_about_conflation_in_WorldCat_identities --- Jura 05:38, 3 July 2020 (UTC)[reply]

Now: Wikidata:Project chat/Archive/2020/07#Question about conflation in WorldCat identities. --Kolja21 (talk) 05:41, 5 April 2023 (UTC)[reply]

outdated WorldCat values[edit]

hi! by verifying hundred of WorldCat values together with thousend of authority control statements please note:

these values change dynamically with passing of time
especially values of the form "viaf-foo" may change to "lccn-bar". in many cases the links redirect in approximately 10 sec. sometimes they do not redirect. many values of th form "np-foo" are as well subject to hanges.
the presence of the "Library of Congress ID" might be a strong indication that an update is required.

please leave a short note if you can take (partially) care on this issue. thanks in advance! kind regards gangleri aka lery raynhart aka 17:37, 24 November 2022 (UTC) no bias — קיין אומוויסנדיקע פּרעפֿערענצן — keyn umvisndike preferentsn talk contribs no bias — קיין אומוויסנדיקע פּרעפֿערענצן — keyn umvisndike preferentsn talk contribs 17:37, 24 November 2022 (UTC)[reply]