Wikidata:WikiProject Authority control/WorldCat Identities errors

From Wikidata
Jump to navigation Jump to search

Intro[edit]

@Merilee: (with OCLC): The best way to alert OCLC about such errors is to send an email to bibchange@oclc.org.

I've imported 1.7M WorldCat Identities from VIAF to Wikidata (86 batches of 20k each): https://tools.wmflabs.org/quickstatements/#/batches/Vladimir%20Alexiev. The batches are now completed.

I got many reverts that I am recording here as errors , so that OCLC can fix them, and so subsequent reimports can skip them. Discussions:

--Vladimir Alexiev (talk) 15:29, 25 March 2020 (UTC)[reply]

Wrong entries[edit]

Columns: Wikidata ID, VIAF ID, WorldCat Identity (User who reported error): Note

  • Q57778033, 135186065, lccn-no98076226 (Syrio): it's a different church
  • Q153805, 451153289878932770009, viaf-451153289878932770009 (Maitake): The WorldCat link leads nowhere, there must be some mistake in it
  • Q3173342, 51725322, lccn-nr88012377 (François Malo-Renault): Problème d’attribution pas Jean Malo-Renault mais Émile Malo-Renault: Chansons de France; Ragotte by Jules Renard (illustration Book)
  • Q4954204, 63019463, lccn-n86008422 (The-Pope): footballer from Australia vs American author. Sick of these messed up world cat & viaf links being incorrectly added. How do we stop them?
  • Q61782159, 69305354, lccn-n85273696 (EncycloPetey): That WorldCat entry mixes more than one author. It's not clear who it's meant to be.
  • Q65429637, 10722105, viaf-10722105 (Florentyna): WorldCat entry is a collection of different people. This guy here only wrote 1 book. VIAF shows only 1 book.
  • Q11766005, 74151776777418011451, viaf-74151776777418011451 (Maitake): WorldCat link leads nowhere, there must be some mistake in it
  • Q35387360 , 126388866, viaf-126388866 (Tacsipacsi): these refer to Kaiserswerther Diakonie (Q1721765)
  • Q49493243, 83146936673413780942, lccn-no2016081285 (Richard Arthur Norton): This is for a cemetery in Connecticut
  • Q7376409, 86557549, lccn-n94075181 (Quakewoody): contains data for at least 2 people named Ruby Wright
  • Q4558691, 144230861, lccn-n79023328 (Pi.1415926535): geographic location vs railway station
  • Q9091588, 147444432, lccn-n80044968 (Pi.1415926535): geographic location vs railway station
  • Q7252977, 134118634, viaf-134118634 (Pi.1415926535): geographic location vs railway station
  • Q6747010, 140335927, lccn-no97018101 (Pi.1415926535): musical group vs railway station
  • Q81218263, 152354794, viaf-152354794 (Alienautic): not the same person
  • Q70009481, 140946366, viaf-140946366 (Panek): urban square vs publisher
  • Q62022935, 156151702,lccn-n92045028 (Sturm): musical group vs model railway company
  • Q15,123310773,lccn-n94103460 (Paweł Ziemian): musical group vs the continent (!!)
  • Q7020162,149050275,lccn-n89638121 (Pi.1415926535): village vs railway station
  • Q869691,201196481,lccn-n85001033 (Raymond): twelve Romanesque churches of Cologne *ensemble* vs Förderverein Romanische Kirchen Köln *association*
  • Q1274600,279598240,viaf-279598240 (Florentyna): Badminton player vs specialist in Protection of consumer goods and Intellectual Property Infringement Measures
  • Q2379388,282514559,viaf-282514559 (Florentyna): scientist (2010 doctorate) vs badminton player (participation in World Championships at the same time)
  • Q12373038,141838568,lccn-n90712773 (2001:7d0:81f7:b580:58d7:f635:71e1:6dc9): institution vs building
  • Q2346055,3425149068447765730000,viaf-3425149068447765730000 (2001:7d0:81f7:b580:58d7:f635:71e1:6dc9): broken Worldcat link
  • Q859010,153166745,viaf-153166745 (2001:7d0:81f7:b580:58d7:f635:71e1:6dc9): a castle vs the city of Tallinn
  • Q7722735,lccn-n2004101827,149880451 (2001:7d0:81f7:b580:bd3a:ff32:aad0:f96): palace vs art museum
  • Q3743006,263256345,lccn-nr94027511 (2001:7d0:81f7:b580:bd3a:ff32:aad0:f96): church in England vs Tallinn
  • Q28002467,25836742,lccn-n00047916 (Shadess): Chinese kickboxer vs physicist/mathematician
  • Q12365987,136230123,lccn-n2001023793 (2001:7d0:81f7:b580:cc1a:205b:ccd5:aa2d): parish in Estonia vs church in Germany
  • Q1240598,247813328,viaf-247813328 (2001:7d0:81f7:b580:cc1a:205b:ccd5:aa2d): broken Worldcat link
  • Q951302,237697409,viaf-237697409 (2001:7d0:81f7:b580:cc1a:205b:ccd5:aa2d): (!) WorldCat link is about a musician Sarema, VIAF is about Saaremaa Island (Estonia)
  • Q666221,255955607,viaf-255955607 (2001:7d0:81f7:b580:cc1a:205b:ccd5:aa2d): Estonian village vs mountain in Uganda. (!) VIAF record has different WD: Q35316081
  • Q3467256,255773589,viaf-255773589 (2001:7d0:81f7:b580:cc1a:205b:ccd5:aa2d): village in Estonia vs stream in Congo, commune in Nepal, Chinese animation company (seems like NDL JP mixup)
  • Q1024189,168337407,viaf-168337407 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): Village in Estonia vs Pyramids in Sudan
  • Q1009062,153719114,lccn-n88291234 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): village in Estonia vs city in Korea
  • Q963957,126286705,viaf-126286705 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): village in Estonia vs Siera Leone vs a (German?) musician
  • Q3464352, 914145857899923021256, viaf-914145857899923021256 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): village Soe, Estonia vs British special forces SOE
  • Q11034374,146677361,lccn-n87880252 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): village in Estonia vs something Japanese (from NDL JP)
  • Q3456799, 5259153834756564450009, viaf-5259153834756564450009 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): broken WorldCat link
  • Q3469834,146712044,lccn-n90676295 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): village vs an album(?) Rouge by Desmond Child
  • Q3462210,123956743,viaf-123956743 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): township vs some musical album(?)
  • Q3458520,143088736,lccn-n82070240 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): township vs castle
  • Q102158,136049295,lccn-n81035878 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): city vs cotton factory(?)
  • Q300656,154917432,lccn-n93025991: WorldCat link is broken
  • Q1010051,161398816,viaf-161398816: township vs musical group
  • Q1004865,238766473,viaf-238766473: WorldCat link is broken
  • Q724294,246325928,viaf-246325928: village Aseri in Estonia vs Alta Scuola di Economia e Relazioni internazionali (ASERI)
  • Q724304,246325928,viaf-246325928: municipality Aseri in Estonia vs Alta Scuola di Economia e Relazioni internazionali (ASERI)
  • Q3463981,252719769,viaf-252719769: village in Estonia vs a musical artist vs something Japanese
  • Q183436,253927315,viaf-253927315: village Aidu in Estonia vs Associazione italiana di diritto urbanistico vs something Japanese (AIDU )
  • Q3477461,260340357,viaf-260340357: village in Estonia vs something Japanese (Curl International Corporation?)
  • Q157835,316603361,lccn-n81062996 (Movses): Zaporijia city (Ukraine) vs pedagogical institute in that city. I left the WorldCat link, but is it correct?
  • Q3997302,94957142,lccn-n92003750 (Eru): Scottish/Canadian musician vs English(?) historian Q87183665
  • Q2986901,38726920,viaf-138726920 (Nezdek): current French intercommunal structure vs historic one Q60846274
  • Q2136287,156495956,lccn-n80126081 (Tacsipacsi): church building vs world-wide religious denomination Q42504
  • Q2136287,140677748,lccn-nr2004029629 (Tacsipacsi): church building vs world-wide religious denomination Q42504
  • Q5981319,316734778,viaf-316734778 (Galopax): Argentinian publishing house (Editorial Losada) vs Spanish city
  • Q86460453,745948,viaf-745948 (WaldiWuff): WorldCat broken link
  • Q698497,312869877,lccn-no2008005239 (GodeNehler): Kaufhof-Warenhaus AG (Köln) vs Galeria Kaufhof Berlin-Alexanderplatz vs Bahnhof Berlin-Alexanderplatz
  • Q469006,8397152502865010800009,viaf-8397152502865010800009 (Florentyna): WorldCat wrongly includes a book "Sixteen Lectures on Chemistry and Life" that is not related to the badminton player
  • Q26383339,135313059,viaf-135313059 (Peter James): French musical group vs building in Gloucestershire, UK
  • Q12373345,126381680,lccn-no93035773 (2001:7d0:81f7:b580:58b8:ead5:92f1:e19f): WorldCat mixes up several Ministries of Finance: at least Estonia and Netherlands
  • Q5401581,125696051,lccn-n2005013401: WorldCat mixes up Maritime Administration of Estoni vs Finland
  • Q448676,307258140,viaf-307258140 (Florentyna): WorldCat mixes Danish badminton player with a French(?) academic, or a Canadian badminton player (stated by Florentyna, I don't see that)
  • Q12376789,143463696,lccn-no2008147815: VIAF and WorldCat mix up the Health Care Board of Estonia vs Equador
  • Q581794,166530592,viaf-166530592: Tartu Observatory vs Tartu University, Physics Institute Q18625048. Mixup comes from LCNAF http://id.loc.gov/authorities/names/no2009101361
  • Q581794,5017149719113511130001,lccn-no2009101361: Tartu Observatory vs Tartu University, Physics Institute Q18625048. Mixup comes from LCNAF http://id.loc.gov/authorities/names/no2009101361 . WorldCat has two codes (viaf-166530592,lccn-no2009101361) for the same thing
  • Q7686866,203893134,viaf-203893134: anon Estonian user claims mixup between Tartu Art School and Tartu Art College (Q7686865)
  • Q604487,133037006,lccn-n91124605: mixup between Tallinn University of Technology and its Library (Q12376203)
  • Q12374025,758149719128411130000,viaf-758149719128411130000: mixup between State Nature Conservation Centre and Estonian Radiation Protection Centre (Q12366955)
  • Q20529203,165833969,lccn-n84116159: mixup between Social Insurance Board of Estonia vs Poland
  • Q16411346,157397052,lccn-no94010029: mixup between Tax and Customs Board of Estonia, Denmark and perhaps other countries
  • Q12373165,57149542576400300130,viaf-57149542576400300130: WorldCat page mixes up Ministries of Agriculture of several countries, at least Estonia and Germany
  • Q25514677,3621149544587600490007,viaf-3621149544587600490007: WorldCat page mixes up Labour Inspectorate of Estonia vs Netherlands
  • Q1353513,103695322,viaf-103695322 (Florentyna): badminton player vs researcher (published on Photosensibilisation)
  • Q1366280,121494915,viaf-121494915 (Florentyna): badminton player vs art photographer?
  • Q2064578,2045147553259353800005,viaf-2045147553259353800005 (Florentyna): WorldCat: CZ badminton player vs RO scholar
  • Q707205,2320152502962710800003,viaf-2320152502962710800003 (Florentyna): WorldCat broken link
  • Q456077,253366631,viaf-253366631 (Florentyna): WorldCat broken link
  • Q1174457,142324289,lccn-no2010172479 (Florentyna): badminton player vs radio personality/musician from the 1980s
  • Q961729,305858487,lccn-n2012208141 (Florentyna): badminton player vs Computers in Finance researcher vs musical artist?
  • Q464630,292159880,viaf-292159880 (Florentyna): WorldCat broken link
  • ‪Q1606237,287980884,viaf-287980884‬ (Florentyna): badminton player vs oceanographic researcher
  • Q506333,280709327,viaf-280709327 (Florentyna): WorldCat broken link
  • Q12521649,280280523,viaf-280280523 (Florentyna): WorldCat broken link
  • Q464589,283692043,viaf-283692043 (Florentyna): Indonesian badminton player vs someone (or several people) Chinese
  • Q523745,3101152502843410800008,viaf-3101152502843410800008 (Florentyna): WorldCat broken link
  • Q2161742,3121145857149622922167,lccn-nb2016003356 (Florentyna): badminton player vs employee competencies researcher
  • Q26945269,311276121,viaf-311276121 (Florentyna): badminton player vs researcher in interdisciplinary science teaching and language acquisition
  • Q319346,316739207,viaf-316739207 (Florentyna): badminton player vs molecular biologist (Cytoplasmic Function)
  • Q1207551,81306904,lccn-n2012078459 (Florentyna): Indian badminton player vs some Indian soap-opera stories?
  • Q16410059,126130153,viaf-126130153: WorldCat: Estonian publisher vs something Danish (publisher? group?)
  • Q20530239,268973301,viaf-268973301: VIAF: Veterinary and Food Board of Estonia vs France
  • Q116014,9497154983567667860002,viaf-9497154983567667860002: WorldCat: Estonian railway vs 17th century publisher (E.R.)
  • Q16411248,233872304,viaf-233872304: Eesti Lairiba Arenduse Sihtasutus (Estonian organization) vs European Landscape Architecture Student Association (ELASA)
  • Q20529164,131739370,lccn-no2008079055: VIAF: Estonian carsharing company vs some musical group vs Norwegian noble family
  • Q22675031,126013078,viaf-126013078: VIAF: Greek traditional music band vs Estonian publisher (Q16410559) vs vehicle manufacturer in Ontario, Canada (Q4653685). VIAF merges several WD records, which is almost always wrong
  • Q20529148,lccn-no95036559: mixup of two Estonian art organizations. Q20529148=n2014077632= lccn-n2014077632 while Q12361278=126399458=lccn-no95036559
  • Q25522016,132513394,viaf-132513394: Säde (Estonian voluntary association) vs Sade (musical group)
  • Q12047031,151454481,lccn-n99006430: PRISMA Finnish hypermarket chain vs Programa Regional de Inversiones y Servicios de Madrid
  • Q3736474,302309365,viaf-302309365: Panga cliff (Estonia) vs some Polish theatrical plays ?!?
  • Q7632840,4527149544594900490007,lccn-n83011677: VIAF: Estonian Socialist Workers' Party vs Ukrainian Communist Party
  • Q12361220,13149542762100301255,viaf-13149542762100301255: Eesti Haridusliit vs Q16412608 Eesti Vabaharidusliit (Educational Association vs Non-formal Education Association)
  • Q913551,126702489,lccn-no89008175: Social Democratic Party of Estonia (Q913551) vs Turkey (Q6027832)
  • Q16408920,150700536,viaf-150700536: Estonian insurance company (Q16408920) vs Estonian newspaper (Q16412891) vs Estonia the country (Q191)
  • Q3179327,84896940,viaf-84896940 (Mormegil): Czech volleyball player vs engineer
  • Q44594687,4297151717062113900003,viaf-4297151717062113900003 (Silewe): politician vs social worker
  • ‪Q47344187‬,220491045,viaf-220491045 (Silewe): Austrian politician vs deutscher Arzt
  • Q24413760,43021015,lccn-n85198967 (MrProperLawAndOrder): American slave vs English painter Q6142248
  • Q46847386,309621505,viaf-309621505 (Silewe): Maurice Walter vs Walter Maurice
  • Q46141435,317076951,viaf-317076951 (Silewe): Politiker vs deutscher Maschinenbauingenieur
  • Q40999439,27894962,lccn-n93095321 (Silewe): Politiker vs Psychologe (geb. 1959)
  • Q21544895,49581672,lccn-n83825277 (MrProperLawAndOrder): another person
  • Q15814835,81771751,lccn-n90639661 (Axolotl Nr.733): politician vs geographer
  • Q63838,64866631,viaf-64866631 (MrProperLawAndOrder): Nazi general vs chemist who worked on coal carbonization (Q93939900). The WorldCat page is deleted

More wrong entries[edit]

Here's a further (raw) dump of reverts of my edits between 28 December 2021 and 10 July 2022.

OK entries[edit]

  • Q525207,133910243,lccn-n2006084742 (2001:7d0:81f7:b580:413a:d22d:3664:ac10): WorldCat page is about the Helme municipality

Conflation in WorldCat identities[edit]

I've been side-tracked into trying to clean up some authority control links and have been discovering what many of you probably already know, that it's practically the Herculean labor of the Augean stables in the case of a common name, what with bots merrily linking to a slight resemblance or to a mistake that another bot already made. But it's usually possible either to move an authority to its correct item or to create a conflation item for it. WorldCat Identities puzzle me, though. They are usually linked to a Library of Congress authority record, but they then go on to list books supposedly by that author, which in most cases aren't, but are rather by multiple different authors. The LOC record doesn't name those books, so I don't know where WorldCat is getting its authorship information. Should I treat such WorldCat records as conflations? Levana Taylor (talk) 03:46, 3 July 2020 (UTC)[reply]

  • @Levana Taylor: You describe a serious problem and I think your description is accurate, and your comparison to the Augean stables is apt. Two questions:
    • Utility vs errors: given these errors, is it worth keeping WorldCat identities? I'd say yes because it points to relevant publications, although there are also irrelevant ones.
      • I personally find the ratio of relevant to irrelevant in Identities listings low enough that they're not useful to me, but other people may disagree; also WorldCat might well find a way of improving their matching in the future, so yeah, they're a worthwhile external identifier for WD. It would be nice if WorldCat changed their heading from "works by this author" to "works which may be by this author" so that unwary visitors would know what they're getting :-) In the meantime, is there a way to warn WD editors not to cite Identities as a source of authorship attribution? Levana Taylor (talk) 21:18, 5 July 2020 (UTC)[reply]
    • How to improve the situation. I don't think we can help but we can record problems and submit them to OCLC for consideration (see start of this page). --Vladimir Alexiev (talk) 09:36, 3 July 2020 (UTC)[reply]
  • Hi @Levana Taylor: I don't think WD still have a good solution of this difficult problem, but soon a property will be created "error page for this external-id" that will at least create more visibility.

Error in LCNAF/WorldCat Identities[edit]

It appears that there is an incorrect link in the English Wikipedia article for Amine (French singer) (Amine (Q2843405)). The existing LC authority record [ n2014008348 ] links to a French/Moroccan oud player also named Amine (full name: Amine M'Raihi); I don't think the singer has an LC name authority record. ID.LOC.GOV link for the oud player: http://id.loc.gov/authorities/names/n2014008348 The VIAF cluster combines both of them; the RERO link also is for the oud player. Not sure this is the proper place for this, but when I went to Wikipedia:VIAF/errors, it sent me here to report WorldCat Identities problems.--FeanorStar7 (talk) 16:03, 8 January 2021 (UTC)[reply]