Property talk:P1447

Documentation

Sports-Reference.com Olympic athlete ID
identifier for an Olympic athlete at www.sports-reference.com/olympics/athletes/
Represents sports-reference.com (Q18002875)
Data type External identifier
Corresponding template Template:Sports-reference (Q10964954), Template:Cite sports-reference (Q14444672)
Template parameter "1" or "ID" in en:Template:Sports-reference and no:Mal:Sports-reference
Domain human (Q5) and business enterprise (Q4830453)
Allowed values ([a-z]{2}|[aeiou])/(\1[a-z\-]*|[a-z\-]*-\1[a-z\-]*)-1?\d
Example Usain Bolt (Q1189): bo/usain-bolt-1
Anne Jahren (Q256757): ja/anne-jahren-1
Formatter URL http://www.sports-reference.com/olympics/athletes/$1.html
Robot and gadget jobs My bot can import values from various Wikipedias
Tracking: differences no label (Q32582750)
Tracking: usage Category:Pages using Wikidata property P1447 (Q26759926)
Tracking: local yes, WD no Category:Sports Reference ID not in Wikidata (Q17783922)
See also TheSports.org athlete ID (P4391), International Olympic Committee athlete ID (P3171)
Lists
Proposal discussion Property proposal/Archive/25#P1447
Current uses 100,185
Format “|([a-z]{2}|[aeiou])/(\1[a-z\-]*|[a-z\-]*-\1[a-z\-]*)-1?\d”: value must be formatted using this pattern (PCRE syntax).
Exceptions are possible as rare values may exist. Known exceptions: Evert-Jan 't Hoen (Q2553574)
List of this constraint violations: Database reports/Constraint violations/P1447#Format, SPARQL
Single value: this property generally contains a single value.
Exceptions are possible as rare values may exist. Known exceptions: Q16231840, Q740650, Q289705, Q499846, Q499917, Q2452902, Q2662759, Q3641580, Q15075503, Q5001045, Q16203439, Q1061577, Q5259172, Q5316780, Q927816, Q1336053, Q5368170, Q1356785, Q5409760, Q5444507, Q919733, Q16223213, Q1340962, Q3445567, Q5504921, Q1715619, Q566370, Q1109723, Q61846, Q263287, Q1815158, Q246381, Q5626870, Q517722, Q373071, Q443568, Q5733248, Q1618073, Q5771331, Q717789, Q2532606, Q14552162, Q970076, Q11974062, Q1700988, Q6370901, Q3430530, Q316356, Q11870922, Q6395403, Q6396527, Q6408633, Q486585, Q2420479, Q3260687, Q459402, Q2421787, Q2736867, Q2649291, Q6759191, Q19637513, Q10092, Q1907283, Q1661933, Q452168, Q377701, Q1381850, Q3601707, Q6891018, Q2427454, Q458032, Q1370766, Q1705695, Q2898896, Q3361650, Q2020853, Q822208, Q3887865, Q455843, Q180749, Q197772, Q1724823, Q2121556, Q25905, Q448002, Q433232, Q3069124, Q1314733, Q722261, Q4398834, Q4919256, Q518305, Q118002, Q3647280, Q718913, Q8082853, Q254252, Q265376, Q312283, Q337053, Q350107, Q381591, Q392545, Q452443, Q456836, Q460656, Q517339, Q558495, Q562429, Q562562, Q710874, Q762557, Q928042, Q952921, Q976239, Q981381, Q1032960, Q1252572, Q1252587, Q1340371, Q1404883, Q1546613, Q1560788, Q1572172, Q1705469, Q2347411, Q2456182, Q3142423, Q3391342, Q3443170, Q3616497, Q3742857, Q3763056, Q4681562, Q4767316, Q4780715, Q5200166, Q5247949, Q5353179, Q5556353, Q5991746, Q6037371, Q6960456, Q7398279, Q7927731, Q8046727, Q8048693, Q8059776, Q16073404, Q16141359, Q17182972, Q18201556, Q26088, Q3293885, Q3434227, Q7264075, Q7399836, Q11852995, Q1942707, Q934207, Q982021, Q2029220, Q7308471, Q16599539, Q534823, Q2031229, Q2628951, Q2635179, Q2683239, Q3386606, Q3425074, Q3732142, Q4024416, Q6150532, Q6258572, Q6303584, Q6413996, Q7493833, Q7792799, Q7965202, Q11735674, Q11969923, Q16015733, Q17489845, Q24006480, Q5516024, Q1392330, Q2628634, Q4818311, Q6830784, Q237507, Q434216, Q438757, Q482912, Q2030985, Q3723812, Q3887070, 
Q4526063, Q4777566, Q5070382, Q6790873, Q7342102, Q9440273, Q16202855, Q16612108, Q16658139, Q19728729, Q26950410, Q112435, Q10118, Q229124, Q359540, Q530898, Q550983, Q616779, Q1741512, Q1772497, Q1892545, Q2537992, Q2697837, Q53983, Q172794, Q432444, Q440917, Q1338391, Q3432399, Q5483870, Q5561399, Q6231603, Q7106350, Q15976230, Q16749849, Q18920535, Q13021345, Q3888663, Q432210, Q289490, Q293331, Q3440522, Q4081269, Q7347762, Q16561142, Q19655566
List of this constraint violations: Database reports/Constraint violations/P1447#single value, SPARQL
Distinct values: this property likely contains a value that is different from all other items.
List of this constraint violations: Database reports/Constraint violations/P1447#distinct values, hourly updated report, SPARQL (every item), SPARQL (by value)
Type “human (Q5), business enterprise (Q4830453)”: element must contain property “instance of (P31)” with classes “human (Q5), business enterprise (Q4830453)” or their subclasses (defined using subclass of (P279)).
List of this constraint violations: Database reports/Constraint violations/P1447#Type Q5, Q4830453, hourly updated report, SPARQL
Item “participant of (P1344)”: Items with this property should also have “participant of (P1344)”.
Exceptions are possible as rare values may exist.
List of this constraint violations: Database reports/Constraint violations/P1447#Item P1344, SPARQL
This property is being used by:

Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.)

Discussion

Only Olympics?

It seems to me that P1447 as currently defined only addresses the "Olympic" section of sports-reference.com. While I'm absolutely fine with this, as it fulfills my purpose, I'm wondering if the definition of the property should stress that only Olympians are addressed? --VicVal (talk) 00:03, 25 October 2015 (UTC)

Format

@Jon Harald Søby: I think it would be great to see what the results would be if we changed the format a little. See for example lo/pierre-lorin-1. The last word before the digit begins with the same two letters that come before the slash. Although there are exceptions (IDs with only one letter before the slash, and people with a different word order, Ted Huang (Q2399776), for example), this should highlight some errors, I think. Ideas? --Edgars2007 (talk) 13:16, 31 January 2016 (UTC)

@Edgars2007: I think there would be way too many exceptions for the list to be useful, precisely because of East Asian performers like {{Q|2399776}}. It would be better to extract all values in some way and match them against a regex separately, I think. Jon Harald Søby (talk) 14:15, 31 January 2016 (UTC)
@Jon Harald Søby: yes, that was my second option. But as there are more than 85k items with this ID, I didn't want to download them :D Maybe you could perform such a scan? --Edgars2007 (talk) 14:25, 31 January 2016 (UTC)
@Edgars2007: An idea could be to change it temporarily for a day or two, so the constraint report will report on those errors, and then revert to the normal format? That way we can copy the results from that day to work on. Jon Harald Søby (talk) 14:28, 31 January 2016 (UTC)
@Jon Harald Søby: yeah, sure. That would be fine. --Edgars2007 (talk) 14:35, 31 January 2016 (UTC)
@Edgars2007: Okay, this regex should do the trick: ([a-z]{2}|[aeiou])/(\1[a-z\-]*|[a-z\-]*-\1[a-z]*)-1?\d. It will match values of the form xx/xxyyy-yyy-0 or xx/yyy-xxyyy-0, so that it handles East Asian name order as well. If there aren't too many false positives it could be made the default regex. I'll replace it here now, so the report tomorrow or the day after will include these matches. Jon Harald Søby (talk) 14:46, 31 January 2016 (UTC)
@Jon Harald Søby: OK, current results are pretty bad :D First two letters before slash should match any word beginning. Then the format should work fine. --Edgars2007 (talk) 06:46, 1 February 2016 (UTC)
@Edgars2007: Okay, fixed. 😊 Jon Harald Søby (talk) 08:40, 1 February 2016 (UTC)
@Jon Harald Søby: Thanks, now it looks fine. Looked at some 10 random items, all of them were wrong. Nice job :) If you could take a look at your talkpage at nowiki and help with that template we can call it Latvian-Norwegian collaboration week :) --Edgars2007 (talk) 07:07, 2 February 2016 (UTC)
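
The pattern that ended up as the format constraint in the documentation above can also be tried locally before touching the constraint itself. Below is a minimal Python sketch, assuming that Python's `re` module treats this pattern the same way as the PCRE engine behind the constraint reports (it only uses character classes and one backreference). `is_plausible_id` is a hypothetical helper name, and `bo/usain-smith-1` is a made-up value used only to show a mismatch.

```python
import re

# Format constraint as listed in the property documentation.
# Group 1 is the one- or two-letter prefix before the slash; \1 then
# requires some word of the name slug to start with that same prefix.
ID_PATTERN = re.compile(
    r"([a-z]{2}|[aeiou])/(\1[a-z\-]*|[a-z\-]*-\1[a-z\-]*)-1?\d"
)


def is_plausible_id(value: str) -> bool:
    """Return True if the whole value matches the format constraint."""
    return ID_PATTERN.fullmatch(value) is not None


candidates = [
    "bo/usain-bolt-1",    # example from the documentation
    "ja/anne-jahren-1",   # example from the documentation
    "lo/pierre-lorin-1",  # example from this thread
    "bo/usain-smith-1",   # made up: no word starts with "bo"
]
for candidate in candidates:
    print(candidate, is_plausible_id(candidate))
```

A check like this only flags values whose prefix does not match any word in the slug. As noted in the thread, a flagged value can still be a legitimate exception.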

Probably some changes required soon

Bill Mallon hints at changes to sports-reference/oly. --62.159.86.77 12:55, 22 August 2016 (UTC)

Thanks for letting us know. Just a note: even after sports-reference is taken down, the statements do not need to be removed immediately. We will probably need a new property for the new website once the situation is clearer, and it might be useful to keep the old links in place even if they are dead (we might want to remove the URL pattern when it's dead, or something similar). —MisterSynergy (talk) 15:00, 22 August 2016 (UTC)
Yes, thanks for sharing. Well, let's hope the new statistics profiles will be good. And that they'll provide some profile redirect, so that adaptation here goes more smoothly and the work we did this year isn't thrown in the garbage. --Edgars2007 (talk) 15:25, 22 August 2016 (UTC)

They took it offline today [1]. I suggest keeping the claims for now; they might or might not be useful for the successor website they promise to make. Is there any way to mark this database as offline in the meantime? —MisterSynergy (talk) 18:42, 1 December 2016 (UTC)

Small update December 16, 2016

There's been an update:

Site Closing: We are sorry to inform you that due to a change to our data licensing agreement we are shutting down our Olympic site sometime during the early part of 2017. The providers of our dataset are working with another publisher to create an extensive site chronicling the history of the Olympic Movement. We will provide information here when that site is available. We will continue to have Olympic Ice Hockey Stats and Olympic Basketball Stats. More Information from Our Data Providers and their Future Plans.
—Updated December 16, 2016.

Best regards Migrant (talk) 03:07, 19 December 2016 (UTC)

New information from January 2, 2017

From a reply to a questioner named Liam, there are some new bits of information from BMallon:

Thanx for your kind words. I think it will be fine when the IOC takes over our data. We have our own private website, http://www.olympedia.org, which is the source for the sports-reference site. Olympedia is what the IOC is purchasing from us and we think Olympedia is even better than SR/olympics, although the styles are little different. I don’t think we can donate stuff to Wikidata because of contractural obligations with the IOC. However, I have been in contact with the Wikipedians who do Olympic stuff and talked to the IOC about this so that we can preserve links for them on Wikipedia so they don’t lose all their references. Hopefully we’ll be able to work that out.
—January 2, 2017 at 8:31 PM

Best regards Migrant (talk) 01:25, 11 January 2017 (UTC)

New information from February 24, 2017

Question asked January 24, 2017

Thanks for your great work with olympic statistics. I am one of those who contribute to Wikipedia about mainly winter olympic results (preferably speed skating) and such related biographies, and I just wonder in what type of time frame do you expect to see these statistics published publicly elsewhere for instance at IOCs websites ? Do you think it will be available in time for the 2018 Olympic Winter Games, updated with the 2016 Olympic Summer Games results ?
—Best regards Frank Skillinghaug

Answered February 24, 2017

Frank – no timeline for when our private site, Olympedia, goes public as the IOC statistical site. We have signed our contracts with the IOC and are now in discussions with the IT people so things are moving along. Sports-reference will stay open until the IOC site becomes available.
—– Bill M

Best regards Migrant (talk) 18:41, 17 March 2017 (UTC)

2018 Winter Olympics and still no real database like the other one

It has been more than a year since the first hint of this change of home for Olympic stats. Would it be an idea to contact the IOC about this new database, with reference to the links and answers quoted above? There have been no updates on this at the olympstats.com site, and the winter season is building up, with only 159 days to go (as of 2 September 2017) until the 2018 Winter Olympics in Pyeongchang, Republic of Korea. Best regards Migrant (talk) 14:37, 2 September 2017 (UTC)

My recommendation would be to contact Bill Mallon of Sports-Reference.com. He has a Wikimedia account (User:Billbambam) with an email address at enwiki. —MisterSynergy (talk) 18:29, 2 September 2017 (UTC)
Disagree there... I think it would be better to contact the right person at the IOC to ask about this upcoming database, with reference to what Mr. Bill Mallon has said at the Olympstats.com site. But who would be the correct person to ask there? Bill Mallon might have the answer to that, though. Best regards Migrant (talk) 22:14, 3 September 2017 (UTC)
Well, that’s basically the problem with asking “the IOC”. It is a big organization, and I doubt that they really care about our needs—unlike Mallon somewhat does. If he can’t help us it would still be possible to think about alternatives… —MisterSynergy (talk) 04:50, 4 September 2017 (UTC)
Yeah, that's right, the IOC is a bit bigger than an ordinary small-club organization, but they are only people like the rest of us. So if we can find the right person at the IOC, I really think we should ask about a possible timeline for the database to be released at the new site. Is this a task you, as an administrator, could or would take on, asking and then sharing the answer here?
BTW, have you seen the new, but not yet complete, Olympic database at http://www.olympiandatabase.com/index.php? Best regards Migrant (talk) 00:19, 5 September 2017 (UTC)
If you want to approach the IOC directly, you’ll probably just try a generic email address such as info <at> olympic <dot> org and hope that they forward it to the right person. Larger organizations often do not expose information about staff positions such as database engineers to the public. IMO it is not relevant to be a Wikidata admin (which is an internal role); much more important is that you give your real name.
I have seen the other site, but I am not yet convinced whether it is useful. Many results are still missing. —MisterSynergy (talk) 05:03, 5 September 2017 (UTC)

Broken links

Quote from http://www.sports-reference.com/olympics/: "Site Closing We are sorry to inform you that due to a change to our data licensing agreement we are shutting down our Olympic site effective December 1st, 2016. The providers of our dataset are working with another publisher to create an extensive site chronicling the history of the Olympic Movement. We will provide information here when that site is available. "

I set the formatter url to deprecated. Multichill (talk) 21:28, 1 December 2016 (UTC)

  • At least some of these do not seem to be broken as of right now (was the website restored?) In any case we can change the formatter URL to use archive.org - prefix with "https://web.archive.org/web/*/" and it should be useable (the ones I tried had a handful of archive.org copies). ArthurPSmith (talk) 15:44, 2 December 2016 (UTC)
    • I just reverted Multichill's deprecation of the formatter URL, setting it back to normal. The deprecation was correct yesterday, when the links were broken, but as you say they look fine today. Users Multichill, Stryn and Sjoerddebruin can verify this. See also this blog post about it: http://olympstats.com/2016/08/21/the-olymadmen-and-olympstats-and-sports-reference/ . It certainly looks like the data is about to be moved to another site, which looks to me like it will be a bit more official. Best regards Migrant (talk) 19:59, 2 December 2016 (UTC)
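
The archive.org suggestion above amounts to prefixing the expanded formatter URL. Below is a minimal Python sketch of that idea. The formatter URL and the archive prefix are copied from this page, while the function names `profile_url` and `archived_profile_url` are hypothetical and not part of any Wikidata tooling.

```python
# Formatter URL from the property documentation; "$1" is the placeholder
# that gets replaced by the identifier value.
FORMATTER_URL = "http://www.sports-reference.com/olympics/athletes/$1.html"

# Prefix proposed in the thread; the "*" asks archive.org to list all
# captures of the target URL instead of picking a single snapshot.
ARCHIVE_PREFIX = "https://web.archive.org/web/*/"


def profile_url(athlete_id: str) -> str:
    """Expand the formatter URL for one identifier value."""
    return FORMATTER_URL.replace("$1", athlete_id)


def archived_profile_url(athlete_id: str) -> str:
    """Build the archive.org lookup URL for the same profile."""
    return ARCHIVE_PREFIX + profile_url(athlete_id)


print(profile_url("bo/usain-bolt-1"))
print(archived_profile_url("bo/usain-bolt-1"))
```

Whether a given profile actually has archived copies still has to be checked per URL; the sketch only builds the address.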

ASCII

SR has upgraded with Rio sportspeople - yeah! But they have screwed up encoding... Topic:U1cn8jd9grsmkl5h. Maybe somebody wants to talk with Bill? --Edgars2007 (talk) 18:44, 14 November 2017 (UTC)

Yes, I already added a lot of fresh profiles mid-October. Mind that there are also plenty of updates, additions and corrections of facts (such as person data) within the old profiles, so a re-comparison with Wikidata-values might be valuable.
Regarding the encoding: I am not sure whether Bill Mallon can actually fix this problem. To my knowledge, he (+team) delivers data to sports-reference, and this company then displays it on their website. The current output indicates that the backend is indeed UTF-8 (or similar) encoded, which is good, but the output is for some reason misinterpreted. I have developed a workaround for manual editing: copy and paste the odd text displayed on SR into a service such as 2cyr.com/decode/ with the settings Expert: source encoding: “UTF-8” displayed as: “ISO-8859-1” postfilter: “”. —MisterSynergy (talk) 19:14, 14 November 2017 (UTC)
Thanks, MS. Very valuable link. --Edgars2007 (talk) 13:52, 16 November 2017 (UTC)
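
The same decoding step can be done offline. Below is a minimal Python sketch, assuming the situation is exactly as described above: UTF-8 bytes displayed as if they were ISO-8859-1. `fix_mojibake` is a hypothetical helper name, and the sample string is illustrative rather than copied from the site. If the site in fact misreads the bytes under a different single-byte encoding, the codec name would need to change.

```python
def fix_mojibake(text: str) -> str:
    """Undo UTF-8 text that was wrongly displayed as ISO-8859-1.

    The garbled characters are turned back into their original bytes
    and those bytes are then decoded as UTF-8. Text that does not fit
    this pattern is returned unchanged.
    """
    try:
        return text.encode("iso-8859-1").decode("utf-8")
    except UnicodeError:
        # Covers both characters outside ISO-8859-1 and byte sequences
        # that are not valid UTF-8, e.g. text that was never garbled.
        return text


print(fix_mojibake("SÃ¸by"))
```

Plain ASCII names pass through unchanged, so the helper can be applied to a whole column of copied values.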