Property talk:P1447


Documentation: Olympic athlete ID
identifier for an Olympic athlete at (Q18002875)
Data type: External identifier
Corresponding template: Template:Sports-reference (Q10964954), Template:Cite sports-reference (Q14444672)
Template parameter: "1" or "ID" in en:Template:Sports-reference and no:Mal:Sports-reference
Domain: human (Q5) and architectural firm (Q4387609)
Allowed values: ([a-z]{2}|[aeiou])\/(\1[a-z\-]*|[a-z\-]*-\1[a-z\-]*)-1?\d
Example: Usain Bolt (Q1189) → bo/usain-bolt-1
Example: Anne Jahren (Q256757) → ja/anne-jahren-1
Formatter URL: $1.html
Robot and gadget jobs: My bot can import values from various Wikipedias
Tracking (same): Category:Sports-Reference template with ID same as Wikidata (Q25813704)
Tracking (differences): Category:Sports-Reference template with ID different from Wikidata (Q32183031)
Tracking (usage): Category:Pages using Wikidata property P1447 (Q26759926)
Tracking (local yes, WD no): Category:Sports Reference ID not in Wikidata (Q17783922)
See: athlete ID (P4391), International Olympic Committee athlete ID (P3171), OlympicChannel athlete ID (P5815), athlete ID (P3520)
Proposal discussion: Property proposal/Archive/25#P1447
Current uses: 117,657 out of 135,584 (86% complete)
Format “|([a-z]{2}|[aeiou])\/(\1[a-z\-]*|[a-z\-]*-\1[a-z\-]*)-1?\d”: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist. Known exceptions: Evert-Jan 't Hoen (Q2553574)
List of this constraint violations: Database reports/Constraint violations/P1447#Format, SPARQL, SPARQL (new)
Distinct values: this property likely contains a value that is different from all other items. (Help)
List of this constraint violations: Database reports/Constraint violations/P1447#Unique value, hourly updated report, SPARQL (every item), SPARQL (by value), SPARQL (new)
Type “human (Q5), architectural firm (Q4387609)”: element must contain property “instance of (P31)” with classes “human (Q5), architectural firm (Q4387609)” or their subclasses (defined using subclass of (P279)). (Help)
List of this constraint violations: Database reports/Constraint violations/P1447#Type Q5, Q4387609, hourly updated report, SPARQL, SPARQL (new)
Qualifiers “named as (P1810)”: this property should be used only with the listed qualifiers. (Help)
List of this constraint violations: Database reports/Constraint violations/P1447#Allowed qualifiers, hourly updated report, SPARQL, SPARQL (new)
Single value: this property generally contains a single value. (Help)
Exceptions are possible as rare values may exist.
List of this constraint violations: Database reports/Constraint violations/P1447#Single value, SPARQL, SPARQL (new)
Scope is: the property must be used by specified way only (Help)
List of this constraint violations: Database reports/Constraint violations/P1447#scope, hourly updated report, SPARQL (new)
Human without participation (P1344)
Item with an Olympic athlete ID (P1447) identifier should have a participant of (P1344) claim for their Olympic participation (Help)
Violations query: SELECT DISTINCT ?item WHERE { ?item wdt:P1447 ?id; wdt:P31 wd:Q5 . MINUS { ?item wdt:P1344 [] } } ORDER BY ASC(xsd:integer(STRAFTER(STR(?item), 'Q')))
List of this constraint violations: Database reports/Complex constraint violations/P1447#Human without participation (P1344)
Human without occupation (P106)
Q5 without occupation (P106) (Help)
Violations query: SELECT DISTINCT ?item WHERE { ?item wdt:P1447 ?id; wdt:P31 wd:Q5 FILTER NOT EXISTS { ?item wdt:P106 [] } } ORDER BY ASC(xsd:integer(STRAFTER(STR(?item), 'Q')))
List of this constraint violations: Database reports/Complex constraint violations/P1447#Human without occupation (P106)
Pattern ^(.*)\.html$ will be automatically replaced to \1.
Testing: TODO list



Only Olympics?

It seems to me that P1447 as currently defined only addresses the "Olympic" section of While I'm absolutely fine with this, as it fulfills my purpose, I'm wondering if the definition of the property should stress that only Olympians are addressed? --VicVal (talk) 00:03, 25 October 2015 (UTC)


@Jon Harald Søby: I think it would be great to see what the results would be if we changed the format a little bit. See for example lo/pierre-lorin-1: the last word before the digit begins with the same two letters that come before the slash. Although there are exceptions (those which have only one letter before the slash, and people with a different word order, Ted Huang (Q2399776), for example), this should highlight some errors, I think. Ideas? --Edgars2007 (talk) 13:16, 31 January 2016 (UTC)

@Edgars2007: I think there would be way too many exceptions for the list to be useful, precisely because of East Asian performers like {{Q|2399776}}. It would be better to in some way extract all values and match them to a regex in a different way, I think. Jon Harald Søby (talk) 14:15, 31 January 2016 (UTC)
@Jon Harald Søby: yes, that was my second option. But as there are more than 85k items with this ID, I didn't want to download them :D Maybe you could perform such a scan? --Edgars2007 (talk) 14:25, 31 January 2016 (UTC)
@Edgars2007: An idea could be to change it temporarily for a day or two, so the constraint report will report on those errors, and then revert to the normal format? That way we can copy the results from that day to work on. Jon Harald Søby (talk) 14:28, 31 January 2016 (UTC)
@Jon Harald Søby: yeah, sure. That would be fine. --Edgars2007 (talk) 14:35, 31 January 2016 (UTC)
@Edgars2007: Okay, this regex should do the trick: ([a-z]{2}|[aeiou])/(\1[a-z\-]*|[a-z\-]*-\1[a-z]*)-1?\d. It will match stuff on the form xx/xxyyy-yyy-0 or xx/yyy-xxyyy-0, so that it will tackle East Asian name order as well. If there aren't too many false positives it could be made the default regex. I'll replace it here now, so the report tomorrow or the day after will include these matches. Jon Harald Søby (talk) 14:46, 31 January 2016 (UTC)
@Jon Harald Søby: OK, current results are pretty bad :D First two letters before slash should match any word beginning. Then the format should work fine. --Edgars2007 (talk) 06:46, 1 February 2016 (UTC)
@Edgars2007: Okay, fixed. 😊 Jon Harald Søby (talk) 08:40, 1 February 2016 (UTC)
@Jon Harald Søby: Thanks, now it looks fine. Looked at some 10 random items, all of them were wrong. Nice job :) If you could take a look at your talkpage at nowiki and help with that template we can call it Latvian-Norwegian collaboration week :) --Edgars2007 (talk) 07:07, 2 February 2016 (UTC)
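
For anyone who wants to test identifiers offline instead of waiting for the constraint report, here is a minimal Python sketch of the check discussed in this thread. It uses the pattern from the "Allowed values" field of the property and assumes the constraint is applied as a full-string match. The helper name `matches_format` and the value `xx/usain-bolt-1` are made up for illustration only.

```python
import re

# Pattern copied from the "Allowed values" field of P1447.
# \1 refers back to the one or two letters captured before the slash.
P1447_FORMAT = re.compile(
    r"([a-z]{2}|[aeiou])\/(\1[a-z\-]*|[a-z\-]*-\1[a-z\-]*)-1?\d"
)


def matches_format(identifier: str) -> bool:
    """Return True if the whole identifier fits the pattern (assumed full match)."""
    return P1447_FORMAT.fullmatch(identifier) is not None


samples = [
    "bo/usain-bolt-1",    # example from the property documentation
    "ja/anne-jahren-1",   # example from the property documentation
    "lo/pierre-lorin-1",  # example from this thread
    "xx/usain-bolt-1",    # hypothetical bad value: no name part starts with "xx"
]
for value in samples:
    print(value, matches_format(value))
```

The first three values fit the pattern because one of the hyphen-separated name parts starts with the letters before the slash; the last one does not, so it would be reported.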

Probably some changes required soon

Bill Mallon hints at changes on sports-reference/oly. -- 12:55, 22 August 2016 (UTC)

Thanks for letting us know. Just a note: even after sports-reference is taken down, the statements do not need to be removed immediately. We probably need a new property for the new website once the situation is clearer, and it might be useful to have the old links still in place even if they are dead (we might want to remove the URL pattern when it's dead, or something similar). —MisterSynergy (talk) 15:00, 22 August 2016 (UTC)
Yes, thanks for sharing. Well, let's hope the new statistics profiles will be good. And that they'll provide some profile redirect, so that adaptation here goes more smoothly and our work done this year isn't thrown in the garbage. --Edgars2007 (talk) 15:25, 22 August 2016 (UTC)

They took it offline today [1]. I suggest keeping the claims for now; they might or might not be useful for the successor website they promise to make. Is there any possibility to mark this database as offline in the meantime? —MisterSynergy (talk) 18:42, 1 December 2016 (UTC)

Small update December 16, 2016

There's been an update:

Site Closing: We are sorry to inform you that due to a change to our data licensing agreement we are shutting down our Olympic site sometime during the early part of 2017. The providers of our dataset are working with another publisher to create an extensive site chronicling the history of the Olympic Movement. We will provide information here when that site is available. We will continue to have Olympic Ice Hockey Stats and Olympic Basketball Stats. More Information from Our Data Providers and their Future Plans.
—Updated December 16, 2016.

Best regards Migrant (talk) 03:07, 19 December 2016 (UTC)

New information from January 2, 2017

From a reply to a questioner named Liam, there are some new bits of information from BMallon:

Thanx for your kind words. I think it will be fine when the IOC takes over our data. We have our own private website,, which is the source for the sports-reference site. Olympedia is what the IOC is purchasing from us and we think Olympedia is even better than SR/olympics, although the styles are little different. I don’t think we can donate stuff to Wikidata because of contractural obligations with the IOC. However, I have been in contact with the Wikipedians who do Olympic stuff and talked to the IOC about this so that we can preserve links for them on Wikipedia so they don’t lose all their references. Hopefully we’ll be able to work that out.
—January 2, 2017 at 8:31 PM

Best regards Migrant (talk) 01:25, 11 January 2017 (UTC)

New information from February 24, 2017

Question asked January 24, 2017

Thanks for your great work with olympic statistics. I am one of those who contribute to Wikipedia about mainly winter olympic results (preferably speed skating) and such related biographies, and I just wonder in what type of time frame do you expect to see these statistics published publicly elsewhere for instance at IOCs websites ? Do you think it will be available in time for the 2018 Olympic Winter Games, updated with the 2016 Olympic Summer Games results ?
—Best regards Frank Skillinghaug

Answered February 24, 2017

Frank – no timeline for when our private site, Olympedia, goes public as the IOC statistical site. We have signed our contracts with the IOC and are now in discussions with the IT people so things are moving along. Sports-reference will stay open until the IOC site becomes available.
—– Bill M

Best regards Migrant (talk) 18:41, 17 March 2017 (UTC)

2018 Winter Olympics and still no real database like the other one

It's been more than a year since the first hint of this change of home for Olympic stats. Would it be an idea to contact the IOC about this new database, with reference to the links and answers above, since there are no updates on this at the and we are in the build-up to a winter season with only 159 days to go (as of September 2, 2017) until the 2018 Winter Olympics in Pyeongchang, Republic of Korea? Best regards Migrant (talk) 14:37, 2 September 2017 (UTC)

My recommendation would be to contact Bill Mallon of He has a Wikimedia account (User:Billbambam) with an email address at enwiki. —MisterSynergy (talk) 18:29, 2 September 2017 (UTC)
Disagree there... I think it would be better to contact the right person at the IOC to ask about this upcoming database, with reference to what Mr. Bill Mallon has said at But who would be the correct person to ask there? Bill Mallon might have the answer for that though. Best regards Migrant (talk) 22:14, 3 September 2017 (UTC)
Well, that’s basically the problem with asking “the IOC”. It is a big organization, and I doubt that they really care about our needs, whereas Mallon somewhat does. If he can’t help us it would still be possible to think about alternatives… —MisterSynergy (talk) 04:50, 4 September 2017 (UTC)
Yeah, that's right, the IOC is a bit bigger than an ordinary small-club organization, but they are only people like the rest of us. So if we can find the right person at the IOC, I really think we should ask about a possible timeline for the database to be released at the new site. Is this a task you as an administrator could or would take on, to ask and share the answer here?
BTW, have you seen the new, but not yet complete, Olympic database at this site? Best regards Migrant (talk) 00:19, 5 September 2017 (UTC)
If you want to approach the IOC directly, you’ll probably just try a generic email address such as info <at> olympic <dot> org and hope that they forward it to the right person. Larger organizations often do not expose information about staff positions such as database engineers to the public. IMO it is not relevant to be a Wikidata admin (which is an internal role); much more important is that you give your real name.
I have seen the other site, but I am not yet convinced whether it is useful. Many results are still missing. —MisterSynergy (talk) 05:03, 5 September 2017 (UTC)

August 2018 update: now there is a new website

According to a blog post by Bill Mallon, the new website is now online at I have already created a property proposal for that website, cf. Wikidata:Property proposal/OlympicChannel athlete ID.

Any idea how to efficiently migrate identifiers? Is anyone in contact with Mallon regarding a mapping of old and new identifiers? —MisterSynergy (talk) 18:36, 31 August 2018 (UTC)

Broken links

Quote from "Site Closing We are sorry to inform you that due to a change to our data licensing agreement we are shutting down our Olympic site effective December 1st, 2016. The providers of our dataset are working with another publisher to create an extensive site chronicling the history of the Olympic Movement. We will provide information here when that site is available. "

I set the formatter url to deprecated. Multichill (talk) 21:28, 1 December 2016 (UTC)

  • At least some of these do not seem to be broken as of right now (was the website restored?) In any case we can change the formatter URL to use - prefix with "*/" and it should be useable (the ones I tried had a handful of copies). ArthurPSmith (talk) 15:44, 2 December 2016 (UTC)
    • I just reverted Multichill's deprecation of the formatter URL back to normal. It was correct yesterday, they were broken, but as you say they look fine today. Users Multichill, Stryn and Sjoerddebruin can verify this. See also this blog post about it... , it certainly looks like they are about to be moved to another site, which looks to me like it will be somewhat more official data. Best regards Migrant (talk) 19:59, 2 December 2016 (UTC)


SR has upgraded with Rio sportspeople - yeah! But they have screwed up encoding... Topic:U1cn8jd9grsmkl5h. Maybe somebody wants to talk with Bill? --Edgars2007 (talk) 18:44, 14 November 2017 (UTC)

Yes, I already added a lot of fresh profiles mid-October. Mind that there are also plenty of updates, additions and corrections of facts (such as person data) within the old profiles, so a re-comparison with Wikidata-values might be valuable.
Regarding the encoding: I am not sure whether Bill Mallon can actually fix this problem. To my knowledge, he (+team) delivers data to sports-reference and this company then displays it on their website. The current output indicates that the backend is indeed UTF8 (or similar) encoded, which is good, but the output is for some reason mis-interpreted. I have developed a workaround for manual editing by c&p the odd input displayed on SR to a service such as with settings Expert: source encoding: “UTF-8” displayed as: “ISO-8859-1” postfilter: “”. —MisterSynergy (talk) 19:14, 14 November 2017 (UTC)
Thanks, MS. Very valuable link. --Edgars2007 (talk) 13:52, 16 November 2017 (UTC)

This problem still persists on SR. In case anyone is interested (@Edgars2007?), there is also a handy Python module for exactly this mojibake (Q152869) problem: ftfy (“fixes text for you”). Once installed, it is very simple to use and works in many (but not all) cases. Some examples:

import ftfy

print(ftfy.fix_encoding('RenÃ© DybkÃ¦r')) # from Q7313700 ---
> René Dybkær

print(ftfy.fix_encoding('Володимир Володимирович Кличко')) # from Q18797 ---
> Володимир Володимирович Кличко

print(ftfy.fix_encoding('馬 琳')) # from Q317851 ---

print(ftfy.fix_encoding('سفيان العبيدي')) # from Q7553713 ---
> سفيان العبيدي

print(ftfy.fix_encoding('ฉัตรชัย บุตรดีŠ')) # from Q2029835 ---
> ฉัตรชัย บุตรดีŠ # Thai language fails completely!

print(ftfy.fix_encoding('אסתר רוט שחמורוב')) # from Q434481 ---
> אסתר רוט שחמורוב

print(ftfy.fix_encoding('Milan JanÅ¡a')) # from Q3313990 ---
> Milan JanÅ¡a # fails! Some few (East-European) characters fail as well, like Czech šŠ or Romanian îÎ, but most work

As you can see, this works with most scripts and letters, but it is advisable to glance over the outcome before further processing. —MisterSynergy (talk) 08:26, 22 September 2018 (UTC)
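
For reference, the manual workaround mentioned earlier in this thread (source encoding UTF-8, displayed as ISO-8859-1) can be sketched with the Python standard library alone. This is a minimal illustration and not a replacement for ftfy: the helper names `make_mojibake` and `undo_mojibake` are made up here, and the sketch assumes the garbling is a plain UTF-8-read-as-ISO-8859-1 mix-up. If the site actually serves the text as Windows-1252 (not checked), bytes in the range 0x80 to 0x9F map to different characters and this simple round trip would not be enough.

```python
def make_mojibake(text: str) -> str:
    """Simulate the display bug: UTF-8 bytes wrongly decoded as ISO-8859-1."""
    return text.encode("utf-8").decode("latin-1")


def undo_mojibake(garbled: str) -> str:
    """Reverse the mix-up; return the input unchanged if it cannot be reversed."""
    try:
        return garbled.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Not representable in ISO-8859-1, or not valid UTF-8 afterwards:
        # the string was probably not garbled this way, so leave it alone.
        return garbled


for name in ["René Dybkær", "Milan Janša", "馬 琳"]:
    garbled = make_mojibake(name)
    print(repr(garbled), "->", undo_mojibake(garbled))
```

Strings that are already correct are left untouched by `undo_mojibake`, so it is safe to run over mixed input, but a glance over the outcome is still advisable.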

Wow! Really nice. When I did this, I simply did manual work on finding "screwed up" letters and finding what they really should be like (that was very interesting morning, as I remember). --Edgars2007 (talk) 08:29, 22 September 2018 (UTC)

Single value violations

@Pichpich: that's not how we do things here. Both profiles are about the same person, so that isn't a wrong claim. And in this way, we can keep track of those profiles (we have a list of exceptions to the single value constraint, as you can see at the beginning of this page or on the property page itself). Keeping the value also tells others that there is no need to find a match for this ID. For others: this is about Hamilton de Oliveira (Q5645270) and Antoon Uytterhoeven (Q20747957) (see history). --Edgars2007 (talk) 10:37, 8 February 2018 (UTC)

Single value violation management

To manage the many single value constraint violations, I have switched this property to the separator (P4155) method. The situation was no longer being handled properly by the software. This means that from now on there is no single value constraint violation if there are different named as (P1810) qualifiers added to the multiple Sports-Reference identifiers in an item. I am currently adding those qualifiers for all existing single value constraint violations. —MisterSynergy (talk) 23:00, 1 September 2018 (UTC)
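
The rule described above can be sketched in Python as follows. This is only one reading of the separator method as stated in this section, not the actual constraint-checker code; the `Claim` type, the function name and the identifiers in the usage lines are hypothetical.

```python
from typing import List, NamedTuple, Optional


class Claim(NamedTuple):
    value: str               # Sports-Reference identifier
    named_as: Optional[str]  # named as (P1810) qualifier, None if missing


def violates_single_value(claims: List[Claim]) -> bool:
    """Assumed rule: several values are fine only if every claim carries
    a named as (P1810) qualifier and all qualifier values differ."""
    if len(claims) <= 1:
        return False
    separators = [c.named_as for c in claims]
    if any(s is None for s in separators):
        return True
    return len(set(separators)) != len(separators)


# Hypothetical item with two identifiers for the same person.
with_qualifiers = [
    Claim("ab/example-abc-1", "Example Abc"),
    Claim("ab/example-abd-1", "Example Abd"),
]
without_qualifiers = [
    Claim("ab/example-abc-1", None),
    Claim("ab/example-abd-1", None),
]
print(violates_single_value(with_qualifiers))
print(violates_single_value(without_qualifiers))
```

Under this reading, the first item is accepted and the second is still reported until the qualifiers are added.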

Outdated profiles management

As some of you might know, SR's update process is a bit complicated. The OlyMADmen group around Bill Mallon does the research, and occasionally they push updates to, which apparently is nothing but a web hosting company for them. During an upgrade, old versions of the profiles are simply overwritten with new versions. Unfortunately, the identifiers are not really 100% stable, thus the following things can happen when an identifier itself was updated:

  • Profile at old identifier remains on server (some of their database updates), with or without stylesheets, and often containing incomplete or outdated information
  • Alternatively: profile at old identifier does not remain on server (other database updates), leaving a dead link
  • Identifiers are rarely re-used for other athletes

In the past weeks I updated almost 1000 SR identifiers to the most recent form, for cases where the identifier used here in Wikidata was outdated (which means: the identifier is no longer listed on the SR athletes index). Situation right now is:

  • There are only two items with more than one listed profile (Victoria Wright (Q518305) and Nimrod Shapira Bar-Or (Q2898896)); these are true duplicates in the SR database, so if someone has contact to them, please report this.
  • There are two SR profiles which are apparently broken (ko/nozomi-komuro-1 and nd/aminata-ndong-1). Both are not listed on the index, but this seems to be an error (can also be reported to SR)
  • All ~118k other items using this property have one bestrank identifier which is listed at the SR athletes index, i.e. up-to-date versions of the profiles.
  • In case of multiple identifiers (of which only one is currently valid), I have used ranks to prefer the current value. If there is more than one identifier in an item, all SR identifier claims have named as (P1810) separator qualifiers.
  • In some cases, profiles formerly published at SR are no longer listed in the SR athletes index. This typically happens if they find out that someone withdrew due to illness before the competition started, or if someone was a team member in a team sport, but did not have active participation during the entire Olympic tournament. SR does not cover these participants any longer. I have used preferred no value claims to indicate that the former identifiers with normal rank are outdated (query to find these cases).

MisterSynergy (talk) 20:18, 1 October 2018 (UTC)
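
For data users, the rank handling described above can be sketched in Python. Best rank in Wikidata means the preferred statements if there are any, otherwise the normal ones, and never the deprecated ones. The `Statement` type, the function name and the identifiers below are hypothetical and only illustrate the three situations listed above.

```python
from typing import List, NamedTuple, Optional


class Statement(NamedTuple):
    value: Optional[str]  # identifier, or None for a "no value" claim
    rank: str             # "preferred", "normal" or "deprecated"


def best_rank(statements: List[Statement]) -> List[Statement]:
    """Preferred statements if any exist, otherwise the normal ones."""
    preferred = [s for s in statements if s.rank == "preferred"]
    if preferred:
        return preferred
    return [s for s in statements if s.rank == "normal"]


# One current identifier only.
single = [Statement("ab/example-abc-1", "normal")]

# Outdated identifier kept at normal rank, current one preferred.
renamed = [
    Statement("ab/example-abc-1", "normal"),
    Statement("ab/example-abd-1", "preferred"),
]

# Profile no longer listed: preferred "no value" marks the old one as outdated.
delisted = [
    Statement("ab/example-abc-1", "normal"),
    Statement(None, "preferred"),
]

for item in (single, renamed, delisted):
    print([s.value for s in best_rank(item)])
```

A consumer that only reads best-rank values therefore gets the current identifier, or no identifier at all for delisted profiles, while the outdated values stay available for matching.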

  • Comment: seems like an exemplary approach even for people not interested in the topic. @Magnus Manske: what do you think? --- Jura 20:50, 1 October 2018 (UTC)