Hi there, I recently modified the code for the wikipedia:Template:ESPNcricinfo template on en.wikipedia to categorise any misalignment between the ID used in the template and the ID stored in wikidata, or if there was no P2697 statement at all in Wikidata. Currently there are over 600 pages listed in wikipedia:Category:Cricinfo maintenance that use the ESPNcricinfo template in en.wikipedia, but don't have a Wikidata P2697 statement. Is your bot able to do a sweep regularly though that category to add Wikidata statements? Thanks.
About this board
Welcome to Wikidata, Edgars2007!
Wikidata is a free knowledge base that you can edit! It can be read and edited by humans and machines alike and you can go to any item page now and add to this ever-growing database!
Need some help getting started? Here are some pages you can familiarize yourself with:
- Introduction – An introduction to the project.
- Wikidata tours – Interactive tutorials to show you how Wikidata works.
- Community portal – The portal for community members.
- Contents – The main help page for editing and using the site.
- Project chat – Discussions about the project.
If you have any questions, please ask me on my talk page. If you want to try out editing, you can use the sandbox to try. Once again, welcome, and I hope you quickly feel comfortable here, and become an active editor for Wikidata.
Previous discussion was archived aton 2015-12-09.
ESPNcricinfo.com player ID (P2697)
OK, will do the import (probably in the weekend), but I really do suck in doing things regularly. Pinging will be fine :)
Thanks for the script to list up the WTA players with an invalid code. I saw your bot is also working hard to fix the issue, way quicker then I can do this by hand. Do you have a script for that, and if so, in what language? I might be interested, especially when it runs on Python...
About edits by bot.
I scraped WTA ranking pages (till 19th page) and did simple name matching between SPARQL results and WTA scrape results. It got approx. 450 results.
Later edits by bot. Well, those are practically manual edits. Was using Google links from that SPARQL query I posted at WT:Tennis and put the new IDs in text editor and then simply fed up Python script, so that I don't have to edit at Wikidata myself.
Aha, nice to know! It means that I will have to help (by hand ;-) to get the job done. I will start with all the Dutch players on the list, and afterwards pick some others too.
Hello Edgars2007. What do you think of ESBL.ee : Eesti spordi biograafiline leksikon (Q12361673) as a property-identifier ? Or is it out of scope since it is a less popular language ? Best regards Migrant.
Short answer - I think it's OK for a property. According to etwiki, it has 6tk articles, so it's pretty big one.
Long answer - later :)
The answer is in this edit: https://www.wikidata.org/w/index.php?title=Q27020041&diff=379913089&oldid=379912968
Yes, Edoderoo is right. Informally speaking, Q27020041 is for leagues, Q1539532 - clubs. There was disscusion about this on Project chat (I think around creation of Q27020041)
Sports-Reference athlete index
Hey Edgars, do you happen to know whether Sports-Reference.com has an index of athletes, linked to their profiles? I’d like to crawl all rowing athletes before they shut down the site, but I can’t find a useful list of links to crawl on their site. An alternative would be to just crawl profiles linked from Wikidata, but then I’d miss all athletes without a Wikidata item (~50% of all Olympic rowers). Cheers!
The only index, that I'm avare of, is this. But it isn't complete for some reason. Other idea would be to go trough all rowing events in SR and crawl athletes from there, but that adds complexity :)
Thanks! Yes, that adds quite some complexity. Right now I have a nice crawler, but it needs input in form of URLs or identifiers. I need a good idea how to solve that, quickly... :-)
I have thought about an import of all sports-reference profiles to Wikidata anyways. There are currently ~96.000 Wikidata items with sports-reference identifieres, and they have around 120.000 profiles to my knowledge. Thus, most of the athletes are already here anyway. However, I'm not aware of legal issues with such an approach and one needs a parser that is capable of extracting all the information from Sports-Reference profiles in HTML format. It's probably too late to do that before sports-reference shuts down. And I'm afraid about the promised future database's quality, to be honest.
When I'm adding participant of (P1344) with bot, sourcing it with SR, I'm also saving some data about athlete (their "infobox") into .txt file - when will have time (ha ha ha) will upload the data to WD. So theoretically, I have a (Python) scraper.
Hello Edgars2007. I was about to create a new wikidata item for a retired speed skater and was looking for his Sports-reference.com-profile ID but it looks like that sportssite-part are about to be moved somewhere else, but not yet done. What is your suggestion to all those ID's at wikidata. See what page I've got to http://www.sports-reference.com/olympics/. Regards Migrant.
Hmm, looks like the Sports-reference.com/olympics site is up again today but yesterday it wasn't. See also this blogpost http://olympstats.com/2016/08/21/the-olymadmen-and-olympstats-and-sports-reference/. Ps. Spoken with user Multichill, sjoerddebruin and Stryn on IRC-chat about it. Best regards Migrant.
Looks like it will work till the new site will be available, so the problem is kind of solved, but my thoughts on question is: don't delete anything (from sportsperson items) and a) remove formatter URL (so links aren't clickable) or b) change the formatter URL to use archived version.
Hello, I'm not deleting anything, I'm just adding Sports-reference ID's to new Wikidata-Items which I'm creating at NO:WP. If we need to change the formatter-URL later, we'll do that, but only when that is needed and not before. But I'm also wondering about the future of Olympic.org ID (P3171), will that maybe merge with our Sports-reference ID at the address of Olympic.org. Have you tried to login at OlyMADmen's own site Olympedia mentioned at the site of Olympstats.com to see what's there, I haven't. Btw. Will you do an update on your underpage of Sports ID's. I think that page gives a good overview of how many inclusions of different Sports ID's are in use at Wikidata. Ps. I'm leaving for this afternoon, but will be back later tonight. Best regards Migrant.
Updated the Sports ID subpage count. Don't know about other things you mentioned, sorry.
About that proposal at the village pump
Hi I just proposed Meta:2016 Community Wishlist Survey/Categories/Miscellaneous#Mark thanked edits automatically patrolled for some users if you still want to support my idea of combined thanking and patrolling. Thank you for first comment in any case. Bye.
about lvwiki warning template
I've noticed there's a warning template appearing at the 'undo' pages. Would it be possible for me to apply it on trwiki? Can you refer me to instructions?
Thanks in advance,
Those aren't special notices for 'undo' action. It's a notice for editing mode. Sure, you can go and make it live at trwiki. Would linking to w:en:Wikipedia:Editnotice be enough - you seem to be pretty tech-savvy person? I didn't made any big changes for lvwiki version - just basic copy-paste, we're not using this system very often - only on elections :D and some other rare cases.