User talk:Legobot/properties.js

From Wikidata
Jump to navigation Jump to search

U.S. Roads

[edit]
Took a look at it and it should be fine. --Rschen7754 20:46, 19 February 2013 (UTC)[reply]

The main type of item (P107) for roads is place (= geographical feature, Q618123). --Kolja21 (talk) 20:42, 19 February 2013 (UTC)[reply]

We've been running just these three types, but I'm planning to do a larger request with main types soon. --Rschen7754 20:46, 19 February 2013 (UTC)[reply]
Since these aren't very big requests I don't mind re-running them later on to add more properties. Legoktm (talk) 12:26, 20 February 2013 (UTC)[reply]
Looks good to me (and thanks for getting the next few started - it's been a busy few days). --Rschen7754 21:45, 26 February 2013 (UTC)[reply]
All good except the last one - a bit too much collateral to only tag one article, w:en:U.S. Route 11 in Mississippi. --Rschen7754 07:25, 28 February 2013 (UTC)[reply]

Rivers in Africa

[edit]

None currently

Gender categories from Swedish Wikipedia

[edit]
The Swedish Wikipedia is great, because they have two flat categories, one for each gender. This will be a huge job – 28285 woman articles, 117281 man articles, but it is all safe.

These categories have been ✓ Checked, but are  Waiting for m:Steward_requests/Global_permissions#Global_API_High_Limit_Requestor_for_Legobot to be resolved. Also a few optimizations in the code. Legoktm (talk) 14:12, 17 February 2013 (UTC)[reply]

Global permission received. One more code optimization to make and we'll be ready to go. Legoktm (talk) 07:16, 19 February 2013 (UTC)[reply]
Can you please also add the main type of item (Property:P107): person (Q215627)? Thanks --Kolja21 (talk) 15:13, 19 February 2013 (UTC)[reply]
That is already being done! I modified the template so if you look at the actual code, you will see |pid2=P107|qid2=Q215627, but the template doesn't display it yet. Legoktm (talk) 15:14, 19 February 2013 (UTC)[reply]
Note that as of a while ago, the first category is running. Legoktm (talk) 15:18, 19 February 2013 (UTC)[reply]
Wow, I'm impressed! --Kolja21 (talk) 15:21, 19 February 2013 (UTC)[reply]
Wheee. :D Jon Harald Søby (talk) 22:28, 19 February 2013 (UTC)[reply]
First is ✓ Done, second is  In progress. Legoktm (talk) 12:18, 28 February 2013 (UTC)[reply]

✓ Done! Managed to create Q6000000 in the process :) Legoktm (talk) 08:06, 7 March 2013 (UTC)[reply]

Gender categories from German Wikipedia

[edit]
Same as the one above. The categories have the same criteria for inclusion, namely that the contents are only real people, so here are ~470000 more articles. :-D
Same status as above. Legoktm (talk) 07:16, 19 February 2013 (UTC)[reply]
✓ Done first, second is queued. Legoktm (talk) 09:12, 7 March 2013 (UTC)[reply]
Second is  In progress, using a much much more optimized script. Legoktm (talk) 08:34, 23 March 2013 (UTC)[reply]

Further investigation

[edit]

Categories that will be useful, may just need formatting or investigation whether they are accurate enough.

Italian person data

[edit]

Italian Wikipedia person data

[edit]
General discussion: Wikidata:Project chat#Italian person data (now archived -> consensus found); original location User_talk:Legobot/properties.js#Italian_person_data
  • Category: w:it:Categoria:BioBot (200+ thousands people, cf. [1])
  • Properties: w:it:Template:Bio#Tabella_completa, to be fetched from template usage (not everything translated to categories)
    • Examples: name and gender (mandatory), surname, place/date/year of birth/death, one out of 552 defined jobs
    • More details will follow, are you also interested in a mapping for the jobs? There's a lot of them and I doubt any other wiki has them in a structured format. --Nemo 19:27, 17 February 2013 (UTC)[reply]
    • I will work on implementing template parsing hopefully by this weekend.
    • Sesso: P21 -> 6581097 if M, 6581072 if F (note that this is used only for grammatical purposes so "intersex" is not used; in non-trivial cases, it may reflect the policy here on Wikidata or not)
      ✓ Done
    • LuogoNascita (but LuogoNascitaLink should prevail if available): P19 -> the entry for the item corresponding to the page with that title
      ✓ Done
    • LuogoNascitaAlt: same as above, for complex cases with alternatives; maybe a secondary statement for P19? no other property is available
    • NoteNascita: pull sources for the Nascita statements from the ref tags in here.
    • LuogoMorte, LuogoMorteLink and LuogoMorteAlt, NoteMorte: same as above but for P20
      ✓ Done
    • Nazionalità: P27 -> linked country
    • NazionalitàNaturalizzato: additional statement to P27
      ✓ Done for countries instances of a subclass of state (Q7275) except a few, see list of articles not imported yet and breakdown by their value.
       Info See the map from adjectives to countries. The local information is based on current sources. Except 4 entities to sync, all the values used are compatible with this property. See further discussion.
    • PostNazionalità: this field may contain sources for any of the previous statements (more general ones could also be right after the end of the template or in FineIncipit).
    • FineIncipit: replaces standard occupation etc., maybe add to item description?
    • Immagine: P18 -> image with this name (check if it's on Commons; over 35k usages)
    • For each statement: add as reference the Property:P143 with value Q11920, example cat (update: as discussed at project chat).
    • First name (Nome): P735
      ✓ Done where it equals an it.wiki article and hence entity.
    • Last name (Cognome): P734
      ✓ Done (same);  Info below on disambiguation pages, transliteration
    • Day and month of birth (GiornoMeseNascita) + Year of birth (AnnoNascita): P569
    • Day and month of death (GiornoMeseMorte) + Year of death (AnnoMorte): P570
      Do not add a date in contrast with a Integrated Authority File (Q36578) statement if available.
      ✓ Done in part by Dexbot, dates after 1920.  Info ViscoBot had started but stopped long ago.
       Question I also wrote the code to import dates of birth and death but I'm not running it yet because there is one important question: What is the colander model you use as date of birth and death? in some places Gregorian wasn't common until 1912 so I can't add these dates before 1912 because the bot can't be sure about calender model of these dates  – The preceding unsigned comment was added by Ladsgroup (talk • contribs).
      We're verifying, I'll let you know the final outcome. Past discussions seem to have all agreed on forcing gregorian calendar in the template, with the option to indicate giulian calendar next to it with a warning. --Nemo 13:27, 27 April 2014 (UTC)[reply]
    • Title to be used before name, or after it in some languages other than Italian (Titolo): P511 (about 3k usages)
    • Missing properties:
      • Unrecognized citizenship (peoples without state), e.g. Kurds (Cittadinanza)
      • Free text notes on dates of birth/death (NoteNascita, NoteMorte): some sources could be extracted from here. Example content is very varied but in 55 % of cases contains an URL, that could be imported as source.
    This should be it. --Nemo 08:49, 22 February 2013 (UTC)[reply]
A proposal on sourcing for Wikidata was moved to Wikidata:Project chat#Proposal: preventive control of imported data correctness
As far as edit summaries go, the bot actually does send proper edit summaries, in the format of Bot: Setting [[Property:{pid}|{pid}]] to [[{target_qid}]]; using [[:{lang}:{source}]]; requested by [[User talk:{user}|{user}]], it's just that the software doesn't support them yet. It may be worth putting this run on hold until the software does support custom summaries.
I do believe that at this point, we may need to look how to properly source these claims, since they are no longer "obvious". Maybe that should be a discussion on Project chat? I believe there are legitimate concerns before this request can go forward, as well as code that I need to work on. Legoktm (talk) 01:23, 23 February 2013 (UTC)[reply]
If edit summaries are a problem, we could just use a different username for the bot, like "Italian Wikipedia person data import bot".
What fields are no longer obvious, specifically? Surely place of birth is more "obvious" and less controversial than gender, for instance. I think it makes sense to start only with the "obvious" ones: it seems to me that most worries are about nationalist controversies, so probably those are the only fields to exclude in the first run? Otherwise, sources exist of course, you could pull them at the same time if people feel it can't be done later. --Nemo 08:50, 23 February 2013 (UTC)[reply]
Ping. I have updated the data above, it seems to me that we no longer have anything to wait for? Were the easy parts like gender done already? --Nemo 08:58, 23 August 2013 (UTC)[reply]

Constellations

[edit]

None at the moment

Astronomic object

[edit]

See here ;) --Paperoastro (talk) 12:03, 23 February 2013 (UTC)[reply]

Pre-approved

[edit]

These have been approved to run, and will be sent over once Legoktm fixes Legobot Sven Manguard Wha? 17:10, 9 November 2013 (UTC)[reply]

Requests

[edit]

Please add your requests here:

 Comment: sport (P641) seems a bit odd to me, why not "occupation: hand-ball player

no such occupation exists. --Danrok (talk) 16:50, 15 October 2013 (UTC)[reply]
You mean no item ? Then let's create one, association football player (Q937857) is already in use in 1000s of items, and I do not think it should be any different here. --Zolo (talk) 20:46, 15 October 2013 (UTC)[reply]
Since when is sport always an occupation? Many modern footballers are employed by a club and paid a salary, that is why footballer is their occupation. This situation does not exist for every sport, and has not existed since the beginning of sport. There are many sports persons who are notable and compete internationally, but are not employed in sport, it is not their occupation, they have a day job. We can't simply mass-claim a sport as an occupation. --Danrok (talk) 21:04, 15 October 2013 (UTC)[reply]
I would say it is ok to use "occupation" for unpaid or part-time activities. That might be "hermit" or "slave" too. If someone is known for being a handball player or a writer, I think it makes sense to use "occupation", even if he has a day-job. If it does not, we may need a specific property to express that. I am concerned that properties like "sport" do not have very clear semantics (we don't know for sure whether it means handball player, handball coach or even handball fan), and that properties like this would cause some scattering of information. --Zolo (talk) 07:19, 16 October 2013 (UTC)[reply]
The sport property is much the same as genre. Genre doesn't tell us enough on its own, either. But, all becomes clear via other properties and qualifiers. For example, sport = tennis, occupation = coach (Q41583). --Danrok (talk) 17:22, 16 October 2013 (UTC)[reply]

Thank you for copypaste.py

[edit]

Thank you for the very useful tool copypaste.py. Lionel Allorge (talk) 13:37, 18 March 2013 (UTC)[reply]

Could you remove two requests from User:Legobot/properties.js please?

[edit]

Hello, could you remove the two folowing lines from User:Legobot/properties.js please? Somebody deleted the original items and replace them with another which makes no real sense...

I've asked for the undeletion of this items and will make a new request soon. Thanks, Ayack (talk) 09:52, 19 April 2013 (UTC)[reply]

✓ Done. Legoktm (talk) 09:53, 19 April 2013 (UTC)[reply]
Wow, it was fast. Thanks! Ayack (talk) 09:59, 19 April 2013 (UTC)[reply]
You got lucky actually :P The way it's set up right now, the only way to stop the bot from processing a job after it's been added to the page is by a) blocking it, or b) logging into the Toolserver and killing it (what I did). I think it's time to come up with an alternative solution... Legoktm (talk) 11:28, 19 April 2013 (UTC)[reply]
The original items have just been undeleted. Could you add the following lines wich have another qid2?
Thanks again and sorry for the trouble! Ayack (talk) 15:06, 19 April 2013 (UTC)[reply]
Added to User talk:Legobot/properties.js/requests. Ayack (talk) 18:28, 24 April 2013 (UTC)[reply]

Could you remove request from User:Legobot/properties.js please?

[edit]

Thanks --Maxx (talk) 20:01, 9 May 2013 (UTC)[reply]

there is no such request --Akkakk 21:12, 9 May 2013 (UTC)