Wikidata:Property proposal/Biographical Dictionary of the Czech Lands
Biographical Dictionary of the Czech Lands
[edit]Originally proposed at Wikidata:Property proposal/Person
Description | MISSING |
---|---|
Represents | Biographical Dictionary of the Czech Lands (Q29538944) |
Data type | External identifier |
Domain | human (Q5) |
Example 1 | Othenio Abel (Q78571) → ABEL Othenio 20.6.1875-4.7.1946 |
Example 2 | Jan Otto (Q671725) → OTTO_Jan_8.11.1841-29.5.1916 |
Example 3 | Anna Adamcová (Q98380140) → ADAMCOVÁ_Anna_27.1.1874-2.6.1950 |
Example 4 | Anne of Foix-Candale (Q231126) → ANNA z Foix ?1484-26.7.1506 |
Source | http://biography.hiu.cas.cz/ |
Mix'n'match | 3509 |
Number of IDs in source | 51887 |
Formatter URL | http://biography.hiu.cas.cz/Personal/index.php/$1 |
Motivation
[edit]Source for bibliography about persons, for some of them with short CV JAn Dudík (talk) 21:13, 27 December 2020 (UTC)
Notified participants of WikiProject Czech Republic JAn Dudík (talk) 21:14, 27 December 2020 (UTC)
Discussion
[edit]- Support Jedudědek (talk) 21:18, 27 December 2020 (UTC)
- Support --Sapfan (talk) 21:25, 27 December 2020 (UTC)
- Support --Jklamo (talk) 21:31, 27 December 2020 (UTC)
- Support but – Biographical dictionary is at mediawiki – so, links arent solid. Maybe curid will be better. --Frettie (talk) 23:49, 27 December 2020 (UTC)
- in principle support, but I agree with Frettie. @JAn Dudík: Could you live with http://biography.hiu.cas.cz/Personal/index.php?curid=$1 as the formatter and \d* as the regex. A replacement for MnM 3509 has to be found after the creation of the property, but that might be a relatively small task (which can be done with a bot or even OpenRefine) compared to the advantage of true permalinks --Emu (talk) 00:03, 28 December 2020 (UTC)
- Current format is easy to add by hand. curid is not easy for many users and not intuitive. JAn Dudík (talk) 11:27, 28 December 2020 (UTC)
- In that case I have to
Opposesee below it. It is best practice to use IDs for Mediawiki projects. Deviating from this will just increase work in the future. (Is there really that much manual editing going on? I would suspect that almost all edits would be done via OpenRefine, MnM or a bot.) --Emu (talk) 14:10, 28 December 2020 (UTC) - Just click on "Nástroje" and "Informace o stránce", it's not that difficult. But I am not sure if MnM catalogue replacement will be that simple. It would not be nice to lose 40000+ matched items.--Jklamo (talk) 14:57, 28 December 2020 (UTC)
- That’s relatively easy:
- Download MnM data, import them into OpenRefine, match them to ID
- Create Property, match via OpenRefine
- Create new MnM catalogue, import entries from Wikidata
- Probably half a day’s work, tops --Emu (talk) 15:25, 28 December 2020 (UTC)
- OK, If anybody have half a day free time an understand what to do... Not me. I have only practical issues with ID. If there are thousands of already connected items in M'n'm it seems impractical to create new catalog from scratch. JAn Dudík (talk) 19:25, 28 December 2020 (UTC)
- That’s relatively easy:
- In that case I have to
- Current format is easy to add by hand. curid is not easy for many users and not intuitive. JAn Dudík (talk) 11:27, 28 December 2020 (UTC)
Second proposal using Mediawiki page id
[edit]I created a new MnM catalogue 4190 with all entries of the old one save for 111 pages that were moved without redirect. That actually shows the problem: The page titles aren’t stable on this wiki. Take ERBANOVÁ - DOSKOČILOVÁ Zora 28.08.1924 which was moved without redirect. (Even though I tried to follow the manual, a few hundred items matched by users aren’t automatched in the new MnM catalogue. I stored them in Wikidata:Property proposal/Biographical Dictionary of the Czech Lands/matched, they can easily be added via QuickStatements once the property is created). So I’d like to propose a second version of this property:
Biographical Dictionary of the Czech Lands
[edit]Originally proposed at Wikidata:Property proposal/Person
Represents | Biographical Dictionary of the Czech Lands (Q29538944) |
---|---|
Data type | External identifier |
Domain | human (Q5) |
Example 1 | Othenio Abel (Q78571) → 38916 |
Example 2 | Jan Otto (Q671725) → 56440 |
Example 3 | Anna Adamcová (Q98380140) → 38752 |
Example 4 | Anne of Foix-Candale (Q231126) → 39400 |
Source | http://biography.hiu.cas.cz/ |
Mix'n'match | 4190 |
Number of IDs in source | 51887 |
Formatter URL | http://biography.hiu.cas.cz/Personal/index.php?curid=$1 |
Single-value constraint | yes |
Distinct-values constraint | yes |
@JAn Dudík, Jedudedek, Sapfan, Jklamo, Frettie: Could you live with that?
- @JAn Dudík, Jedudedek, Sapfan, Jklamo, Frettie: second try at ping --Emu (talk) 01:23, 13 February 2021 (UTC)
- @JAn Dudík, Jedudedek, Sapfan, Jklamo, Frettie, Emu: Done as Biographical Dictionary of the Czech Lands ID (P9160). UWashPrincipalCataloger (talk) 23:38, 16 February 2021 (UTC)
Second Talk
[edit]- Support --Frettie (talk) 08:58, 13 February 2021 (UTC)
- Support --Sapfan (talk) 14:37, 13 February 2021 (UTC)
- Support --Jklamo (talk) 19:27, 13 February 2021 (UTC)
@Emu: I see that @Thierry Caro: has been doing some imports of this new property. Is one of you planning to do the conversion from old Mix'n'Match catalogue too is that task up for taking? Vojtěch Dostál (talk) 18:18, 17 February 2021 (UTC)
- @Vojtěch Dostál: I’m not sure what you mean – maybe you missed what I wrote just after Second proposal using Mediawiki page id (it’s easy to miss …) ? In short: I created a new MnM catalogue (4190) to replace the old one (3509). The new catalogue contains all old entries (just with a numerical ID as identifier) save for 111 (for the reason see above). Creating a good scraper for large Mediawikis without semantic extensions is really hard, so I didn’t do it yet. --Emu (talk) 18:43, 17 February 2021 (UTC)
- @Emu: I've read that but I unfortunately do not see there the information I am looking for. If I understand this right, we should convert all those thousands of matched items from the 1st MnM catalogue and find out their respective curids to be able to import them to Wikidata. That's what I was asking about - is someone working on that conversion? Vojtěch Dostál (talk) 18:48, 17 February 2021 (UTC)
- @Vojtěch Dostál: This work has already been done by me! MnM 4190 uses the IDs/curids (I found a way to do it much easier as I proposed on 28 December 2020). Thierry Caro has already started to sync the new MnM; I’m not sure why he stoped, as there are a another 16841 cases to be synced, see here. --Emu (talk) 19:09, 17 February 2021 (UTC)
- OK, great! Thanks for the clarification, appreciated :) Vojtěch Dostál (talk) 19:15, 17 February 2021 (UTC)
- @Vojtěch Dostál: This work has already been done by me! MnM 4190 uses the IDs/curids (I found a way to do it much easier as I proposed on 28 December 2020). Thierry Caro has already started to sync the new MnM; I’m not sure why he stoped, as there are a another 16841 cases to be synced, see here. --Emu (talk) 19:09, 17 February 2021 (UTC)
- @Emu: I've read that but I unfortunately do not see there the information I am looking for. If I understand this right, we should convert all those thousands of matched items from the 1st MnM catalogue and find out their respective curids to be able to import them to Wikidata. That's what I was asking about - is someone working on that conversion? Vojtěch Dostál (talk) 18:48, 17 February 2021 (UTC)