User:JAn Dudík
Appearance
Babel user information | ||||||||
---|---|---|---|---|---|---|---|---|
| ||||||||
Users by language |
JAn (also known as JAnD) from cs.wikis, owner of JAnDbot
I am working mainly:
- Connecting czech projects with Wikidata
- Small monuments and buildings
- items with coordinates in Czech Republic (Q213)
- Solving contraints
- SPARQL
- Items in Czechia with coordinates without instance of (P31)
- Objects in certain district
- Categories with same single P971 and P301
- Items in Czechia without P17:Q213
- Items located in Czechia without located in the administrative territorial entity (P131)
- [1] Stations of cross
- Chapels with images
- Objects with coordinates, but without both country (P17) and located in the administrative territorial entity (P131) - must be running for each 1° (12-19) separately
- Scripts
- WUS.js - own modification of Wikidata USeful for small monuments and czech region.
- nearby.js - tool for displaying three most closest objects for coordinate location (P625)
- OpenRefine reconciliation by coordinates (Thanks to Vojtěch Dostál)
First install csv-reconcile
$ python -m venv serverenv<br> $ source serverenv/bin/activate<br> $ python -m pip install csv-reconcile<br> $ python -m pip install csv-reconcile-geo<br> $ csv-reconcile --init-db query.tsv item coord --scorer geo
In working directry create file config.txt
containing:
MANIFEST = { "identifierSpace": "http://www.wikidata.org/entity/", "schemaSpace": "http://www.wikidata.org/prop/direct/", "view": {"url":"https://www.wikidata.org/wiki/{{id}}"}, "name": "GEO", } THRESHOLD=95.0 LIMIT=10 SCOREOPTIONS={ "SCALE": 150 } import logging LOGLEVEL=logging.DEBUG
- Download query of objects with unique coords
- rename to kaplicky.csv
- replace first column - only QIDs
- delete all lines with unknown value - contains
well-known
- Lines must be now in format
Q116166567 Point(17.7636 49.9156)
- Run in commandline
csv-reconcile init --scorer geo --config config.txt kaplicky.tsv item coords
csv-reconcile serve
- In OpenRefine I must have column containing coords in format
Point(17.9703308 49.8405722)
- Run reconciling for this columm, select GEO
- Depending on precision check lines where is score 96+ and use them as identifier.
- Namespaces
- (0) –
- (2) Uživatel: –
- (4) Wikipedie: Wikimedia project page (Q14204246)
- (6) Soubor: –
- (8) MediaWiki: –
- (10) Šablona: Wikimedia template (Q11266439)
- (12) Nápověda: ?
- (14) Kategorie: Wikimedia category (Q4167836)
- (100) Portál: Wikimedia portal (Q4663903)