User:JAn Dudík

From Wikidata
Babel user information
cs-N This user is a native speaker of Czech.
en-3 This user has advanced knowledge of English.
sk-2 This user has intermediate knowledge of Slovak.
de-1 This user has basic knowledge of German.
mul-0 This user has no knowledge of multiple languages (or understands it with considerable difficulty).
Users by language

JAn (also known as JAnD) from cs.wikis, owner of JAnDbot

I work mainly on:

  • Connecting Czech projects with Wikidata
  • Small monuments and buildings
  • Items with coordinates in the Czech Republic (Q213)
  • Resolving constraint violations
SPARQL
Scripts
OpenRefine reconciliation by coordinates (Thanks to Vojtěch Dostál)

First, install csv-reconcile:

$ python -m venv serverenv
$ source serverenv/bin/activate
$ python -m pip install csv-reconcile
$ python -m pip install csv-reconcile-geo
$ csv-reconcile --init-db query.tsv item coord --scorer geo

In the working directory, create a file config.txt containing:

MANIFEST = {
    "identifierSpace": "http://www.wikidata.org/entity/",
    "schemaSpace": "http://www.wikidata.org/prop/direct/",
    "view": {"url":"https://www.wikidata.org/wiki/{{id}}"},
    "name": "GEO",
}
THRESHOLD=95.0
LIMIT=10
SCOREOPTIONS={
  "SCALE": 150
}
import logging
LOGLEVEL=logging.DEBUG
  1. Download the query results for objects with unique coordinates
    1. Rename the file to kaplicky.tsv
    2. Reduce the first column to bare QIDs
    3. Delete all lines with an unknown value (they contain "well-known")
    4. Each line must now be in the format Q116166567 Point(17.7636 49.9156)
  2. Run on the command line
    1. csv-reconcile init --scorer geo --config config.txt kaplicky.tsv item coords
    2. csv-reconcile serve
  3. In OpenRefine, you need a column containing coordinates in the format Point(17.9703308 49.8405722)
    1. Run reconciliation on this column and select the GEO service
    2. Depending on the precision you need, check the rows with a score of 96 or higher and use their matches as identifiers.
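
The manual clean-up in step 1 and the coordinate format needed in step 3 can be sketched in Python. This is only a rough illustration, not part of csv-reconcile: the function names clean_rows, clean_file and to_wkt_point are made up here, and it assumes the download is a tab-separated file with a header row, the item in the first column and the coordinate in the second.

```python
import csv
import re

QID = re.compile(r"Q\d+")
POINT = re.compile(r"Point\(-?\d+(?:\.\d+)? -?\d+(?:\.\d+)?\)")


def clean_rows(rows):
    """Keep the header, then reduce each data row to [QID, Point(lon lat)]."""
    rows = iter(rows)
    header = next(rows, None)
    if header is None:
        return []
    cleaned = [header]  # assumption: the first row is a header and stays as-is
    for row in rows:
        if len(row) < 2 or any("well-known" in cell for cell in row):
            continue  # skip short rows and unknown values
        qid = QID.search(row[0])
        point = POINT.search(row[1])
        if qid and point:
            cleaned.append([qid.group(0), point.group(0)])
    return cleaned


def clean_file(src="query.tsv", dst="kaplicky.tsv"):
    """Read the downloaded TSV, clean it, and write the file for csv-reconcile."""
    with open(src, newline="", encoding="utf-8") as f:
        rows = list(csv.reader(f, delimiter="\t"))
    with open(dst, "w", newline="", encoding="utf-8") as f:
        csv.writer(f, delimiter="\t").writerows(clean_rows(rows))


def to_wkt_point(lat, lon):
    """Format a coordinate pair for the OpenRefine column: longitude first."""
    return f"Point({lon} {lat})"
```

Note that the Point(...) format puts the longitude first and the latitude second, so to_wkt_point(49.8405722, 17.9703308) gives Point(17.9703308 49.8405722).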
