Wikidata:WikidataCon 2017/Submissions/Data import: An overview of the current system, and idea exchange for the future direction

This is an Open submission for WikidataCon 2017 that has not yet been reviewed by the members of the Program Committee.

Submission no. 14
Title of the submission
Data import: An overview of the current system, and idea exchange for the future direction.

Author(s) of the submission
E-mail address
  • navino at histropedia.com
  • mrjohncummings at gmail.com
Country of origin
  • United Kingdom
  • France
Affiliation, if any (organisation, company etc.)

UNESCO


Type of session
Workshop
Length of session
1 hour
Ideal number of attendees
15 - 30
EtherPad for documentation
https://etherpad.wikimedia.org/p/WikidataCon-14

Abstract

This session is a workshop intended to bring together Wikidata wizards and enthusiasts who can help resolve some of the major pain points in the data import process. For example:

  • It’s very difficult to keep Wikidata ‘in sync’ with an external data source (especially when the source has no unique ID system!).
  • There’s no way to easily report metrics related to a particular data import.
  • It’s hard to find which data has been changed since a previous import.
  • A lot of the data processing can only be done by people with advanced spreadsheet and/or coding skills.
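
To make the "what has changed since the previous import" problem concrete, here is a minimal sketch in Python. It assumes each import can be saved as a snapshot keyed by a stable external ID, which is exactly what some sources lack. The function name, the record layout and the sample IDs are all hypothetical; none of this is part of an existing Wikidata tool.

```python
from typing import Dict, NamedTuple, Set

# One record per external entry: field name -> value (hypothetical layout).
Record = Dict[str, str]


class ImportDiff(NamedTuple):
    added: Set[str]    # IDs present only in the current snapshot
    removed: Set[str]  # IDs present only in the previous snapshot
    changed: Set[str]  # IDs in both snapshots whose fields differ


def diff_snapshots(previous: Dict[str, Record],
                   current: Dict[str, Record]) -> ImportDiff:
    """Compare two snapshots of an external source, keyed by external ID."""
    prev_ids = set(previous)
    curr_ids = set(current)
    changed = {
        ext_id
        for ext_id in prev_ids & curr_ids
        if previous[ext_id] != current[ext_id]
    }
    return ImportDiff(curr_ids - prev_ids, prev_ids - curr_ids, changed)


# Hypothetical sample data, for illustration only.
previous = {
    "A1": {"name": "Site one", "country": "France"},
    "A2": {"name": "Site two", "country": "Peru"},
}
current = {
    "A1": {"name": "Site one", "country": "France"},
    "A2": {"name": "Site two (extended)", "country": "Peru"},
    "A3": {"name": "Site three", "country": "Kenya"},
}

diff = diff_snapshots(previous, current)
print(len(diff.added), len(diff.removed), len(diff.changed))
```

The sizes of the three sets also give a crude per-import metric. Without a stable external ID the comparison has nothing reliable to key on, which is why that pain point comes first in the list.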

The workshop will be framed around a presentation in which we share our experience of importing data from UNESCO and show the data import documentation that has been created as a result.

Tool development will play a huge part in solving the problems with data import. Central to our discussion will be where existing tools (e.g. QuickStatements, Mix’n’Match, the Wikidata Query Service) fit into the process, and how things may need to change when GLAMPipe is ready to become the hub for data imports.

The hope is that we can come away from this session with a plan that will help shape future decisions about the data import process, documentation and related tools.

What will attendees take away from this session?
  1. Know how to use the current data import process
  2. Understand the challenges and limitations of the current approach
  3. Be inspired to help develop documentation and/or tools further
Slides or further information
Special requests

Interested attendees

If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest.

  1. seav (talk) 23:09, 31 July 2017 (UTC)
  2. Carlojoseph14 (talk) 15:51, 8 August 2017 (UTC)
  3. --Sky xe (talk) 13:29, 23 October 2017 (UTC)
  4. a_ka_es