Wikidata:WikidataCon 2017/Submissions/Wikidata quality: a data consumers' perspective

From Wikidata
Jump to navigation Jump to search

Pictogram voting info.svg This is an Open submission for WikidataCon 2017 that has not yet been reviewed by the members of the Program Committee.

Submission no. 95
Title of the submission
Wikidata quality: A data consumers' perspective

Author(s) of the submission
E-mail address

alessandro.piscopo at

Country of origin

United Kingdom

Affiliation, if any (organisation, company etc.)

University of Southampton

Type of session


Length of session

1 hour

Ideal number of attendees


Slides for this session are available at
EtherPad for documentation


Data quality is an important topic for Wikidata, as the number of initiatives and projects around this topic testifies. To name just a few, the Item quality campaign relied on the work of the community to evaluate Items using a single-grading label scheme. The CoolWD project focuses on a particular dimension of quality, providing users with information about the completeness of the results of a query and allowing them to add this information to Wikidata. Furthermore, I have previously sought in a Request for Comment (Data quality framework for Wikidata) to gather opinions from the Wikidata community in order to create an appropriate data quality framework for this platform, which would be rooted in prior scientific literature and distinguish several quality dimensions.

All these projects focus either on measuring data quality under various viewpoints or on generating a conceptualisation of data quality in Wikidata. They are essential to our understanding of Wikidata, as they explore different aspects of its quality. Nevertheless, data quality is most commonly defined as "fitness for purpose". As such, it is seen from the point of view of data consumers. What can be an acceptable degree of completeness or accuracy for e.g. providing tourist information, it is not enough when it comes to using the data to provide medical advice. Therefore, for a comprehensive understanding of what data quality means in Wikidata we need to have a clear overview of how this is used as a resource.
Specifically, the aims of this session will be to:

  • identify typologies of data consumers for Wikidata;
  • gain an overview about the needs of each data consumer type and of the quality issues they experience.

This session is open to everyone interested in Wikidata. However, it would be ideal to have a mixed audience, with member of the Wikidata community and professionals using this project as a data resource, in order to facilitate the exchange of different points of view. The presence of both practitioners using Wikidata as individuals and members of organisations would be highly beneficial.
The session will be structured in three parts:

  1. Short introduction by the author of the submission about data quality-related projects concerning Wikidata (10-15 min.);
  2. Open discussion, where the attendees will be invited to report their experiences and express their ideas about the topic (35 min.);
  3. Summing up of the discussion and final remarks (10-15 min.).

What will attendees take away from this session?

The session should be intended as a chance for data contributors and data consumers to exchange opinions about their different perspectives. I am aware that these two categories often overlap though. This will be a chance to think separately about these two roles.
What attendees take away:

  1. Wikidata contributors will gain an overview of the information needs of data consumers using Wikidata as a resource.
  2. Furthermore, they will gain an overview of the quality issues that are more relevant for data consumers using Wikidata as a resource.
  3. Data consumers will get an understanding of how contributors work and how Wikidata knowledge graph is built.

Interested attendees[edit]

If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest.

  1. Frimelle (talk) 14:31, 31 July 2017 (UTC)
  2. seav (talk) 15:37, 31 July 2017 (UTC)
  3. Andreasm háblame / just talk to me 21:44, 31 July 2017 (UTC)
  4. Ijon (talk) 04:21, 1 August 2017 (UTC)
  5. Maxlath (talk)
  6. Sic19 (talk) 21:05, 20 August 2017 (UTC)
  7. Salgo60 (talk) 17:01, 11 October 2017 (UTC)
  8. Atudu (talk) 04:25, 26 October 2017 (UTC)
  9. Rehman 11:15, 27 October 2017 (UTC)