Wikidata:WikiProject Scholia/February 2022 hackathon

From Wikidata
Jump to navigation Jump to search
February 2022 Scholia Hackathon
Event details
Date:Monday and Wednesday, 14 and 16 February 2022
Time1300 CET - 1800 CET
Where:https://virginia.zoom.us/my/wikilgbt
.

The February 2022 Scholia Hackathon is an event for Scholia developers to meet, address pending issues and pull requests, and do strategic planning for future tool development

Agenda[edit]

Day 1 (Monday 14)[edit]

Day 2 (Wednesday 16)[edit]

About the project[edit]

Scholia is a tool for browsing research records based on Linked Open Data available through Wikidata. Some of its functionality is comparable to other bibliographic databases, but in contrast to them, Scholia is based on open-source visualizations of community-curated open data. Scholia supports these curation efforts in various ways and benefits from them.

Scholia development takes place in Python (Flask) and JavaScript, and the tool itself presents data visualizations of SPARQL queries. This is reflected in GitHub labels for Python, JavaScript and SPARQL.

Links[edit]

Outcomes[edit]

Carlin[edit]

Carlin's previous contributions included Scholia design and some fixes to Scholia aspects. He had some pending pull requests which could not be implemented due to being in conflict with other updates. Carlin aligned those previous changes into the current codebase, resulting in 3 PR being closed. One of those changes enable Scholia to present data from Citation Typing Ontology (Q44955364), another added a button link to Scholia's curation pages which give easier access to the editing interface for adding or correcting data, and the last was a contribution to a big pull request related to a global variable which touches every SPARQL query.

Wolfgang Fahl[edit]

Wolfgang is integrating research conference data into Wikidata. He has a pilot dataset of 50k conference proceedings, and has access to about 700k more records for the future if and when Wikidata is able to meaningfully accept them.

A challenge to address is the difference in modeling conference proceedings versus conference data, as many records conflate these two. "Conference data" often is the record of who presents at conferences and the titles of their talks, and "conference proceedings" are often the text publication of the research papers presented at a conference. Often the events and the publications have the same names and it may not be useful to publish both identical datasets; however sometimes they are different and worth distinguishing. Open Research Community (Q109908486) for example does not separate the proceedings from the event in their data.

There are Wikidata precedents for managing conference data:

Wolfgang is interested in knowledge graphs around conferences. Projects include

See also[edit]