Wikidata:WikiProject Scholia/February 2022 hackathon
Event details | |
---|---|
Date: | Monday and Wednesday, 14 and 16 February 2022 |
Time: | 13:00 - 18:00 CET |
Where: | https://virginia.zoom.us/my/wikilgbt |
The February 2022 Scholia Hackathon is an event for Scholia developers to meet, address pending issues and pull requests, and plan strategically for future tool development.
Agenda
Day 1 (Monday 14)
- Attempt to close open pull requests
- https://github.com/WDscholia/scholia/pulls
- Many of these involve merge conflicts that require detailed attention.
- You can help by reviewing these pull requests or perhaps updating, rebasing, splitting or otherwise reorganizing them.
- Documentation
- on-wiki: Wikidata:Scholia and Wikidata:WikiProject Scholia
- on GitHub: README.rst, CONTRIBUTING.rst
- on-site (e.g. FAQ, or the text on individual aspect pages)
Day 2 (Wednesday 16)
- Continue with Day 1 activities as needed
- Any open issues
- Discuss community matters
- Plan for future development
- Use cases
- Software, e.g. Jupyter notebooks
- Events
- Invasion biology
- Hypotheses
About the project
Scholia is a tool for browsing research records based on Linked Open Data available through Wikidata. Some of its functionality is comparable to other bibliographic databases, but in contrast to them, Scholia is based on open-source visualizations of community-curated open data. Scholia supports these curation efforts in various ways and benefits from them.
Scholia development takes place in Python (Flask) and JavaScript, and the tool itself presents data visualizations of SPARQL queries. This is reflected in GitHub labels for Python, JavaScript and SPARQL.
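The pattern of filling a Wikidata item identifier into a SPARQL query can be sketched in Python as follows. This is a minimal illustration, not Scholia's actual code: the names `TOPIC_WORKS_QUERY`, `build_topic_query` and `query_url` are hypothetical, the query (works whose main subject, P921, is a given item) is chosen only as an example, and the sketch builds the request URL for the public Wikidata Query Service without sending it.

```python
import re
from string import Template
from urllib.parse import urlencode

# Public Wikidata Query Service endpoint.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# Hypothetical query template: works whose main subject (P921) is the given item.
TOPIC_WORKS_QUERY = Template("""
SELECT ?work ?workLabel WHERE {
  ?work wdt:P921 wd:$qid .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
LIMIT $limit
""")


def build_topic_query(qid, limit=10):
    """Return the SPARQL text for one item, rejecting malformed identifiers."""
    if not re.fullmatch(r"Q[1-9]\d*", qid):
        raise ValueError(f"not a Wikidata item identifier: {qid!r}")
    return TOPIC_WORKS_QUERY.substitute(qid=qid, limit=int(limit)).strip()


def query_url(sparql):
    """Return a GET URL asking the endpoint for JSON results (no request is made)."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": sparql, "format": "json"})


if __name__ == "__main__":
    # Q44955364 is the Citation Typing Ontology item mentioned on this page.
    print(query_url(build_topic_query("Q44955364", limit=5)))
```

Validating the identifier before substitution keeps arbitrary text out of the query; the resulting URL can be opened in a browser or fetched with any HTTP client.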
Links
- https://scholia.toolforge.org/
- GitHub repository https://github.com/WDscholia/scholia
- ticket for this hackathon: https://github.com/WDscholia/scholia/issues/1807
- Zoom room https://virginia.zoom.us/my/wikilgbt
- Wikimedia Etherpad https://etherpad.wikimedia.org/p/scholia-hackathon-february-2022
Outcomes
Carlin
Carlin's previous contributions included Scholia design work and some fixes to Scholia aspects. He had some pending pull requests that could not be merged because they conflicted with other updates. Carlin aligned those earlier changes with the current codebase, resulting in three pull requests being closed. One of those changes enables Scholia to present data from Citation Typing Ontology (Q44955364); another added a button link to Scholia's curation pages, which give easier access to the editing interface for adding or correcting data; and the last was a contribution to a large pull request concerning a global variable that touches every SPARQL query.
Wolfgang Fahl
Wolfgang is integrating research conference data into Wikidata. He has a pilot dataset of 50k conference proceedings, and has access to about 700k more records for the future if and when Wikidata is able to meaningfully accept them.
A challenge to address is the difference between modeling conference proceedings and modeling conference data, as many records conflate the two. "Conference data" is often the record of who presents at a conference and the titles of their talks, while "conference proceedings" are often the published text of the research papers presented there. Often the event and the publication share a name, and it may not be useful to publish two identical datasets; sometimes, however, they differ and are worth distinguishing. Open Research Community (Q109908486), for example, does not separate the proceedings from the event in its data.
There are Wikidata precedents for managing conference data:
- Simon Cobb in Spring 2021 uploaded 7000 records of conferences to proceedings
- Finn shared that one conference can have several proceedings, for example https://scholia.toolforge.org/event-series/Q17012957
- There is Wikidata:WikiProject Events, with a related discussion
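The distinction between an event and its proceedings, including the case of one conference with several proceedings, can be sketched as a small data model. This is only an illustration under stated assumptions: the class names, fields and the `split_conflated` helper are hypothetical and do not reflect the Wikidata data model or any existing bot.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Proceedings:
    """A publication: the collected papers presented at a conference."""
    title: str


@dataclass
class ConferenceEvent:
    """The event itself, linked to zero or more proceedings volumes."""
    title: str
    proceedings: List[Proceedings] = field(default_factory=list)


def split_conflated(title: str, volume_titles: Optional[List[str]] = None) -> ConferenceEvent:
    """Turn a record that conflates event and publication into two linked kinds of object.

    With no explicit volume titles, the single proceedings volume reuses the
    event title, mirroring the common case where both share one name.
    """
    titles = volume_titles if volume_titles else [title]
    return ConferenceEvent(title=title, proceedings=[Proceedings(t) for t in titles])


if __name__ == "__main__":
    # Hypothetical example records, not real data.
    single = split_conflated("Example Conference 2021")
    multi = split_conflated("Example Conference 2021", ["Volume 1", "Volume 2"])
    print(single)
    print(multi)
```

Keeping the event and the publication as separate objects lets identical names coexist without forcing every source to publish both datasets.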
Wolfgang is interested in knowledge graphs around conferences. Projects include:
- Wikidata:Requests for permissions/Bot/ConferenceCorpusBot
- Wikidata:Property proposal/DBLP event ID
- The curation workflow is described and visualized
- Example conference attendee profile, matching a person to their conferences (more examples)