Wikidata:WikiProject LD4 Wikidata Affinity Group/Affinity Group Calls/Meeting Notes/2021-11-16
Call details
- Date: 2021-11-16
- Topic: Adding journal articles to Wikidata
- Presenter: Éder Porto
- Link to original agenda with link to recording: https://docs.google.com/document/d/1LjxZ01AUOIJItcwgXTWAIFblel9js_XjwG4aWArD-Zk/edit
Presentation material
None
Notes
- Éder is a mathematician, software developer & graphic designer
- Presentation today based on chapter published in Wikipedia and Academic Libraries: A Global Project [[Q107609044]]: https://en.wikisource.org/wiki/Wikipedia_and_Academic_Libraries:_A_Global_Project/Chapter_17
- Introduction:
- Lay out context of what challenge they are trying to overcome (putting journal articles on Wikidata)
- Discrepancy in the number of citations between Global North (investment in research and academic communities) versus Global South
- In Brazil, only 77% of museums & 75% of libraries have Internet access, while 98% of US libraries do.
- In Brazil, most GLAM institutions have 1-2 employees and a small budget.
- Investment in cultural institutions by Brazilian federal government has been decreasing since 2019 due to a new administration. Annual budget laws: https://w.wiki/4Q45
- Objective of project: Motivate Global South and underrepresented communities to deposit scholarly articles into Wikidata at scale
- Challenges:
- Global North is well served with metadata structure while Global South operates ad hoc.
- Article references in Wikipedia can be difficult as they are language dependent. (CiteQ template trying to address this: https://en.wikipedia.org/wiki/Template:Cite_Q)
- Sources on Wikimedia projects
- Presented two charts showing the number of scholarly articles on Wikidata at two different points in time
- Context: Chose to work with a historical journal in Brazil: Anais do Museu Paulista (published since 1922)
- Importing Scholarly Articles into Wikidata via Zotero
- 8 steps for importing process
- Download Zotero
- Import a set of articles into Zotero
- Create an account on Wikimedia Projects
- Download, install, and set up QuickStatements translator
- Check for duplicates in Wikidata
- Upload to Wikidata via QuickStatements
- Check Completeness of Item Properties using Wikidata Query Service
- Add the “Journal of Publication” statement using PetScan
- Authors & references not added by Zotero translator
- Data Visualization
- Scholia profile for the Anais do Museu Paulista: https://scholia.toolforge.org/venue/Q50426299
- Once this work is done, more information becomes visible in the Scholia profile, such as the co-author graph, number of pages, etc.
- This is a visible benefit of importing articles into Wikidata that isn’t available anywhere else
- One outcome of this work: GLAM CEPID NeuroMat [[Q101001059]], a GLAM partnership of the University of São Paulo: https://www.wikidata.org/wiki/Q101001059
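
To make the "Upload to Wikidata via QuickStatements" step more concrete, here is a minimal Python sketch of the kind of tab-separated V1 commands QuickStatements accepts for one new article item. It is hand-written for illustration and is not the Zotero translator used in the presentation, whose exact output was not recorded in these notes. The `Article` class, the helper names, and the sample title and DOI are hypothetical placeholders. The property and item IDs are standard Wikidata ones (P31 instance of, Q13442814 scholarly article, P1476 title, P356 DOI, P577 publication date, P1433 published in), and Q50426299 is the journal QID taken from the Scholia link above. In the presented workflow the journal statement was added afterwards with PetScan, so including P1433 here is a simplification.

```python
from dataclasses import dataclass

SCHOLARLY_ARTICLE = "Q13442814"  # Wikidata item "scholarly article"
VENUE_QID = "Q50426299"          # journal QID from the Scholia URL in the notes


@dataclass
class Article:
    """Hypothetical minimal record for one article exported from a reference manager."""
    title: str
    lang: str   # language code of the title, e.g. "pt"
    doi: str
    year: int


def quote(text: str) -> str:
    """Wrap a value in double quotes, collapsing tabs/newlines that would break the TSV."""
    if '"' in text:
        # Embedded quotes are out of scope for this sketch.
        raise ValueError("double quotes in values are not handled here")
    return '"' + " ".join(text.split()) + '"'


def to_quickstatements(article: Article) -> list[str]:
    """Return QuickStatements V1 lines that create one item and add basic statements."""
    title = quote(article.title)
    rows = [
        ["CREATE"],
        ["LAST", f"L{article.lang}", title],               # label
        ["LAST", "P31", SCHOLARLY_ARTICLE],                # instance of
        ["LAST", "P1476", f"{article.lang}:{title}"],      # title (monolingual text)
        # Wikidata conventionally stores DOIs in uppercase.
        ["LAST", "P356", quote(article.doi.upper())],
        # /9 marks year precision.
        ["LAST", "P577", f"+{article.year:04d}-01-01T00:00:00Z/9"],
        ["LAST", "P1433", VENUE_QID],                      # published in
    ]
    return ["\t".join(row) for row in rows]


if __name__ == "__main__":
    # Placeholder data, not a real article from the journal.
    sample = Article(title="Exemplo de artigo", lang="pt",
                     doi="10.0000/example.1", year=2020)
    print("\n".join(to_quickstatements(sample)))
```

The printed lines can be pasted into a QuickStatements batch for review before running. Authors are deliberately left out, matching the note above that authors and references were handled after the items were created.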
Questions
- Did you attempt to put a lot of articles into Wikidata manually before embarking on project?
- A: The work began as an experiment; they developed the methodology as they progressed, and this process was the result.
- How about the authors’ information? Did you add authors and affiliations to Wikidata?
- A: For authors’ citations and place of publication, this is a matching problem that Zotero doesn’t have the means to solve, so they first created the articles and then worked on author and citation values afterwards.
- What was the size of the data set?
- A: 552 articles from the journal. He uploads recently published articles at the end of the year.
- Do you know if many of the articles that you are interested in have DOIs?
- A: All of the items have DOIs. Articles without identifiers are sometimes a problem in the Global North as well as the Global South. Unfortunately, Zotero only searches for 4 identifiers: DOIs, arXiv, PMIDs, ISBNs. Articles without identifiers have to be added one by one.
- Is it possible to download the metadata from SciELO in bulk for this or other journals?
- A: The articles are stored in SciELO, and it sounds like that would be a good option.
- Éder demonstrated a live upload
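
Since all of the articles in the data set have DOIs, the "Check for duplicates in Wikidata" and "Check Completeness of Item Properties using Wikidata Query Service" steps could plausibly be keyed on the DOI property (P356). The sketch below only builds SPARQL query strings that could be pasted into the Wikidata Query Service; it is an assumption about how such checks might look, not the queries shown in the presentation. The function names and sample DOIs are hypothetical.

```python
import re


def _doi_values(dois):
    """Render DOIs as a space-separated list of SPARQL string literals."""
    literals = []
    for doi in dois:
        if '"' in doi or "\\" in doi:
            raise ValueError(f"unsupported character in DOI: {doi!r}")
        # Wikidata conventionally stores DOIs in uppercase.
        literals.append(f'"{doi.strip().upper()}"')
    return " ".join(literals)


def duplicate_check_query(dois):
    """Query for existing items that already carry one of the given DOIs."""
    return (
        "SELECT ?item ?doi WHERE {\n"
        f"  VALUES ?doi {{ {_doi_values(dois)} }}\n"
        "  ?item wdt:P356 ?doi .\n"
        "}"
    )


def missing_property_query(dois, prop):
    """Query for items with one of the given DOIs that lack any value for `prop`."""
    if not re.fullmatch(r"P[1-9]\d*", prop):
        raise ValueError(f"not a property ID: {prop!r}")
    return (
        "SELECT ?item ?doi WHERE {\n"
        f"  VALUES ?doi {{ {_doi_values(dois)} }}\n"
        "  ?item wdt:P356 ?doi .\n"
        f"  FILTER NOT EXISTS {{ ?item wdt:{prop} [] }}\n"
        "}"
    )


if __name__ == "__main__":
    sample_dois = ["10.0000/example.1", "10.0000/example.2"]  # placeholders
    print(duplicate_check_query(sample_dois))
    print()
    print(missing_property_query(sample_dois, "P577"))  # P577 = publication date
```

Running the first query before the upload lists any DOIs that already have an item. Running the second after the upload, once per expected property, lists the items that still need that statement.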