Topic on User talk:TiagoLubiana/Structured Discussions Archive 1

Jump to navigation Jump to search
Mxn (talkcontribs)

Given your work on updating case count statements automatically with a bot, I was wondering if you'd be interested in helping to automate counts about U.S. counties from sources other than Johns Hopkins.

For weeks, this table of cases in California by county has been updated manually and piecemeal by various users. I'm not sure if it's been updated consistently. As a nod to automation, I've converted a couple rows to use tabular data, but the underlying tables so far need to be updated manually using a script. I have a Toolforge account and can develop a bot to update those tables, but managing all 55 counties, each with their own data table, would be a bit daunting.

For counties where Wikipedia isn't presenting a time series chart, I think it would suffice to keep a Wikidata statement up-to-date with the latest numbers. The Corona Data Scraper project should make it feasible to obtain current numbers for all the counties. Then it's a matter of feeding the numbers into Wikidata via bot. I could look into that eventually, but I figure you're further along and I'd rather not duplicate efforts.

I'm interested to hear your thoughts about where else we could streamline these mini data projects as well.

TiagoLubiana (talkcontribs)

Hey, thanks for the messageǃ I am actually a bit overloaded .

I am not familiar of how to add tabular data on commons. Most of the work on this case count bot has actually been done by @jvcavv on this GitHub pageː https://github.com/lubianat/wikidata_covid19/tree/master/sandbox/worldwide_data/src . Maybe he is in a better position to aid this project.


Moving the data from the corona scraper to Wikidata seems straight forward, but I would not know how to do it for the .tab items.


Jvcavv (talkcontribs)

Hey, sorry for taking so long to answer, I'm actually a bit overworked too.

I'm also not familiar on adding tabular data to commons, but if you intend on creating quickstatements to update the items on wikidata instead, I believe the script we're using to batch update case/death counts for countries (The one Tiago linked above) can be adapted to take in data coming from Corona DataScraper. That seems pretty straight forward and by the end you'd have a text file containing the quickstatements to do the batch update, but if you need any help feel free to hit me up.

Reply to "Case count automation"