Hi Pintoch! I've been doing a bit of author/title matching against VIAF, LoC, and ISNI using a nest of Perl scripts, trying to match authors from the MARC 100 field of a book's catalogue entry to VIAF IDs, LCNAF IDs, ISNIs, and Wikidata items (for cases where the book does not yet have a Wikidata item, as a step towards creating one).
There are various reconciliators that try to do author matching against these services, eg:
- VIAF: http://iphylo.blogspot.com/2013/04/reconciling-author-names-using-open.html
- VIAF (2): https://github.com/codeforkjeff/conciliator
- ISNI: https://github.com/cmharlow/isni-reconcile
- LoC: https://github.com/mphilli/LoC-reconcile
- LoC (2): https://github.com/cmharlow/lc-reconcile
- LoC (3) (via VIAF): https://github.com/codeforkjeff/conciliator
How big a job would it be to create an author/title reconciliator, rather than just an author reconciliator?
And also, to extend what these reconciliators do, so that they can retrieve foreign IDs from these services (eg LoC IDs from VIAF), in the way that eg the Wikidata reconciliator can add columns for the values of Wikidata properties based on a match?
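To make the author/title idea a bit more concrete, here is a rough sketch of the kind of query batch I have in mind. It assumes the service accepts the usual OpenRefine reconciliation query shape (a `queries` form parameter holding JSON keyed `q0`, `q1`, ...), with the author name as the main query and the title passed along as an extra property. The property id `title` and the helper `build_queries` are made up for illustration; I don't know that any of the services listed above accept such a property today.

```python
import json


def build_queries(rows, title_pid="title", limit=3):
    """Build a batch of reconciliation queries keyed q0, q1, ...

    rows is a list of (author, title) pairs, e.g. taken from MARC 100 / 245.
    title_pid is a hypothetical property id that an author/title
    service would have to define and understand.
    """
    queries = {}
    for i, (author, title) in enumerate(rows):
        query = {"query": author, "limit": limit}
        if title:
            # Pass the title as a property value so the service can use it
            # to disambiguate between authors with similar names.
            query["properties"] = [{"pid": title_pid, "v": title}]
        queries[f"q{i}"] = query
    return queries


rows = [
    ("Austen, Jane, 1775-1817", "Pride and prejudice"),
    ("Dickens, Charles, 1812-1870", ""),  # no title available for this row
]

# Form payload that would be POSTed to the service endpoint.
payload = {"queries": json.dumps(build_queries(rows))}
```

The point of keeping the author as the main query is that existing author-only services could simply ignore the extra property, while an author/title service could use it for scoring.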
Is there enough support in the community that this could be offered eg as a student project for a Digital Humanities student? Or would writing/adapting an OpenRefine reconciliator be rather too big an ask?
The British Library quite liked the rough samples from my Perl scripts, but they're a bit close to the metal, whereas an OpenRefine reconciliator could be something that anybody could use. What would be your instincts on this?