Wikidata talk:WikiProject Periodicals

From Wikidata
Jump to navigation Jump to search

Issues[edit]

Please see User:Research_Bot/issues --- Jura 10:04, 7 September 2019 (UTC)

Received date ?[edit]

What property is suggested for the received date (date the article was received by the journal, months before the actual publication)? I started using production date (P2754) for that, but maybe a dedicated property is needed. --- Jura 10:04, 7 September 2019 (UTC)

Keywords[edit]

I am wondering whether to propose a property called keywords. Yes, we have main subject (P921), but in many cases the string which you need as a keyword does not have an item and is even nonlogic to create an item for it. --Juandev (talk) 01:47, 9 September 2019 (UTC)

@Juandev: take a look at Wikidata:Property proposal/subject facet which did not pass. If you do propose a new “keyword” property, you should probably address the issues raised here. - PKM (talk) 22:59, 4 October 2019 (UTC)

I have seen that and I think my proposal is different as the dataset is string. What issues were rised there? --Juandev (talk) 16:59, 6 October 2019 (UTC)

Magazine[edit]

Hello,

I’m looking into sorting out a bunch of video game magazine (Q69662460) a bit, and before I start, I want to make sure the following model is good :)

Jean-Fred (talk) 16:18, 4 October 2019 (UTC)

(I ask in particular because genre (P136) and country of origin (P495) are not mentioned on Wikidata:WikiProject Periodicals Jean-Fred (talk) 16:20, 4 October 2019 (UTC))
@Jean-Frédéric:Don't use country of origin (P495), use place of publication (P291) instead. Don't use - instance of (P31) magazine (Q41298)+- genre (P136) video game magazine (Q69662460) but rather - instance of (P31) magazine (Q41298)+- instance of (P31) video game magazine (Q69662460). Also, maybe you have an ISSN (P236) for these magazines ?. Tfrancart (talk) 07:32, 18 October 2019 (UTC)
I have also seen field of work (P101) used on some magazines-items − does not feel right to me but has some usage (although way less than main subject (P921) Jean-Fred (talk) 16:37, 4 October 2019 (UTC)
And also, there is plenty of usage of start time (P580) and end time (P582) − just double-confirming that they are not to use in this case. Jean-Fred (talk) 17:23, 4 October 2019 (UTC)
SELECT ?item ?itemLabel ?field_of_work ?field_of_workLabel WHERE {
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
  ?item (wdt:P31/(wdt:P279*)) wd:Q1002697;
    wdt:P101 ?field_of_work.
}

Try it!

@Tfrancart: not sure to understand your propositions. Of course place of publication (P291) is better country of origin (P495) in general but here, if the value is a country, country of origin (P495) could work too, not sure here maybe both is the best heere? And for video game magazine (Q69662460),I disagree, it's definitely a value for {{P|136} and not for instance of (P31). Cheers, VIGNERON (talk) 08:29, 18 October 2019 (UTC)
@VIGNERON: country of origin (P495) is not wrong, place of publication (P291) is just more precise, and is actually a recommended property for periodicals, see the first table in this very same page. My suggestion for - instance of (P31) video game magazine (Q69662460) is based on the fact that video game magazine (Q69662460) is actually a class, subclass of magazine (Q41298), hence should be referenced through instance of (P31) too; one could see this as a matter of taste; personnally I would stick to using instance of (P31) everytime I refer to a class. Is there some commonly agreed rule for this ? Cheers Tfrancart (talk) 08:58, 18 October 2019 (UTC)
@Tfrancart: Sure, will add P291 (with the mutual-understanding that ideally it should be more precise than country level) but rather on top of P136 − it’s handy to quickly request the country without having to climb up the class tree.
Regarding video game magazine (Q69662460): I see what you mean, but I don’t think the implication is necessarily true: for example, we do have Citizen Kane (Q24815) instance of (P31) film (Q11424) and Citizen Kane (Q24815) genre (P136) drama film (Q130232), even though drama film (Q130232) is a subclass of film (Q11424).
@Jean-Frédéric: I wouldn't take the existing practices on this subject as the definite or the good practice. I am interested in this question, and I will probably escalate it to the Ontology Wikidata project. I would like to see reference to past discussions and agreement on this matter. Of course this is in no way related to your magazine project, and this should not prevent you to proceed with your upload of magazine descriptions. As a side note : if "video game magazine" was to be referenced as a "genre", it would be named "video game", not "video game magazine"; maybe both point of views can co-exist : the "video game magazines" could be defined as the "set of magazines for which genre = video game", or something like that. Tfrancart (talk) 13:37, 28 October 2019 (UTC)
@Tfrancart: Well, sure, data modeling practices are not set in stone never to be revisited :-) ; however, to the best of my knowledge, the use of genre (P136) I described is the currently agreed-upon best practice, as documented in Wikidata:WikiProject_Video_games/Properties, Wikidata:WikiProject_Movies/Properties and Wikidata:WikiProject_Music. Jean-Fred (talk) 22:16, 4 November 2019 (UTC)
@Jean-Frédéric: I am not saying that the use of genre (P136) in itself is bad. I am saying that if you use genre (P136), it should reference a genre, not a class. Take the example of genre (P136) given in Wikidata:WikiProject_Music : Night Train (Q7033524) genre (P136) blues (Q9759). blues (Q9759) is not a class; it is an instance of music genre (Q188451), and this makes all the difference. Tfrancart (talk) 09:24, 5 November 2019 (UTC)
And wait, I see that video game magazine (Q69662460) is both a class and a genre, so I think we're fine and I'm happy :-) Tfrancart (talk) 16:08, 5 November 2019 (UTC)
(Also, the outcome is at Wikidata:WikiProject Video games/Magazines − most have ISSNs).
Jean-Fred (talk) 14:36, 23 October 2019 (UTC)

Distribution format values ? (from MARC)[edit]

John Vandenberg (talk) 09:30, 2 December 2013 (UTC) Aubrey (talk) 12:15, 11 December 2013 (UTC) Daniel Mietchen (talk) 12:47, 11 December 2013 (UTC) Micru (talk) 13:09, 11 December 2013 (UTC) DarTar (talk) 01:37, 15 January 2014 (UTC) Maximilianklein (talk) 00:23, 28 March 2014 (UTC) Mvolz (talk) 08:10, 20 July 2014 (UTC) Andy Mabbett (Pigsonthewing); Talk to Andy 22:17, 27 July 2014 (UTC) Mattsenate (talk) 17:26, 14 August 2014 (UTC) author  TomT0m / talk page JakobVoss (talk) 14:25, 16 June 2016 (UTC) Mahdimoqri (talk) 08:04, 5 April 2018 (UTC) Jsamwrites Dig.log Sic19 (talk) 22:46, 12 July 2017 (UTC) Andreasmperu Nomen ad hoc Pete F (talk) 99of9 Mfchris84 (talk) 09:02, 26 November 2018 (UTC) Runner1928 (talk) 17:22, 1 December 2018 (UTC) Wittylama (talk) 09:55, 22 December 2018 (UTC) Jneubert (talk) 07:30, 22 February 2019 (UTC) --Juandev (talk) 20:28, 27 April 2019 (UTC) VIGNERON (talk) Uomovariabile (talk to me) 08:46, 24 June 2019 (UTC) SilentSpike (talk) Ecritures (talk) Tfrancart (talk) Dick Bos (talk) 10:47, 30 January 2020 (UTC) Pictogram voting comment.svg Notified participants of WikiProject Periodicals


Hello,

I am working on pushing part of the ISSN Portal (Q70460099) to Wikidata (teaser - more on this later). As you may know, an ISSN is assigned to a specific format of a periodical, and typically the paper version and the electronic version of the same periodical receive 2 different ISSNs. I plan on capturing these with a distribution format (P437) qualifier on the ISSN (P236) statement on these magazines.

My question is which values should I use for these distribution formats ? Here is the list, sorted by reverse number of usage in the ISSN register, with my mapping proposal for some values. I am not looking for a 100% perfect mapping, simplifications for rarely used values are welcomed.

Marc code Value Number of instances in ISSN register Wikidata mapping
ta (Print) 1868807 printed matter (Q1261026) (most used currently)
cr (Online) 282934 online publication (Q1714118) (most used currently)
co (Optical disk) 21255 optical disc (Q234870)
tu (resource is a text (no specific material designation)) 4764 text (Q234460) (proper equivalences with Bibframe and DCMI)
zm (resource is in multiple physical forms) 2904 ?
vd (resource on videodisc) 1801 videodisc (Q764322)
t<pipe> (Print) 1610 printed matter (Q1261026) (same value as "Paper" above)
ou (resource is a kit) 1096 kit (Q811844) (although the image is incorrect, I think this entry comes from bibliograph.net, bib extension to schema.org, so definitely from the bibliographic world)
vf (resource on videocassette) 791 videotape (Q747779)
cz (electronic resource : other) 563 electronic media (Q1209283) ?
cm (electronic resource on magnetooptical disk) 513 magneto-optical drive (Q763745)
he (resource on microfiche) 502 microfiche (Q2368076)
cj (magnetic disk) 370 magnetic storage (Q1364527)
ss (resource is a sound recording on cassette) 344 Compact Audio Cassette (Q149757)
td (resource is in a loose-leaf binder) 323 could not find corresponding entry ? Note that loose-leaf exists : loose leaf (Q6676005)
tb (resource is in large print) 253 large-print (Q1548123) seems correct
tz (Print : other) 230 ?
sd (resource is a sound recording on disc) 217 disc vinyle ? to be clarified ?
tc (resource is in printed Braille) 203 braille (Q79894)
zz (resource is in another form) 133 ?

Here is the query to list the currently used distribution formats on property "ISSN" :

SELECT ?format (COUNT(?format) AS ?howMany) ?formatLabel
WHERE
{
  ?serial wdt:P31/wdt:P279* wd:Q2217301 .
  ?serial p:P236 [ ps:P236 ?issn; pq:P437 ?format ].
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE]". }
}
GROUP BY ?format ?formatLabel
ORDER BY DESC(?howMany)

Try it!

Also, I could not find a field to record the mapping of a Wikidata entry with a MARC code. Does that exists ?

Cheers Tfrancart (talk) 08:04, 18 October 2019 (UTC)

@Tfrancart: it sounds really interresting, I try to look into it in more details (and probably create the missing items). As a first remark, I see that you used CD-ROM (Q7982) twice, which doesn't seems right : if there is two value on the ISSN side, there should be two values on Wikidata two.
Thanks ! as I wrote, I am ready to accept simplifications here, but of course the more precise, the better. This is why I used CD-ROM twice.Tfrancart (talk) 09:03, 18 October 2019 (UTC)
Hello @VIGNERON: a friendly ping on that matter : did you had the time to review the list of distribution formats ? we would love to get some feedback. Thanks. Tfrancart (talk) 13:41, 28 October 2019 (UTC)
And for has been done already, you could take a look at wikimania:2019:Libraries/Converting MARC and EAD Creator Descriptions to Wikidata.
Cheers, VIGNERON (talk) 08:37, 18 October 2019 (UTC)

Data donation from ISSN Register - Feedback welcome[edit]

John Vandenberg (talk) 09:30, 2 December 2013 (UTC) Aubrey (talk) 12:15, 11 December 2013 (UTC) Daniel Mietchen (talk) 12:47, 11 December 2013 (UTC) Micru (talk) 13:09, 11 December 2013 (UTC) DarTar (talk) 01:37, 15 January 2014 (UTC) Maximilianklein (talk) 00:23, 28 March 2014 (UTC) Mvolz (talk) 08:10, 20 July 2014 (UTC) Andy Mabbett (Pigsonthewing); Talk to Andy 22:17, 27 July 2014 (UTC) Mattsenate (talk) 17:26, 14 August 2014 (UTC) author  TomT0m / talk page JakobVoss (talk) 14:25, 16 June 2016 (UTC) Mahdimoqri (talk) 08:04, 5 April 2018 (UTC) Jsamwrites Dig.log Sic19 (talk) 22:46, 12 July 2017 (UTC) Andreasmperu Nomen ad hoc Pete F (talk) 99of9 Mfchris84 (talk) 09:02, 26 November 2018 (UTC) Runner1928 (talk) 17:22, 1 December 2018 (UTC) Wittylama (talk) 09:55, 22 December 2018 (UTC) Jneubert (talk) 07:30, 22 February 2019 (UTC) --Juandev (talk) 20:28, 27 April 2019 (UTC) VIGNERON (talk) Uomovariabile (talk to me) 08:46, 24 June 2019 (UTC) SilentSpike (talk) Ecritures (talk) Tfrancart (talk) Dick Bos (talk) 10:47, 30 January 2020 (UTC) Pictogram voting comment.svg Notified participants of WikiProject Periodicals and Mattsenate (talk) 13:11, 8 August 2014 (UTC)
KHammerstein (WMF) (talk) 13:15, 8 August 2014 (UTC)
Mitar (talk) 13:17, 8 August 2014 (UTC)
Mvolz (talk) 18:07, 8 August 2014 (UTC)
Daniel Mietchen (talk) 18:09, 8 August 2014 (UTC)
Merrilee (talk) 13:37, 9 August 2014 (UTC)
Pharos (talk) 14:09, 9 August 2014 (UTC)
DarTar (talk) 15:46, 9 August 2014 (UTC)
HLHJ (talk) 09:11, 11 August 2014 (UTC)
Blue Rasberry (talk) 18:02, 11 August 2014 (UTC)
Micru (talk) 20:11, 12 August 2014 (UTC)
JakobVoss (talk) 12:23, 20 August 2014 (UTC)
Finn Årup Nielsen (fnielsen) (talk) 02:06, 23 August 2014 (UTC)
Jodi.a.schneider (talk) 09:24, 25 August 2014 (UTC)
Abecker (talk) 23:35, 5 September 2014 (UTC)
Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 14:21, 24 October 2014 (UTC)
Mike Linksvayer (talk) 23:26, 18 October 2014 (UTC)
Kopiersperre (talk) 20:33, 20 October 2014 (UTC)
Jonathan Dugan (talk) 21:03, 20 October 2014 (UTC)
Hfordsa (talk) 19:26, 5 November 2014 (UTC)
Vladimir Alexiev (talk) 15:09, 23 January 2015 (UTC)
Runner1928 (talk) 03:25, 6 May 2015 (UTC)
Pete F (talk)
econterms (talk) 13:51, 19 August 2015 (UTC)
Sj (talk)
author  TomT0m / talk page
guillom (talk) 21:57, 4 January 2016 (UTC)
·addshore· talk to me! 17:43, 18 January 2016 (UTC)
Bodhisattwa (talk) 16:08, 29 January 2016 (UTC)
Ainali (talk) 16:51, 29 January 2016 (UTC)
Shani Evenstein (talk) 21:29, 5 July 2018 (UTC)
Skim (talk) 07:17, 6 November 2018 (UTC)
PKM (talk) 23:19, 19 November 2018 (UTC)
Ocaasi (talk) 22:19, 29 November 2018 (UTC)
Trilotat Trilotat (talk) 15:43, 16 February 2019 (UTC)
NAH
Iwan.Aucamp
Pictogram voting comment.svg Notified participants of WikiProject Source MetaData and LeadSongDog (talk) 21:42, 23 March 2016 (UTC)
RobLa-WMF (talk) 01:24, 25 March 2016 (UTC)
Kosboot (talk) 20:45, 30 March 2016 (UTC)
Sydney Poore/FloNight♥♥♥♥ 15:10, 14 April 2016 (UTC)
Peaceray (talk) 18:40, 28 April 2016 (UTC)
PKM (talk) 16:29, 1 May 2016 (UTC)
Aubrey (talk) 12:42, 25 August 2016 (UTC)
Chiara (talk) 12:47, 25 August 2016 (UTC)
Marchitelli (talk) 19:02, 1 September 2016 (UTC)
YULdigitalpreservation (talk) 17:44, 9 December 2016 (UTC)
Satdeep Gill (talk) 14:59, 2 February 2017 (UTC)
Pintoch (talk) 09:44, 28 February 2017 (UTC)
Raymond Ellis (talk) 16:06, 1 April 2017 (UTC)
Crazy1880 (talk) 18:21, 16 June 2017 (UTC)
T Arrow (talk) 07:55, 22 June 2017 (UTC)
GerardM (talk) 08:25, 30 July 2017 (UTC) With a particular interest of opening up sources about Botany and opening up any freely licensed publications.
Clifford Anderson (talk) 18:26, 11 August 2017 (UTC)
Jsamwrites (talk) 07:52, 27 August 2017 (UTC)
Krishna Chaitanya Velaga (talk) 09:52, 19 September 2017 (UTC)
Capankajsmilyo (talk) 18:32, 19 September 2017 (UTC)
Hsarrazin (talk) 20:41, 15 October 2017 (UTC)
Mlemusrojas (talk) 10:15, 6 December 2017 (UTC)
Samat (talk)
Ivanhercaz Plume pen w.png (Talk) 20:27, 25 December 2017 (UTC)
Simon Cobb (User:Sic19 - talk page) 21:20, 21 January 2018 (UTC)
Mahdimoqri (talk) 20:22, 26 March 2018 (UTC)
Maria zaos (talk) 18:45, 9 April 2018 (UTC)
Jaireeodell (talk) 14:07, 23 April 2018 (UTC)
Egon Willighagen (talk) 12:29, 10 May 2018 (UTC)
RobinMelanson (talk) 2:13, 25 November 2018 (UTC)
Vladimir Alexiev (talk) 03:02, 4 December 2018 (UTC) interested, in particular because of TRR project https://m.wikidata.org/wiki/Q56259739
Maxlath (talk) 18:36, 6 January 2019 (UTC)
Dcflyer (talk) 21:38, 26 January 2019 (UTC)
Trilotat Trilotat (talk) 15:39, 16 February 2019 (UTC)
Mfchris84 (talk) 05:37, 18 April 2019 (UTC)
Salgo60 (talk)
Walkuraxx (talk) 14:58, 18 July 2019 (UTC)
NAH
FULBERT (talk) 17:14, 10 November 2019 (UTC)
Pictogram voting comment.svg Notified participants of WikiProject Source MetaData/More

The ISSN International Center (ISSN International Centre (Q12131129)), which maintains the ISSN Register (ISSN Portal (Q70460099)) (the most complete reference source in the world for the identification of serial publications) wishes to contribute a subset of its bibliographic database to Wikidata. This consists of (approximately) 1.100.000 entries (only current/not discontinued publications), with 10% having 2 or more ISSNs (typically, paper and electronic version). Currently, Wikidata contains approximately 94000 ISSN values.

Each serial notice is described with metadata from the set of metadata already freely available from the ISSN Portal :

  • ISSN identifier
  • ISSN-L identifier (the ISSN-L identifies the serial publication independantly of its medium, while distinct ISSNs are assigned to each medium)
  • Proper (original) title of the serial
  • Key (unambiguous) title of the serial
  • Country of publication
  • Distribution format (of a given ISSN)
  • URL (for online publications)
  • Language of the serial

(Note that the publisher is not part of the free data)

More facts and figures on ISSN and Wikidata can be found in this presentation.

The plan is to reconcile and send this data using OpenRefine.

Preliminary analysis has been conducted and this data is planned to be captured with the data model described in the following diagram. We are looking for feedback from the community before proceeding with the ingest. Although this is not a formal deadline, feedback is welcome until 4th of december (1 month).

ISSN data model-v4.png

A few notes / explanations :

  • the values for distribution formats are discussed here;
  • the language of the label (and title) will be derived from the language of the publication itself (this may not be an accurate assumption in 100% of the case)
  • the cancelled ISSNs are ISSNs that have been published, then cancelled; the plan is to store them as with a deprecated rank;
  • there is a single ISSN-L for each serial, but multiple ISSNs are possible (one for each medium)

Thanks for your comments

Tfrancart (talk) 21:39, 4 November 2019 (UTC)

--

  • Excellent news. Thanks for doing that. I think we should be able to gather feedback in less than a month. Just a few minor points:
  • I'm not entirely sure if OpenRefine is suitable for that many additions, but alternatively, you might want to look into QuickStatements.
  • I tend to agree about the suggestion made earlier of using country of origin (P495) for country of publication (if no city is available).
OK, we will consider that, thanks. Tfrancart (talk) 09:36, 5 November 2019 (UTC)
  • If there is a start date available, I think it would be good to include that.
ISSN has that in its registry, but it is not part of the free data. Tfrancart (talk) 09:36, 5 November 2019 (UTC)
Is there any chance of making dates part of the free data, either on Wikidata or at the public ISSN portal? The problem is that without them, it can be very hard to precisely define the entity being described. Wikidata has dates for many of the serials in its database, but I think they often disagree with the dates in the ISSN portal (in particular because Wiki* tends to lump slightly changed prior versions of the serial (including title and publisher changes) with the current version more often that ISSN and many library catalogs tend to. It's hard to match up serial entities from multiple data sources (such as Wiki* and the ISSN database) unless it's clear on both sides where they start and end. Thanks for considering this! JohnMarkOckerbloom (talk) 21:09, 5 November 2019 (UTC)
  • What happens when a publication change its name?
AFAIK, kf the name changes significantly, it gets a new ISSN. Tfrancart (talk) 09:36, 5 November 2019 (UTC)
  • Could you do a test run of 10 or so items?
Yes, we will do that. Tfrancart (talk) 09:36, 5 November 2019 (UTC)
@GZWDer, Magnus Manske: who work frequently with journals.--- Jura 21:51, 4 November 2019 (UTC)

--

wonderful news! Just so you know that it's an option, you could also make the import using wikibase-cli, and I would be more than happy to give support if needed. -- Maxlath (talk) 23:39, 4 November 2019 (UTC)
Thanks ! we will see what the best option for the tool is. The nice thing with OpenRefine is 1/ the ability to reconcile with serials already in Wikidata and having already an ISSN and 2/ (if I understand properly) the ability to not duplicate statements that are already present. How does wikidabse-cli performs on these 2 aspects ?. Tfrancart (talk) 09:36, 5 November 2019 (UTC)

--

I have just used OpenRefine to add a few thousand ISSN statements for academic journals and I think it works perfectly. Many academic journals have ISSN statements but no ISSN-L. Having ISSN-L allows less tedious queries.
I have no experience on the best data model for the "distribution format".
Do I understand correctly that you don't plan to add information on the publisher? That could be useful, especially historical information for publications which have changed publisher (relevant to identify reverse flipped journals) or whose publisher went bankrupt (relevant for digital preservation projects).
Correct, the publisher is not part of the free data in the ISSN register. Tfrancart (talk) 15:33, 5 November 2019 (UTC)
As a suggestion, I recommend that you start by adding the information you described above to the existing entities about periodicals. We have no shortage of incomplete entities; having information on their language, publisher, identifiers and URL unquestionably makes them more useful. After you've tested your data model and precision of reconciliation with existing items, it will be easier to proceed with the creation of new items. Only after reconciliation you will know exactly how many new items you need to create. Nemo 10:07, 5 November 2019 (UTC)
I take good note of this. We will separate addition of statements on existing entity and creation of new entities. Tfrancart (talk) 15:33, 5 November 2019 (UTC)

--

This is great, thanks! The data model diagram makes sense to me. I am assuming that there will only be one item created/updated per ISSN-L, you will not be creating separate items for each ISSN (name or format change), right? ArthurPSmith (talk) 13:21, 5 November 2019 (UTC)
Correct : one item per ISSN-L. Tfrancart (talk) 15:33, 5 November 2019 (UTC)

--

  • I agree that this is fine news. Here are some issues you might consider:
    1. ISSN number change when the title change. Wikidata follows, I believe, no overall scheme of determining when a serial should have a new item when there are name changes, splits or merges, so there might be issues around these things.
> This is one of the reason ISSN data will be limited to current publications only. Tfrancart (talk) 15:33, 5 November 2019 (UTC)
    1. It is unclear to me whether the place of publication is stable. Doesn't large international publishing houses move the publishing site around? For instance, NeuroImage (Q1981225) was initially published from Academic Press, San Diego AFAIR, now Wikidata says The Netherlands (perhaps erroneously), while ISSN and NLM portal says Orlando. I see in old editions that the editorial office is in San Diego while the publisher is in Orlando.
> The place of publication is (I think) the country of the latest/current publisher of the serial. Tfrancart (talk) 15:33, 5 November 2019 (UTC)
    1. Wikidata tend to merge the electronic version and the paper version in one item. As far as I read the model, that is also what the ISSN dump will do, and that a qualifier will be added to the two ISSNs that are typically associated with the serial Wikidata item.
> Correct, the plan is to have one item per ISSN-L (which is not as trivial as it seems on the ISSN Register side). Important Note : this means separate entries (with separate ISSN-L) will be created for the various regional editions of the same journal, or the different linguistic editions of the same journal; these entries will not be linked as these links are not part of what the ISSN will import. Tfrancart (talk) 15:33, 5 November 2019 (UTC)
Finn Årup Nielsen (fnielsen) (talk) 14:55, 5 November 2019 (UTC)

--

  • Agree with everybody above that this is really exciting news. It's great to see the ISSN International Center taking this proactive step.
  • Also agree with cautions noted above, to be careful with journals where there have been name changes / series changes / publisher changes. Wikidata doesn't necessarily deal with these well. I see you're going to restrict to only current publications with ISSNs, which may be a pity, because it may be older journals that can be particularly likely to have copyright-expired or copyright-waived copies, that it can also be particularly useful to be able to match identifying ISSNs to. However I can see it may make sense to start with the most straightforward case first. But beware of items for older runs of the journal, or for the whole history of the journal, that may need to be distinguished when it comes to matching.
  • Props for really having done your homework untangling the data modelling that's evolved, and publishing it above -- you now appear to have a clearer take on it than many of us may well have done!
  • I'd echo also what User:JohnMarkOckerbloom wrote, that there may be more efficient ways of going about this at such a scale than OpenRefine. In particular, it may be more efficient to write a query for all live periodicals currently without ISSNs, with accompanying side-data, and then try to match as much of that as you can offline first, rather than going straight to OpenRefine.
  • But mostly I just want to say Thank You for what has the makings of being a hugely valuable contribution. Jheald (talk) 14:51, 6 November 2019 (UTC)

Former names[edit]

  • Quote "This is one of the reason ISSN data will be limited to current publications only. "

Seems my question about name changes isn't relevant as only current names are being added. Depending how it goes, I think importing data about former publications (and former names) would be most helpful as well. How many would this be? --- Jura 10:23, 9 November 2019 (UTC)

Start date[edit]

  • Quote (1) "If there is a start date available, I think it would be good to include that." (by Jura1)
  • Quote (2) "ISSN has that in its registry, but it is not part of the free data. Tfrancart (talk) 09:36, 5 November 2019 (UTC)"
  • Quote (3) "Is there any chance of making dates part of the free data, either on Wikidata or at the public ISSN portal? The problem is that without them, it can be very hard to precisely define the entity being described. Wikidata has dates for many of the serials in its database, but I think they often disagree with the dates in the ISSN portal (in particular because Wiki* tends to lump slightly changed prior versions of the serial (including title and publisher changes) with the current version more often that ISSN and many library catalogs tend to. It's hard to match up serial entities from multiple data sources (such as Wiki* and the ISSN database) unless it's clear on both sides where they start and end. Thanks for considering this! JohnMarkOckerbloom (talk) 21:09, 5 November 2019 (UTC)"

Agree with that. --- Jura 10:26, 9 November 2019 (UTC)


Upload[edit]

@Tfrancart: do you need help with this? --- Jura 13:25, 20 January 2020 (UTC)

@Jura1: we are getting back on track with this. For internal reasons this was delayed, and the suggestion to not use OpenRefine but some other tools has required more analysis. Besides OpenRefine does not allow to create statements with a Deprecated rank. I am now considering Wikidata toolkit.

Followup on ISSN data import : questions and help needed[edit]

John Vandenberg (talk) 09:30, 2 December 2013 (UTC) Aubrey (talk) 12:15, 11 December 2013 (UTC) Daniel Mietchen (talk) 12:47, 11 December 2013 (UTC) Micru (talk) 13:09, 11 December 2013 (UTC) DarTar (talk) 01:37, 15 January 2014 (UTC) Maximilianklein (talk) 00:23, 28 March 2014 (UTC) Mvolz (talk) 08:10, 20 July 2014 (UTC) Andy Mabbett (Pigsonthewing); Talk to Andy 22:17, 27 July 2014 (UTC) Mattsenate (talk) 17:26, 14 August 2014 (UTC) author  TomT0m / talk page JakobVoss (talk) 14:25, 16 June 2016 (UTC) Mahdimoqri (talk) 08:04, 5 April 2018 (UTC) Jsamwrites Dig.log Sic19 (talk) 22:46, 12 July 2017 (UTC) Andreasmperu Nomen ad hoc Pete F (talk) 99of9 Mfchris84 (talk) 09:02, 26 November 2018 (UTC) Runner1928 (talk) 17:22, 1 December 2018 (UTC) Wittylama (talk) 09:55, 22 December 2018 (UTC) Jneubert (talk) 07:30, 22 February 2019 (UTC) --Juandev (talk) 20:28, 27 April 2019 (UTC) VIGNERON (talk) Uomovariabile (talk to me) 08:46, 24 June 2019 (UTC) SilentSpike (talk) Ecritures (talk) Tfrancart (talk) Dick Bos (talk) 10:47, 30 January 2020 (UTC) Pictogram voting comment.svg Notified participants of WikiProject Periodicals and Mattsenate (talk) 13:11, 8 August 2014 (UTC)
KHammerstein (WMF) (talk) 13:15, 8 August 2014 (UTC)
Mitar (talk) 13:17, 8 August 2014 (UTC)
Mvolz (talk) 18:07, 8 August 2014 (UTC)
Daniel Mietchen (talk) 18:09, 8 August 2014 (UTC)
Merrilee (talk) 13:37, 9 August 2014 (UTC)
Pharos (talk) 14:09, 9 August 2014 (UTC)
DarTar (talk) 15:46, 9 August 2014 (UTC)
HLHJ (talk) 09:11, 11 August 2014 (UTC)
Blue Rasberry (talk) 18:02, 11 August 2014 (UTC)
Micru (talk) 20:11, 12 August 2014 (UTC)
JakobVoss (talk) 12:23, 20 August 2014 (UTC)
Finn Årup Nielsen (fnielsen) (talk) 02:06, 23 August 2014 (UTC)
Jodi.a.schneider (talk) 09:24, 25 August 2014 (UTC)
Abecker (talk) 23:35, 5 September 2014 (UTC)
Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 14:21, 24 October 2014 (UTC)
Mike Linksvayer (talk) 23:26, 18 October 2014 (UTC)
Kopiersperre (talk) 20:33, 20 October 2014 (UTC)
Jonathan Dugan (talk) 21:03, 20 October 2014 (UTC)
Hfordsa (talk) 19:26, 5 November 2014 (UTC)
Vladimir Alexiev (talk) 15:09, 23 January 2015 (UTC)
Runner1928 (talk) 03:25, 6 May 2015 (UTC)
Pete F (talk)
econterms (talk) 13:51, 19 August 2015 (UTC)
Sj (talk)
author  TomT0m / talk page
guillom (talk) 21:57, 4 January 2016 (UTC)
·addshore· talk to me! 17:43, 18 January 2016 (UTC)
Bodhisattwa (talk) 16:08, 29 January 2016 (UTC)
Ainali (talk) 16:51, 29 January 2016 (UTC)
Shani Evenstein (talk) 21:29, 5 July 2018 (UTC)
Skim (talk) 07:17, 6 November 2018 (UTC)
PKM (talk) 23:19, 19 November 2018 (UTC)
Ocaasi (talk) 22:19, 29 November 2018 (UTC)
Trilotat Trilotat (talk) 15:43, 16 February 2019 (UTC)
NAH
Iwan.Aucamp
Pictogram voting comment.svg Notified participants of WikiProject Source MetaData and LeadSongDog (talk) 21:42, 23 March 2016 (UTC)
RobLa-WMF (talk) 01:24, 25 March 2016 (UTC)
Kosboot (talk) 20:45, 30 March 2016 (UTC)
Sydney Poore/FloNight♥♥♥♥ 15:10, 14 April 2016 (UTC)
Peaceray (talk) 18:40, 28 April 2016 (UTC)
PKM (talk) 16:29, 1 May 2016 (UTC)
Aubrey (talk) 12:42, 25 August 2016 (UTC)
Chiara (talk) 12:47, 25 August 2016 (UTC)
Marchitelli (talk) 19:02, 1 September 2016 (UTC)
YULdigitalpreservation (talk) 17:44, 9 December 2016 (UTC)
Satdeep Gill (talk) 14:59, 2 February 2017 (UTC)
Pintoch (talk) 09:44, 28 February 2017 (UTC)
Raymond Ellis (talk) 16:06, 1 April 2017 (UTC)
Crazy1880 (talk) 18:21, 16 June 2017 (UTC)
T Arrow (talk) 07:55, 22 June 2017 (UTC)
GerardM (talk) 08:25, 30 July 2017 (UTC) With a particular interest of opening up sources about Botany and opening up any freely licensed publications.
Clifford Anderson (talk) 18:26, 11 August 2017 (UTC)
Jsamwrites (talk) 07:52, 27 August 2017 (UTC)
Krishna Chaitanya Velaga (talk) 09:52, 19 September 2017 (UTC)
Capankajsmilyo (talk) 18:32, 19 September 2017 (UTC)
Hsarrazin (talk) 20:41, 15 October 2017 (UTC)
Mlemusrojas (talk) 10:15, 6 December 2017 (UTC)
Samat (talk)
Ivanhercaz Plume pen w.png (Talk) 20:27, 25 December 2017 (UTC)
Simon Cobb (User:Sic19 - talk page) 21:20, 21 January 2018 (UTC)
Mahdimoqri (talk) 20:22, 26 March 2018 (UTC)
Maria zaos (talk) 18:45, 9 April 2018 (UTC)
Jaireeodell (talk) 14:07, 23 April 2018 (UTC)
Egon Willighagen (talk) 12:29, 10 May 2018 (UTC)
RobinMelanson (talk) 2:13, 25 November 2018 (UTC)
Vladimir Alexiev (talk) 03:02, 4 December 2018 (UTC) interested, in particular because of TRR project https://m.wikidata.org/wiki/Q56259739
Maxlath (talk) 18:36, 6 January 2019 (UTC)
Dcflyer (talk) 21:38, 26 January 2019 (UTC)
Trilotat Trilotat (talk) 15:39, 16 February 2019 (UTC)
Mfchris84 (talk) 05:37, 18 April 2019 (UTC)
Salgo60 (talk)
Walkuraxx (talk) 14:58, 18 July 2019 (UTC)
NAH
FULBERT (talk) 17:14, 10 November 2019 (UTC)
Pictogram voting comment.svg Notified participants of WikiProject Source MetaData/More and @Jura1: and @VIGNERON:

Hello

Followup on ISSN data import. I have manually entered 2 notices following the above proposed data model from ISSN register : Europa medicophysica (Q27720997) and Q84021148. This triggered some questions :

Thanks and Best Regards Tfrancart (talk) 09:14, 30 January 2020 (UTC)

Just a partial feedback:
  • In these items, I don't see an advantage to add a reference to the P31 statement.
  • If a more specific value is already available (as at Q27720997), please don't add serial (Q2217301). You could just skip P31 for any existing item.
  • On labels/titles like Q84021148, can you suppress the trailing "." ?
  • I added the new formatter url for ISSN-L (P7363).
  • If you prefer to work with Open Refine, you could either first skip retired ISSNs or add these with normal rank, but ask another bot operator to adjust the ranks. If more information about the reason for their discontinuation is available, maybe normal rank with "end cause" or "end date" is appropriate. Deprecated rank is for values that were never really correct.
It might be worth doing 50 items as a more extended test. --- Jura 10:06, 30 January 2020 (UTC)

Tables with information on periodical copyright and free issue availability[edit]

I've been working at the Penn Libraries on compiling information on "deep backfiles" of periodicals, to help surface what is and what can be made freely available online from them. I've been using Wikidata and various publisher manifests to compile basic information on serials with free online content, or that were publishing prior to 1964 (when, at least in the US, rightsholders had to actively register and renew copyrights, and many periodical publishers and authors did not when required). This is the main page for the project, and here's its initial announcement.

I'm grateful to the Wiki* community for compiling much of the data in this project, and I invite folks to research and send me more data about serial copyrights and online availability. (Instructions for finding and contributing basic copyright information can be found in the announcement post.) I'm currently linking the data I'm compiling on periodicals with their corresponding Wikidata entries (using the P5396 property for the links from Wikidata, and links going the other way from our data pages). It may eventually make sense to copy or migrate some of the data I'm compiling directly into Wikidata. I haven't made steps to do that yet, but the data is CC0, so feel free to reuse it as you see fit.

My initial tables were built for the offerings of various publishers and aggregators (e.g. Elsevier, JSTOR, Taylor and Francis, etc.) I've recently built another table, though, based on the periodicals that have articles about them in English Wikipedia-- not just journals, but also other periodicals that Wikidata identifies as going back far enough to potentially be at least partly out of copyright, or that have free online issues that I know of. I plan to add that table to our Deep Backfile page soon, but here's an early look at it. It's quite large-- there are over 10,000 periodicals written about in English Wikipedia that are eligible for listing! Please let me know if you find it useful or if you have any suggestions for improving it. I'll also happily add any additional data you contribute via the "Contact us" links.

I'd also like to see if there's other data this community has already compiled that can be included in the knowledge base. There are links to each serial's Wikidata record already, but if there's a good way to automatically slurp up information on free issue availability based on Wikidata or other Wiki* information, I'd be happy to hear about it. (I'm looking to link to complete, copyright-cleared issues, volumes, and runs, rather than individual articles.)

Thanks! JohnMarkOckerbloom (talk) 14:08, 6 November 2019 (UTC)

Oh, thank you for posting here too! I just came to link your article. :) So, to clarify, an item will be added to your table only if it has both an English Wikipedia entry and a Wikidata entry? If your objective it to have a free text article you can link, you could also link non-English Wikipedia articles as well,
An interesting source to merge is the ISSN registry being discussed above, given your table shows many missing ISSN codes. To show availability, maybe we should link/import data from https://fatcat.wiki/ . BHL is collecting such information for many biology-related publications. For recent journals, Unpaywall has recently changed their detection of fully-OA journals and we could import that together with DOAJ data and others.
Isn't the country of (original) publication relevant too, if you want to establish public domain status? Copyright registration doesn't matter for non-USA periodicals, usually.
Finally, do you plan to provide help hosting copies of those serials if they're in the public domain but not hosted anywhere yet? The Internet Archive has thousands of journals and millions of articles but they don't necessarily want to host everything (yet). I'm sad to see your repository was bought by Elsevier, but it would be trivial to create a separate DSpace repository à la DeepBlue for public domain submissions, if you're interested. Nemo 17:08, 10 November 2019 (UTC)
Thanks Nemo for your comments and questions! My responses:
I'm focusing on English Wikipedia here because that's already plenty big (10,000 serials, as I mention), and because the copyright information I'm gathering is most relevant to American serials (for reasons you mention), and I expect English Wikipedia would be the Wiki most interested in those. Some older American serials are indeed covered in non-English WP but not English WP, but if anyone creates an English Wikipedia article for any of these and it meets the data criteria, it'll automatically show up in the table not long after.
The countries of publication are indeed important, and I'm noting in my copyright files when serials originate outside the US, for the reasons you indicate. (The notes are currently free text, but they're regular enough that I might make these structured data fields if that seems warranted.) In some cases, content of these journals is still subject to US notice and renewal requirements, so it's still worth listing them. The main such cases are (a) foreign serials that can still be considered "published" in the US for the purposes of US copyright law, which might include ones that (for instance) regularly marketed and distributed to US subscribers; and (b) contributions to foreign serials that were authored by Americans. And of course really old foreign serials can also be public domain in the US due to sheer age.
We're not currently set up at Penn to host large quantities of digitized serials, but I'll let you know if that changes in the future. We do have a digitization unit, but the output from it that we self-host is mostly from our rare books collections. We do, however, send digitizations of things from our general collection to places like HathiTrust and the Internet Archive. In my own listings, I'm happy to point to copyright-cleared issues at either of those places, or at Wikisource, or anywhere else that will provide stable, user-friendly hosting for them. JohnMarkOckerbloom (talk) 15:24, 11 November 2019 (UTC)