Property talk:P1065

From Wikidata
Jump to navigation Jump to search

Documentation

archive URL
URL to the archived web page specified with URL property
DescriptionThe URL in the same reference but archived on Archive.org or some other archive service (webcite.org, archive.is).
Data typeURL
Template parameteren:Template:Cite field "archiveurl"
Domainany item having an URL property (note: this should be moved to the property statements)
Allowed values
According to this template: any link to any web page
According to statements in the property:
https?:\/\/(web\.archive\.org\/web\/[0-9]{14}|archive\.org\/(details|download)|archive\.(today|is|fo|li|ph|vn)\/[0-9]{4}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}|webrecorder\.io|(www\.)?webcitation\.org|perma\.cc|perma\-archives\.org|([^\/]+\.)?megalodon\.jp|archive\.wikiwix\.com|(webarchive|yourarchives)\.nationalarchives\.gov\.uk|(pandora|trove|webarchive|content\.webarchive)\.nla\.gov\.au|webarchive\.loc\.gov|media\.digitalarkivet\.no|arquivo\.pt|(swap|sul-swap-prod|purl)\.stanford\.edu|wayback\.archive-it\.org)\/.+|https?:\/\/archive\.(today|is|fo|li|ph|vn)\/[a-zA-Z0-9]{4,}|https:\/\/webcache\.googleusercontent\.com\/search\?q\=cache:|https?:\/\/wayback\.archive-it\.org\/[0-9]+\/[0-9]{14}
When possible, data should only be stored as statements
ExampleDisturbed's Draiman on Band's Hiatus: 'It's the Right Time to Step Away' (Q67086087) → http://web.archive.org/web/20130312103925/http://www.billboard.com/articles/news/468770/disturbeds-draiman-on-bands-hiatus-its-the-right-time-to-step-away
Sourceany archived url on Wikipedia sources (note: this information should be moved to a property statement; use property source website for the property (P1896))
Robot and gadget jobsMaybe if a bot can crawl archive.org we can add them in mass
See alsofull work available at URL (P953), archive date (P2960), reference URL (P854)
Lists
Proposal discussionProposal discussion
Current uses
Total87,256
Main statement1,5291.8% of uses
Qualifier41,41747.5% of uses
Reference44,31050.8% of uses
Search for values
[create Create a translatable help page (preferably in English) for this property to be included here]
Scope is as qualifier (Q54828449), as reference (Q54828450): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist.
List of this constraint violations: Database reports/Constraint violations/P1065#scope, SPARQL, SPARQL (new)
Format “https?:\/\/(web\.archive\.org\/web\/[0-9]{14}|archive\.org\/(details|download)|archive\.(today|is|fo|li|ph|vn)\/[0-9]{4}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}[\-\.]?[0-9]{2}|webrecorder\.io|(www\.)?webcitation\.org|perma\.cc|perma\-archives\.org|([^\/]+\.)?megalodon\.jp|archive\.wikiwix\.com|(webarchive|yourarchives)\.nationalarchives\.gov\.uk|(pandora|trove|webarchive|content\.webarchive)\.nla\.gov\.au|webarchive\.loc\.gov|media\.digitalarkivet\.no|arquivo\.pt|(swap|sul-swap-prod)\.stanford\.edu)\/.+|https?:\/\/archive\.(today|is|fo|li|ph|vn)\/[a-zA-Z0-9]{4,}: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist.
List of this constraint violations: Database reports/Constraint violations/P1065#Format, SPARQL, SPARQL (new)
This property is being used by:

Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.)

Discussion label change[edit]

I am cleaning https://www.wikidata.org/wiki/Wikidata:Database_reports/Constraint_violations/P1065#.22Qualifier.22_violations and notice many uses of 'archive URL' (P1064) where the contributor intention was to provide link to a webpage with the full text of the work in question. I am adding 'full text available at' (P953) with the information but if someone has a suggestion about how make the label of P1064 state the use as a link to an archived version of another webpage more clearly I would appreciate. Carlos Porto (talk) 01:05, 16 October 2015 (UTC)

I give 'archived at' alias to full work available at URL (P953) so people that search for the 'archive' string can find that option, and I think it would be better to change 'URL archived' from alias to label of this property and keep current label 'archive URL' as an alias. Carlos Porto (talk) 02:38, 16 October 2015 (UTC)

Using archive URL under references?[edit]

The documentation/talk page of this says this can only be used as a property. I am confused if this can be used under references? For example reference URL (Property:P854) is added as a reference to a statement. Then along with it is archive URL containing the archived link. (The Talk page of Talk page of full text available at (Property:P953) says its Domain must be "creative work (intended for songs, albums, and artists)", so I doubt I will use this because the type of item I am trying to add the archive URL is a place). Sanglahi86 (talk) 08:58, 13 October 2016 (UTC)

New property for dead links[edit]

Is there a boolean property alongside this to specify that the URL is dead? Not all URLs with an archive version specified are dead. --Valerio Bozzolan (talk) 22:58, 1 December 2016 (UTC)

There was some discussion about this at Wikidata:Property proposal/dead-url. Seems too much text to figure out what is the outcome.
--- Jura 23:02, 1 December 2016 (UTC)

Restricted to qualifiers?[edit]

Please pay attention to the archive date (P2960); it is limited to Source section. d1g (talk) 10:09, 2 January 2017 (UTC)

Which version?[edit]

If I am looking to the web archive, which version should I use? Point out that some webs are updated in time, so if I ad P1065 now, and nobody will update it in the future, the archived web page might provide an only old version of the page. --Juandev (talk) 11:25, 19 September 2019 (UTC)

Change that restricts this property to a few archives[edit]

I wonder about the rationale for changing this to only be valid for a handful of archives, P1793 is added here [1] by Jc86035, this makes it fail for a lot of national archives. I would say the constraint is wrong and should be removed. Jeblad (talk) 15:31, 12 January 2020 (UTC)

@Jeblad: This is the discussion where I justified adding it. National archives are barely used compared to the Wayback Machine (which now accounts for the vast majority of archive links used across the Wikimedia projects), so I think I thought it was an acceptable compromise to use a whitelist. (I've changed it to a suggestion constraint for now.) By all means, change the constraint if you think it's worth improving it. Jc86035 (talk) 16:27, 12 January 2020 (UTC)
@Jc86035: It should not be changed, it should be removed. Create a new property for web archives if you find it useful, don't mess with an existing property. As it is now this property is pretty much useless for archives, which is not the same as a web archive. Jeblad (talk) 16:32, 12 January 2020 (UTC)
@Jeblad: Not all online archives are appropriate to link to, especially for the Wikipedias (e.g. self-hosted and self-published content). Properties and their constraints are changed all the time, and there isn't a Wikidata policy that restricts doing so (though it's possible that that should be changed). There are fewer than 200 format constraint violations right now, so the property is still at the level where these can be fixed by hand (either by updating the property constraints or by updating the items).
The national archives that I originally included in the format constraint were just the few that were the most commonly used on the English Wikipedia. The main reason I didn't add any more was that I didn't think it was necessary to get to zero constraint violations (there are still a few dozen that could be fixed trivially). I've added the Norwegian government's archive site for now, but more of them should definitely be added. Jc86035 (talk) 16:50, 12 January 2020 (UTC)
@Jc86035: This will create a nightmare to maintain, and will not work. Jeblad (talk) 17:02, 12 January 2020 (UTC)
@Jeblad: Which is probably more or less why I didn't add all of the archives in the first place (I don't remember). The filtering makes it actually possible to check issues with some of the statements, which is more or less the exact purpose of a suggestion constraint. It shouldn't prevent users from adding archives that aren't listed in the constraint. Jc86035 (talk) 17:11, 12 January 2020 (UTC)
@Jc86035: Don't add constraints that isn't actionable, that only create noise. Jeblad (talk) 17:36, 12 January 2020 (UTC)

Can someone fix the regex and constraint?[edit]

This is a valid Wayback Machine URL for an archived blog: https://web.archive.org/web/*/https://blog.numerade.com/

But it is getting a warning that it doesn't conform to the regex. Can someone revise the regex and constraint so that a URL like this is correct? I think the issue is the asterisk in the URL? UWashPrincipalCataloger (talk) 01:34, 11 November 2020 (UTC)