Property talk:P5337

From Wikidata
Jump to navigation Jump to search

Documentation

Google News topics ID
identifier for a subject in the news on Google News
[create Create a translatable help page (preferably in English) for this property to be included here]
Single value: this property generally contains a single value. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303). Known exceptions: mathematics (Q395)
List of violations of this constraint: Database reports/Constraint violations/P5337#Single value, SPARQL
Distinct values: this property likely contains a value that is different from all other items. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5337#Unique value, SPARQL (every item), SPARQL (by value)
Format “CAA[a-zA-Z0-9_]{10,80}: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5337#Format, SPARQL
Allowed entity types are Wikibase item (Q29934200): the property may only be used on a certain entity type (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5337#Entity types
Format “CAA.+|: value must be formatted using this pattern (PCRE syntax). (Help)
List of violations of this constraint: Database reports/Constraint violations/P5337#Format, hourly updated report, SPARQL
Scope is as main value (Q54828448): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5337#Scope, SPARQL
Label required in languages: en: Entities using this property should have labels in one of the following languages: en (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5337#Label in 'en' language, search, SPARQL
Pattern ^https://news\.google\.com/topics/(CAA[a-zA-Z0-9_]{10,80})\?.+$ will be automatically replaced to \1.
Testing: TODO list

About the label and "Single value"[edit]

  1. It may be better if named "Google News subject ID".
  2. I found that the topic ID is language-dependent, so Giant panda (Q33602) may be Giant panda, 大熊猫 (For search result in Chinese), ジャイアントパンダ (Japanese) and more. These topics seem to be the same, the topic label depends on the subject ID, and the content is in accordance with the display language.

--YFdyh000 (talk) 12:26, 19 June 2018 (UTC)[reply]

Perhaps it'll be better if this property allows multiple values, but each value must have a qualifier for language...
Btcprox (talk) 08:34, 9 January 2019 (UTC)[reply]

How to find?[edit]

What is the easiest way to figure out the topic ID from a search? Ainali (talk) 12:25, 7 August 2020 (UTC)[reply]

@Ainali:, topic id is directly computed from Google Knowledge Graph ID (P2671) and Freebase ID (P646), so there are literally millions of automatically generated topics for each entity. This also means that there are millions of topics without a single news piece (yet or ever). You can use [1] or [2] to get topic ids (first link is slightly better, as it generates links with more common language code "en", instead of "en-US"). Sometimes it is very language+region dependent, for example Kyoko Fukada has 1 article in English, many in Japanese, none in Russian (but label is translated). So there are 3 ways to deal with it:
  1. Visit related RSS-feeds in English, and add topics, if topic is popular (but what is popular?)
  2. Occam's razor: delete this property, as Wikidata does not add any useful information with it.
  3. Build a middleware, which extracts all RSS-feeds, sorts them by popularity and provides links to most popular ones.
--Lockal (talk) 09:48, 25 January 2021 (UTC)[reply]