Property talk:P3242

From Wikidata
Jump to navigation Jump to search

Documentation

SIC code
U.S. Standard Industrial Classification number for industries and economic activities
[create Create a translatable help page (preferably in English) for this property to be included here]
Format “\d{2}[1-9]{0,2}|[A-K]: value must be formatted using this pattern (PCRE syntax). (Help)
List of violations of this constraint: Database reports/Constraint violations/P3242#Format, hourly updated report, SPARQL
Allowed entity types are Wikibase item (Q29934200): the property may only be used on a certain entity type (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P3242#Entity types
Scope is as main value (Q54828448): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P3242#Scope, SPARQL
Type “SIC industry classification (Q123186574): item must contain property “instance of (P31)” with classes “SIC industry classification (Q123186574)” or their subclasses (defined using subclass of (P279)). (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P3242#Type Q123186574, SPARQL

scope and/or datatype[edit]

Kopiersperre Jklamo ArthurPSmith S.K. Givegivetake fnielsen rjlabs ChristianKl Vladimir Alexiev Parikan User:Cardinha00 MB-one User:Simonmarch User:Jneubert Mathieudu68 User:Kippelboy User:Datawiki30 User:PKM User:RollTide882071 Andber08 Sidpark SilentSpike Susanna Ånäs (Susannaanas) User:Johanricher User:Celead User:Finnusertop cdo256 Mathieu Kappler RShigapov User:So9q User:1-Byte pmt Rtnf econterms Dollarsign8 User:Izolight maiki c960657 User:Automotom applsdev Bubalina Fordaemdur

Notified participants of WikiProject Companies

Somehow this isn't being used as planned. Instead of being added to sectors, it ended up on companies. --- Jura 20:52, 9 January 2022 (UTC)[reply]

  • Yes, this looks bad. @James Hare (NIOSH): - how do you think we should proceed? BorkedBot (talkcontribslogs) seems to have entered at least some of these (ones I looked at). One option is to just redefine this property according to the way it's being used, but that would mean changing the datatype to String from External ID. Or we could propose a new property for applying the SIC code to a company as has been done, and migrate the existing uses. ArthurPSmith (talk) 14:28, 10 January 2022 (UTC)[reply]
It's probably easier to create a new property for organizations (with string datatype) and convert most uses to that. --- Jura 12:13, 18 January 2022 (UTC)[reply]

Above the ids if you someone wants to use them. --- Jura 10:22, 1 March 2022 (UTC)[reply]

DnB extension?[edit]

DnB uses an extension with 8-digit codes: http://www.dnb.com/content/dam/english/dnb-solutions/sales-and-marketing/sic_8_digit_codes.xls. Is it also covered by this property? Examples:

If someone wanted to import these I'd be fine. I doubt we already have items for most of these since they are so specific. We could use a qualifier to distinguish them. Though the only property that comes to mind would be authority (P797) which isn't great. BrokenSegue (talk) 16:02, 17 January 2022 (UTC)[reply]

SIC Wikidata structure documentation[edit]

SIC industry classifications are now modeled on Wikidata. The information is drawn from a Census Bureau spreadsheet and SIC Manuals.

  • Each Wikidata item represents a unique classification definition, not a unique code number. Note that a single definition may have multiple codes (at the same time or at different times), and a single code may have multiple definitions (at different times), as detailed below.
  • Each classification has its own item with a instance of (P31): SIC industry classification (Q123186574) statement. These are separate from any existing "general" item about a specific industry, as the SIC definitions are specific; they should not be merged, but may be linked in some way if appropriate.
  • SIC codes are hierarchical. Two-digit codes are the broadest, with four-digit codes being the most specific. There are also high-level "divisions" represented by a letter; these contain sets of two-digit codes, but sometimes do not correspond to the first digit.
    • These are linked with part of (P361)/has part(s) (P527) statements.
    • In some cases, a node has only one child, both with the same title and definition. In these cases, all such codes are listed on a single Wikidata item.
  • SIC codes were revised periodically, with the 1987 version being the final one.
    • The 1987 SIC codes (which are valid on and after that date) are fully represented in Wikidata.
    • All SIC codes that were abolished in 1987 or 1977, and some abolished in 1972, are also represented. If they had the same title or definition as a later code, they are both listed in the same item with start time (P580) and end time (P582) qualifiers, regardless of any splits or merges that happened. (This is different than how NAICS codes are modeled.) Otherwise, a new item was created with a dissolved, abolished or demolished date (P576) statement, as well as replaced by (P1366) and/or merged into (P7888) statements.
    • No attempt has been made to otherwise represent the structure of merges, splits, or title or definition changes. There is no guarantee that any 1987 code existed or had the same definition before 1987.
    • There are also replaced by (P1366) statements linking to the newer NAICS industry classifications.
  • If more than one Wikidata item has the same SIC code, there is also a separate item with an additional instance of (P31): conflation (Q14946528) statement and "[unspecified definition]" in the label. This is to assist for making lookup tables and for cases where the date is not clear.

John P. Sadowski (NIOSH) (talk) 05:07, 1 November 2023 (UTC)[reply]