Property talk:P6597

From Wikidata
Jump to navigation Jump to search

Documentation

Digital Dictionary of Surnames in Germany ID
ID of corresponding entry in the DFD online dictionary of family names
Associated itemDigital Dictionary of Surnames in Germany (Q61889795)
Applicable "stated in" valueDigital Dictionary of Surnames in Germany (Q61889795)
Data typeExternal identifier
Domainfamily name (Q101352)
Allowed values[1-9][0-9]{0,5}|10[0-9]{5}
ExampleMüller (Q8157228)1
Conrad (Q12652033)285
Frenkel (Q420404)8988
von Poppen (Q83384239)1095930
Sourcehttps://www.namenforschung.net/en/dfd/dictionary/list/
https://www.namenforschung.net/dfd/woerterbuch/liste/
Formatter URLhttps://www.namenforschung.net/id/name/$1
See alsoGéopatronyme ID (P3370), Geneanet family name ID (P9644)
Lists
Proposal discussionProposal discussion
Current uses
Total59,631
Main statement55,316 out of 70,087 (79% complete)92.8% of uses
Qualifier4<0.1% of uses
Reference4,3117.2% of uses
Search for values
Explanations [Edit]

The Digital Dictionary of Surnames in Germany (Q61889795) publishes new entries every two weeks, on the first and fifteenth day of the month. With every update, two lists are also updated on the DFD server:

Both lists are formatted like the upload format for M’n’M (not yet adjusted to the M’n’M update of February 2021). The Mix’n’match catalog will soon be updated every two weeks from the first list.

Additional matching is possible with an external workflow, which produces a list ready for QuickStatements. This workflow is intended to only make unequivocal 1:1 matches and avoid constraint violations.
Single value: this property generally contains a single value. (Help)
List of violations of this constraint: Database reports/Constraint violations/P6597#Single value, hourly updated report, SPARQL
Distinct values: this property likely contains a value that is different from all other items. (Help)
List of violations of this constraint: Database reports/Constraint violations/P6597#Unique value, hourly updated report, SPARQL (every item), SPARQL (by value)
Format “[1-9][0-9]{0,5}|10[0-9]{5}: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Format, SPARQL
Item “native label (P1705): Items with this property should also have “native label (P1705)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P1705, SPARQL
Item “writing system (P282): Latin script (Q8229): Items with this property should also have “writing system (P282): Latin script (Q8229)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P282, search, SPARQL
Item “instance of (P31): family name (Q101352): Items with this property should also have “instance of (P31): family name (Q101352)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P31, search, SPARQL
Type “family name (Q101352): item must contain property “instance of (P31)” with classes “family name (Q101352)” or their subclasses (defined using subclass of (P279)). (Help)
List of violations of this constraint: Database reports/Constraint violations/P6597#Type Q101352, hourly updated report, SPARQL
Allowed entity types are Wikibase item (Q29934200), Wikibase lexeme (Q51885771): the property may only be used on a certain entity type (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Entity types
Scope is as main value (Q54828448), as reference (Q54828450): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Scope, SPARQL
Item “language of work or name (P407): Items with this property should also have “language of work or name (P407)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P407, search, SPARQL
Lexeme language: German (Q188): this property should only be applied to lexemes with these languages (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#language
Label required in languages: de: Entities using this property should have labels in one of the following languages: de (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'de' language, search, SPARQL
Label required in languages: en: Entities using this property should have labels in one of the following languages: en (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'en' language, search, SPARQL
Label required in languages: es: Entities using this property should have labels in one of the following languages: es (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'es' language, search, SPARQL
single native label
there should be just one native label (P1705) statement on items (Help)
Violations query: SELECT ?item (COUNT(DISTINCT ?st) as ?count) (COUNT(DISTINCT str(?nl)) as ?count2) (GROUP_CONCAT(DISTINCT str(?nl); separator=", ") as ?nls) WHERE { ?item wdt:P6597 [] . ?item p:P1705 ?st . ?st ps:P1705 ?nl . } GROUP BY ?item HAVING (?count2 > 1) ORDER BY DESC(?count2) ?item LIMIT 500
List of this constraint violations: Database reports/Complex constraint violations/P6597#single native label
en label should match P1705 value
check native label (P1705) and label in English (en) (Help)
Violations query: SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"en") as ?en_label) FILTER NOT EXISTS { ?item rdfs:label ?en_label } }
List of this constraint violations: Database reports/Complex constraint violations/P6597#en label should match P1705 value
de label should match P1705 value
check native label (P1705) and label in German (de) (Help)
Violations query: SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"de") as ?de_label) FILTER NOT EXISTS { ?item rdfs:label ?de_label } }
List of this constraint violations: Database reports/Complex constraint violations/P6597#de label should match P1705 value
es label should match P1705 value
check native label (P1705) and label in Spanish (es) (Help)
Violations query: SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"es") as ?es_label) FILTER NOT EXISTS { ?item rdfs:label ?es_label } }
List of this constraint violations: Database reports/Complex constraint violations/P6597#es label should match P1705 value
labels in several languages
labels would generally be available in several languages (Help)
Violations query: SELECT ?item ?nl (COUNT(*) as ?count) { ?item wdt:P6597 ?value . ?item rdfs:label ?l . ?item wdt:P1705 ?nl . } GROUP BY ?item ?nl HAVING ( ?count < 9)
List of this constraint violations: Database reports/Complex constraint violations/P6597#labels in several languages
en description to include "family name"
Default English language description for Latin script family names is "family name" (Help)
Violations query: SELECT ?item ?en_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item schema:description ?en_desc . FILTER( lang(?en_desc)="en" && !CONTAINS( ?en_desc, "family name" ) ) ?item wdt:P282 wd:Q8229 }
List of this constraint violations: Database reports/Complex constraint violations/P6597#en description to include "family name"
de description to include "Familienname"
Default German language description for Latin script family names is "Familienname" (Help)
Violations query: SELECT ?item ?de_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item schema:description ?de_desc . FILTER( lang(?de_desc)="de" && !CONTAINS( ?de_desc, "Familienname" ) ) ?item wdt:P282 wd:Q8229 }
List of this constraint violations: Database reports/Complex constraint violations/P6597#de description to include "Familienname"
ja description to include P1705 value
Japanese description format is generally "姓 (<P1705 value>)" (Help)
Violations query: SELECT ?item ?nl ?ja_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . ?item schema:description ?ja_desc . FILTER(lang(?ja_desc)="ja" && !CONTAINS( ?ja_desc, ?nl ) ) ?item wdt:P282 wd:Q8229 . } LIMIT 200
List of this constraint violations: Database reports/Complex constraint violations/P6597#ja description to include P1705 value
ru description to include P1705 value
Russian description format is generally "фамилия - <P1705 value>" (Help)
Violations query: SELECT ?item ?nl ?ru_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . ?item schema:description ?ru_desc . FILTER(lang(?ru_desc)="ru" && !CONTAINS( ?ru_desc, ?nl ) ) ?item wdt:P282 wd:Q8229 . } LIMIT 200
List of this constraint violations: Database reports/Complex constraint violations/P6597#ru description to include P1705 value
ru alias to include P1705 value
Native label should be an alias in Russian (Help)
Violations query: SELECT ?item ?nl ?alt { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl), "ru") as ?alt) FILTER NOT EXISTS { ?item skos:altLabel ?alt } } LIMIT 200
List of this constraint violations: Database reports/Complex constraint violations/P6597#ru alias to include P1705 value
Most frequent P734 values without property (Germany, nat)
family names used as P734 values, but without a P6597 statement. Selection by nationalities (Help)
Violations query: SELECT ?item ?l ?count ?sample WITH { SELECT ?item (COUNT(DISTINCT ?p) as ?count) (SAMPLE(?p) as ?sample) { VALUES ?c { wd:Q183 wd:Q16957 wd:Q713750 } ?p wdt:P27 ?c . hint:Prior hint:rangeSafe true . ?p wdt:P734 ?item . ?p wdt:P31 wd:Q5 . } GROUP BY ?item HAVING ( ?count > 20 ) } as %a WHERE { INCLUDE %a FILTER NOT EXISTS { ?item wdt:P6597 [] } ?item rdfs:label ?l . FILTER(lang(?l) = "de" ) } ORDER BY DESC(?count) LIMIT 100
List of this constraint violations: Database reports/Complex constraint violations/P6597#Most frequent P734 values without property (Germany, nat)
Most frequent P734 values without property (Germany, POB)
family names used as P734 values, but without a P6597 statement. Selection by place of birth (Help)
Violations query: SELECT ?item ?l ?count ?sample WITH { SELECT ?item (COUNT(DISTINCT ?p) as ?count) (SAMPLE(?p) as ?sample) { ?p wdt:P19 / wdt:P17 wd:Q183 . hint:Prior hint:rangeSafe true . ?p wdt:P734 ?item . ?p wdt:P31 wd:Q5 . } GROUP BY ?item HAVING ( ?count > 20 ) } as %a WHERE { INCLUDE %a FILTER NOT EXISTS { ?item wdt:P6597 [] } ?item rdfs:label ?l . FILTER(lang(?l) = "de" ) } ORDER BY DESC(?count) LIMIT 100
List of this constraint violations: Database reports/Complex constraint violations/P6597#Most frequent P734 values without property (Germany, POB)

Statistics

[edit]

Names used in family name (P734)

[edit]

Frequency of uses of names as family name (P734)-values in Wikidata.

This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!

WDQS | PetScan | TABernacle | Find images | Recent changes
range names total_items sample
0 21431 0 Kopfmann
1 7671 7671 Ayik
2-4 11194 34407 Krobot
5-9 3979 28936 Wrubel
10+ 7256 157152 Wörndle
50+ 1545 107252 Dolinar
100+ 1662 350523 Link
500+ 277 195189 Stahl
1000+ 249 496877 Johnston
5000+ 20 126969 Martin
10000+ 5 66164 Johnson


Names used in family name (P734) (Germany)

[edit]

Frequency of uses of names as family name (P734)-values in Wikidata. Items with country of citizenship (P27) = Germany (Q183) only.

This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!

WDQS | PetScan | TABernacle | Find images | Recent changes
range names total_items sample name sample person
0 35985 0 Alp
1 7876 7876 Vietoris Christian Vietoris
2-4 7244 20871 Harmsen Björn Harmsen
5-9 1646 11867 Hübener Thomas Hübener
10+ 2121 42330 Engelbrecht Julie Engelbrecht
50+ 255 17144 Conrad Lars Conrad
100+ 151 27174 Berger Ludwig Berger
500+ 9 6031 Schneider Richard Schneider
1000+ 2 3348 Schmidt Hans Schmidt


Names used in family name (P734) (Austrian)

[edit]

Frequency of uses of names as family name (P734)-values in Wikidata. Items with country of citizenship (P27) = Austria (Q40) only.

This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!

WDQS | PetScan | TABernacle | Find images | Recent changes
range names total_items sample name sample person
0 49689 0 Neuberger
1 2726 2726 Stone Michael Stone
2-4 2015 5757 Reinhart Karl Reinhart
5-9 408 2935 Hanke Franz Hanke
10+ 403 7367 Keller Greta Keller
50+ 34 2243 Leitner Thea Leitner
100+ 14 1843 Steiner Rudolf Steiner


Discussion

[edit]

It seems to me that item-requires-statement constraint (Q21503247) with language of work or name (P407) is used as a convenient way to check the completeness of family name (Q101352) items? The applicability of Digital Dictionary of Surnames in Germany ID (P6597) is independent of language of work or name (P407), in my opinion. Wouldn’t it be better to rely on EntitySchema:E734 for this check?

(A colleague raised the concern that this constraint might give the impression that the Digital Dictionary of Surnames in Germany ID (P6597) statement is erroneous, and I see the point.) @Jura1

Julian Jarosch (digicademy) (talk) 17:19, 19 December 2019 (UTC)[reply]

Eventually every item for a family name should have one or several such statements, but, as one can see on Property talk:P734/numbers/values for values of P734, we are far from that.
Beyond the format constraint, I don't think constraints indicate that, but I suppose we could change it to a mere suggestion (done that). --- Jura 07:00, 21 January 2020 (UTC)[reply]
Great, thank you! Julian Jarosch (digicademy) (talk) 09:59, 21 January 2020 (UTC)[reply]

I have removed the restriction conflicts-with constraint (Q21502838) because the dictionary also contains Ukrainian, Russian, Japanese names etc. and an error is displayed when merging them, see e.g. Antonjuk (Q107453552)/Antoniuk (Q12784938) and cf. http://www.namenforschung.net/id/name/300261/1 or conflicts-with constraint (Q21502838) etc. --HarryNº2 (talk) 13:29, 8 August 2022 (UTC)[reply]