Property talk:P6597
Documentation
ID of corresponding entry in the DFD online dictionary of family names
The Digital Dictionary of Surnames in Germany (Q61889795) publishes new entries every two weeks, on the first and fifteenth day of the month. With every update, two lists are also updated on the DFD server:
- http://www.namenforschung.net/alle.csv – list of all published entries
- http://www.namenforschung.net/neu.csv – list of newly published entries from the last update
Both lists follow the upload format for Mix’n’match (M’n’M), though they have not yet been adjusted to the M’n’M update of February 2021. The Mix’n’match catalog will soon be updated from the first list every two weeks.
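As an illustration of how entries from such a list could be paired with Wikidata items without creating ambiguous matches, here is a minimal Python sketch. It is not the external workflow itself, whose details are not documented on this page. The function names, the exact-label comparison, and the sample IDs are assumptions made for the example; parsing of the CSV files is left out because their column layout is not described here. The output lines use the tab-separated QuickStatements V1 syntax with the external ID in double quotes.

```python
from collections import defaultdict


def one_to_one_matches(dfd_entries, wikidata_items):
    """Pair DFD entries with Wikidata items only when the name is unique on both sides.

    dfd_entries: iterable of (dfd_id, name) tuples (hypothetical input shape).
    wikidata_items: iterable of (qid, label) tuples (hypothetical input shape).
    Returns a sorted list of (qid, dfd_id) tuples.
    """
    ids_by_name = defaultdict(set)
    for dfd_id, name in dfd_entries:
        ids_by_name[name].add(dfd_id)

    qids_by_name = defaultdict(set)
    for qid, label in wikidata_items:
        qids_by_name[label].add(qid)

    matches = []
    for name, dfd_ids in ids_by_name.items():
        qids = qids_by_name.get(name, set())
        # Keep the pair only if exactly one entry and exactly one item share the name.
        if len(dfd_ids) == 1 and len(qids) == 1:
            matches.append((next(iter(qids)), next(iter(dfd_ids))))
    return sorted(matches)


def to_quickstatements(matches):
    """Render (qid, dfd_id) pairs as QuickStatements V1 lines for P6597."""
    return "\n".join(f'{qid}\tP6597\t"{dfd_id}"' for qid, dfd_id in matches)


# Made-up sample data: "Schmidt" has two candidate items, so it is skipped.
entries = [("101", "Meyer"), ("102", "Schmidt"), ("103", "Stahl")]
items = [("Q1", "Meyer"), ("Q2", "Schmidt"), ("Q3", "Schmidt")]
batch = to_quickstatements(one_to_one_matches(entries, items))
```

Comparing labels for exact equality is a deliberate simplification. A real run would also need to check the candidate item's type, script and language statements to stay clear of the constraints listed on this page.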
Additional matching is possible with an external workflow, which produces a list ready for QuickStatements. This workflow is intended to only make unequivocal 1:1 matches and avoid constraint violations.
Format “[1-9][0-9]{0,5}|10[0-9]{5}”: value must be formatted using this pattern (PCRE syntax). (Help)
List of violations of this constraint: Database reports/Constraint violations/P6597#Format, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P1705, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P282, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P31, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Entity types
List of violations of this constraint: Database reports/Constraint violations/P6597#Scope, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P407, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#language
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'de' language, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'en' language, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'es' language, search, SPARQL
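Before adding values in bulk, candidate IDs can be checked locally against the format constraint pattern given above. The following Python sketch assumes that the pattern has to match the whole value, which is why it uses `re.fullmatch`. The function name is hypothetical.

```python
import re

# Pattern copied from the format constraint on this page.
DFD_ID_PATTERN = re.compile(r"[1-9][0-9]{0,5}|10[0-9]{5}")


def is_valid_dfd_id(value: str) -> bool:
    """Return True if the entire string matches the format constraint pattern."""
    return DFD_ID_PATTERN.fullmatch(value) is not None


# The first alternative covers 1 to 999999, the second 1000000 to 1099999.
checks = {v: is_valid_dfd_id(v) for v in ["1", "300261", "1099999", "0", "0123", "1100000"]}
```

Read this way, the pattern accepts integers without leading zeros from 1 up to 1099999.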
there should be just one native label (P1705) statement on items (Help)
Violations query:
SELECT ?item (COUNT(DISTINCT ?st) as ?count) (COUNT(DISTINCT str(?nl)) as ?count2) (GROUP_CONCAT(DISTINCT str(?nl); separator=", ") as ?nls) WHERE { ?item wdt:P6597 [] . ?item p:P1705 ?st . ?st ps:P1705 ?nl . } GROUP BY ?item HAVING (?count2 > 1) ORDER BY DESC(?count2) ?item LIMIT 500
List of violations of this constraint: Database reports/Complex constraint violations/P6597#single native label
check native label (P1705) and label in English (en) (Help)
Violations query:
SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"en") as ?en_label) FILTER NOT EXISTS { ?item rdfs:label ?en_label } }
List of violations of this constraint: Database reports/Complex constraint violations/P6597#en label should match P1705 value
check native label (P1705) and label in German (de) (Help)
Violations query:
SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"de") as ?de_label) FILTER NOT EXISTS { ?item rdfs:label ?de_label } }
List of violations of this constraint: Database reports/Complex constraint violations/P6597#de label should match P1705 value
check native label (P1705) and label in Spanish (es) (Help)
Violations query:
SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"es") as ?es_label) FILTER NOT EXISTS { ?item rdfs:label ?es_label } }
List of violations of this constraint: Database reports/Complex constraint violations/P6597#es label should match P1705 value
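The three label checks above (en, de, es) differ only in the language code. If the same check is wanted for another language, the query text can be generated rather than copied by hand. The Python sketch below is one possible way to do that. The helper name is hypothetical, and the restriction to two- or three-letter lowercase codes is an assumption made so that the code can double as a SPARQL variable name.

```python
import re

# Template reproducing the label-check queries on this page;
# doubled braces are literal braces for str.format.
QUERY_TEMPLATE = (
    'SELECT ?item ?nl {{ ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . '
    'BIND( strlang(str(?nl),"{lang}") as ?{lang}_label) '
    'FILTER NOT EXISTS {{ ?item rdfs:label ?{lang}_label }} }}'
)


def label_check_query(lang: str) -> str:
    """Build the 'label should match P1705 value' query for one language code."""
    # Assumption: only plain codes such as "en" or "de"; codes with hyphens
    # would not be valid inside a SPARQL variable name.
    if not re.fullmatch(r"[a-z]{2,3}", lang):
        raise ValueError(f"unsupported language code: {lang!r}")
    return QUERY_TEMPLATE.format(lang=lang)


french_query = label_check_query("fr")
```

For `"en"`, `"de"` and `"es"` the helper returns the same text as the three queries listed above.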
labels would generally be available in several languages (Help)
Violations query:
SELECT ?item ?nl (COUNT(*) as ?count) { ?item wdt:P6597 ?value . ?item rdfs:label ?l . ?item wdt:P1705 ?nl . } GROUP BY ?item ?nl HAVING ( ?count < 9)
List of violations of this constraint: Database reports/Complex constraint violations/P6597#labels in several languages
Default English language description for Latin script family names is "family name" (Help)
Violations query:
SELECT ?item ?en_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item schema:description ?en_desc . FILTER( lang(?en_desc)="en" && !CONTAINS( ?en_desc, "family name" ) ) ?item wdt:P282 wd:Q8229 }
List of violations of this constraint: Database reports/Complex constraint violations/P6597#en description to include "family name"
Default German language description for Latin script family names is "Familienname" (Help)
Violations query:
SELECT ?item ?de_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item schema:description ?de_desc . FILTER( lang(?de_desc)="de" && !CONTAINS( ?de_desc, "Familienname" ) ) ?item wdt:P282 wd:Q8229 }
List of violations of this constraint: Database reports/Complex constraint violations/P6597#de description to include "Familienname"
Japanese description format is generally "姓 (<P1705 value>)" (Help)
Violations query:
SELECT ?item ?nl ?ja_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . ?item schema:description ?ja_desc . FILTER(lang(?ja_desc)="ja" && !CONTAINS( ?ja_desc, ?nl ) ) ?item wdt:P282 wd:Q8229 . } LIMIT 200
List of violations of this constraint: Database reports/Complex constraint violations/P6597#ja description to include P1705 value
Russian description format is generally "фамилия - <P1705 value>" (Help)
Violations query:
SELECT ?item ?nl ?ru_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . ?item schema:description ?ru_desc . FILTER(lang(?ru_desc)="ru" && !CONTAINS( ?ru_desc, ?nl ) ) ?item wdt:P282 wd:Q8229 . } LIMIT 200
List of violations of this constraint: Database reports/Complex constraint violations/P6597#ru description to include P1705 value
Native label should be an alias in Russian (Help)
Violations query:
SELECT ?item ?nl ?alt { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl), "ru") as ?alt) FILTER NOT EXISTS { ?item skos:altLabel ?alt } } LIMIT 200
List of violations of this constraint: Database reports/Complex constraint violations/P6597#ru alias to include P1705 value
family names used as P734 values, but without a P6597 statement. Selection by nationalities (Help)
Violations query:
SELECT ?item ?l ?count ?sample WITH { SELECT ?item (COUNT(DISTINCT ?p) as ?count) (SAMPLE(?p) as ?sample) { VALUES ?c { wd:Q183 wd:Q16957 wd:Q713750 } ?p wdt:P27 ?c . hint:Prior hint:rangeSafe true . ?p wdt:P734 ?item . ?p wdt:P31 wd:Q5 . } GROUP BY ?item HAVING ( ?count > 20 ) } as %a WHERE { INCLUDE %a FILTER NOT EXISTS { ?item wdt:P6597 [] } ?item rdfs:label ?l . FILTER(lang(?l) = "de" ) } ORDER BY DESC(?count) LIMIT 100
List of violations of this constraint: Database reports/Complex constraint violations/P6597#Most frequent P734 values without property (Germany, nat)
family names used as P734 values, but without a P6597 statement. Selection by place of birth (Help)
Violations query:
SELECT ?item ?l ?count ?sample WITH { SELECT ?item (COUNT(DISTINCT ?p) as ?count) (SAMPLE(?p) as ?sample) { ?p wdt:P19 / wdt:P17 wd:Q183 . hint:Prior hint:rangeSafe true . ?p wdt:P734 ?item . ?p wdt:P31 wd:Q5 . } GROUP BY ?item HAVING ( ?count > 20 ) } as %a WHERE { INCLUDE %a FILTER NOT EXISTS { ?item wdt:P6597 [] } ?item rdfs:label ?l . FILTER(lang(?l) = "de" ) } ORDER BY DESC(?count) LIMIT 100
List of violations of this constraint: Database reports/Complex constraint violations/P6597#Most frequent P734 values without property (Germany, POB)
Statistics
Names used in family name (P734)
Frequency of uses of names as family name (P734)-values in Wikidata.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
range | names | total_items | sample |
---|---|---|---|
0 | 21431 | 0 | Kopfmann |
1 | 7671 | 7671 | Ayik |
2-4 | 11194 | 34407 | Krobot |
5-9 | 3979 | 28936 | Wrubel |
10+ | 7256 | 157152 | Wörndle |
50+ | 1545 | 107252 | Dolinar |
100+ | 1662 | 350523 | Link |
500+ | 277 | 195189 | Stahl |
1000+ | 249 | 496877 | Johnston |
5000+ | 20 | 126969 | Martin |
10000+ | 5 | 66164 | Johnson |
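The table groups names by how often they are used as a P734 value. The totals indicate that the "+" rows are not cumulative: the "10+" row has a smaller total than the "100+" row, so each row appears to cover counts from its lower bound up to just below the next row's bound. The Python sketch below reproduces that grouping under this reading. It is not the bot's code, which is not shown on this page, and all names in it are hypothetical.

```python
from bisect import bisect_right

# Lower bound of each range and the label used in the table.
LOWER_BOUNDS = [0, 1, 2, 5, 10, 50, 100, 500, 1000, 5000, 10000]
LABELS = ["0", "1", "2-4", "5-9", "10+", "50+", "100+", "500+", "1000+", "5000+", "10000+"]


def bucket_label(uses: int) -> str:
    """Return the range label for a name that is used `uses` times."""
    if uses < 0:
        raise ValueError("use count cannot be negative")
    # bisect_right counts the lower bounds that are <= uses.
    return LABELS[bisect_right(LOWER_BOUNDS, uses) - 1]


def summarize(use_counts):
    """Aggregate {name: uses} into {label: (names, total_items)}."""
    summary = {label: [0, 0] for label in LABELS}
    for uses in use_counts.values():
        row = summary[bucket_label(uses)]
        row[0] += 1      # one more name in this range
        row[1] += uses   # its uses add to the range total
    return {label: tuple(row) for label, row in summary.items()}


# Made-up counts, not taken from the table.
example = summarize({"A": 0, "B": 3, "C": 4, "D": 60})
```

With this grouping, the `names` column counts distinct family name items in each range and `total_items` sums their uses.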
Names used in family name (P734) (Germany)
Frequency of uses of names as family name (P734)-values in Wikidata. Items with country of citizenship (P27) = Germany (Q183) only.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
range | names | total_items | sample name | sample person |
---|---|---|---|---|
0 | 35985 | 0 | Alp | |
1 | 7876 | 7876 | Vietoris | Christian Vietoris |
2-4 | 7244 | 20871 | Harmsen | Björn Harmsen |
5-9 | 1646 | 11867 | Hübener | Thomas Hübener |
10+ | 2121 | 42330 | Engelbrecht | Julie Engelbrecht |
50+ | 255 | 17144 | Conrad | Lars Conrad |
100+ | 151 | 27174 | Berger | Ludwig Berger |
500+ | 9 | 6031 | Schneider | Richard Schneider |
1000+ | 2 | 3348 | Schmidt | Hans Schmidt |
Names used in family name (P734) (Austria)
Frequency of uses of names as family name (P734)-values in Wikidata. Items with country of citizenship (P27) = Austria (Q40) only.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
range | names | total_items | sample name | sample person |
---|---|---|---|---|
0 | 49689 | 0 | Neuberger | |
1 | 2726 | 2726 | Stone | Michael Stone |
2-4 | 2015 | 5757 | Reinhart | Karl Reinhart |
5-9 | 408 | 2935 | Hanke | Franz Hanke |
10+ | 403 | 7367 | Keller | Greta Keller |
50+ | 34 | 2243 | Leitner | Thea Leitner |
100+ | 14 | 1843 | Steiner | Rudolf Steiner |
Discussion
Is item-requires-statement constraint (Q21503247) with language of work or name (P407) necessary?
It seems to me that item-requires-statement constraint (Q21503247) with language of work or name (P407) is used as a convenient way to check the completeness of family name (Q101352) items? The applicability of Digital Dictionary of Surnames in Germany ID (P6597) is independent of language of work or name (P407), in my opinion. Wouldn’t it be better to rely on EntitySchema:E734 for this check?
(A colleague raised the concern that this constraint might give the impression that the Digital Dictionary of Surnames in Germany ID (P6597) statement is erroneous, and I see the point.) @Jura1
Julian Jarosch (digicademy) (talk) 17:19, 19 December 2019 (UTC)
- Eventually every item for a family name should have one or several such statements, but, as one can see on Property talk:P734/numbers/values for values of P734, we are far from that.
- Beyond the format constraint, I don't think constraints indicate that, but I suppose we could change it to a mere suggestion (done that). --- Jura 07:00, 21 January 2020 (UTC)
- Great, thank you! Julian Jarosch (digicademy) (talk) 09:59, 21 January 2020 (UTC)
I have removed the restriction conflicts-with constraint (Q21502838) because the dictionary also contains Ukrainian, Russian, Japanese names etc. and an error is displayed when merging them, see e.g. Antonjuk (Q107453552)/Antoniuk (Q12784938) and cf. http://www.namenforschung.net/id/name/300261/1 or conflicts-with constraint (Q21502838) etc. --HarryNº2 (talk) 13:29, 8 August 2022 (UTC)