Wikidata:WikiProject Informatics/Structures/Character encodings
Jump to navigation
Jump to search
Try it!
Try it!
Subpages[edit]
Goals[edit]
Long term goals[edit]
- For Wikidata to become the most comprehensive resource for information on character encoding systems
- For Wikipedia to extensively use data from Wikidata on articles relating to computer and telecommunication systems
Short term goals[edit]
- Define and reach agreement on an ontology for character encoding systems
- Advertise for and encourage new contributors to join the project
- Commence detailed definition of popular and widely used character encoding systems to encourage development of an ontology, and to raise awareness of this project
Automatic lists[edit]
TODO
Useful queries[edit]
- Return a list of all instance of (P31) or subclass of (P279) of grapheme (Q2545446)
SELECT ?item ?itemLabel
WHERE {
?item wdt:P31*/wdt:P279* wd:Q2545446 .
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
- Return a list of all instance of (P31) or subclass of (P279) of character set (Q184757)
SELECT ?item ?itemLabel
WHERE {
?item wdt:P31*/wdt:P279* wd:Q184757 .
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
- Return a list of all instance of (P31) or subclass of (P279) of coded character set (Q29149990)
SELECT ?item ?itemLabel
WHERE {
?item wdt:P31*/wdt:P279* wd:Q29149990 .
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
- Return a list of all instance of (P31) or subclass of (P279) of code page (Q184766)
SELECT ?item ?itemLabel
WHERE {
?item wdt:P31*/wdt:P279* wd:Q184766 .
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
- Return a list of all instance of (P31) or subclass of (P279) of code point (Q1105784)
SELECT ?item ?itemLabel
WHERE {
?item wdt:P31*/wdt:P279* wd:Q1105784 .
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
- Return a list of characters (symbol), and the UNICODE characters associated with each.
SELECT ?item ?itemLabel ?unicode
WHERE
{
?item wdt:P31 wd:Q3241972.
OPTIONAL{?item wdt:P487 ?unicode.}
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
- Return a list of characters (computing), and the UNICODE characters associated with each.
SELECT ?item ?itemLabel ?unicode
WHERE
{
?item wdt:P31 wd:Q32483.
OPTIONAL{?item wdt:P487 ?unicode.}
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
Properties & structure of items[edit]
TODO: identify properties applying to graphemes, code pages, character encodings/etc
Property proposals in progress:
- Coded Character Set Identifier (CCSID)
- Code Page Identifier (CPGID)
- Graphic Character Global Identifier (GCGID)
- Graphic Character Set Global Identifier (GCSGID)
[edit]
TODO