This page collects information on known issues with structure-related properties and external identifiers in items about partially ionic entities. See also a list of all issues related to this WikiProject.
Ionic bond refers to the bond between atoms with sharply different electronegativities. Such chemical entity is overall neutral, but consists of positively charged cations and negatively charged anions. In practice, however, chemical compounds are neither purely ionic, nor purely covalent. Each bond between atoms in chemical compounds exhibits some amount of ionic character and some of covalent character.
Such partial bonding characters are difficult to capture when representing a chemical entity through structural formulas and thus cannot be correctly reproduced using identifiers based on such formulas, like simplified molecular input line entry specification or International Chemical Identifier, which leads to the representation of the chemical entity as either ionic or covalent. Chemical databases can thus contain records for the same chemical entity generated from different structure-based identifiers.
In Wikidata, structure-related properties like InChI (P234), InChIKey (P235) and canonical SMILES (P233) are affected by this, as well as identifiers in large-scale chemical databases like ChemSpider ID (P661) or PubChem CID (P662). Entries in such databases are usually not considered incorrect, even if a predominantly ionic chemical entity described has a covalent structure in such entry.
In some cases, like with ChemSpider entries, standard InChI and InChIKey may be the same for both ionic and covalent structures, and the difference may be with structural formula, SMILES or non-standard InChI/InChIKey (which are visible only for logged in users after clicking ‘Wikibox‘ button). This is a result of generating ChemSpider entries for organometallic entities and salts based on non-standard InChI with RecMet option and adding a metal reconnected layer ‘\r‘ in InChI Software. Sometimes entries for ionic and covalent entities may have different standard InChI and InChIKey.
This page contains examples of solutions that will work in most cases, however, in unclear situations, you may consult each case on the WikiProject Chemistry talk page. For the evaluation purposes, all cases of the above-mentioned issues with ionic entities should be listed below.
which should be added for external identifier and structure-related properties for values that show a more appropriate way of representing a chemical entity as ionic.
which should be added to structure-related properties, but not to external identifiers, for values that show a covalent character for a predominantly ionic compound
sometimes incorrect structure of molecular entity (Q52679949) value may be of use (for both structure-related properties and external identifier), if for some reason the generated structure in the database, and on this basis also the statements imported to Wikidata, is obviously incorrect (this does not concern the nature of the bonds, but e.g. charge or stoichiometry).
ChemSpider database for this salt contains two entries: CSID:23338 for an entity consisting of a cobalt(2+) ion and a sulfate anion, and CSID:28822501 for an entity with a covalent bonds between a cobalt atom and two oxygen atoms. Standard InChI and InChIKeys are identical, structural formula, SMILES and non-standard InChI and InChIKeys are different.
Both identifiers have a specific criterion used (P1013) qualifier added regarding the ionic/covalent character. As the chemical entity has a predominantly ionic character (salt), one value is set to preferred with a proper reason for preferred rank (P7452) added.
First value shows a correct representation of a ionic compound, no need to set rank as preferred. The second value shows a covalent representation of a predominantly ionic compound, thus it is set to deprecated with a proper reason stated.
For this item there are four entries in PubChem and two entries in ChemSpider. Also, there are many SMILES, InChI and InChIKey values. This example shows a complex situation in which there is not only a problem of different representation of ionic and covalent bonding, but also issues with generating records in chemical databases with improbable formula or charge, which are then imported to Wikidata based on some identifiers or names.
First value shows a correct representation of a ionic compound, no need to set rank as preferred. The second value was imported from ChemSpider and shows an incorrect representation of an entity (wrong formula and charge), while the third value shows a covalent representation of a predominantly ionic compound; both values are set to deprecated with a proper reason stated.
First value shows a correct representation of a ionic compound, no need to set rank as preferred. The second value was imported from ChemSpider and shows an incorrect representation of an entity (wrong formula and charge), while the third value shows a covalent representation of a predominantly ionic compound; both values are set to deprecated with a proper reason stated.
First value shows a correct representation of a ionic compound, no need to set rank as preferred. The second value was imported from ChemSpider and shows an incorrect representation of an entity (wrong formula and charge), while the third value shows a covalent representation of a predominantly ionic compound; both values are set to deprecated with a proper reason stated.
First value shows a correct representation of a ionic compound. However, based on the content of the second ChemSpider record, it was imported into this item, but as it describe incorrect structure, it is set as deprecated.
The reason why both records exist in PubChem is unknown, and the choice between them remains purely editorial (the first one seems to be more complete), hence the indicated reason for preferred rank. The third value has been withdrawn from PubChem (non-live status), while the fourth value shows an invalid representation of a chemical entity (wrong formula and charge).