Aromaticity Detection

Which molecules are aromatic?

An aromatic molecule is one in which electrons form a continuous pi cloud above and below the circular arrangements of atoms. In one representation these atoms are connected by alternating single and double bonds. Another representation is that of the circular pi bond, in which the electron density is evenly distributed through a pi bond above and below the ring. The circulating pi electrons in a aromatic molecule generate significant local magnetic fields that can be detected by NMR techniques. Aromatic molecules typically display enhanced chemical stability, compared to similar non-aromatic molecules.

This commonly seen model of aromatic rings was developed by Friedrich August Kekulé von Stradonitz and was first explained in quantum mechanical terms by Linus Pauling in the 1930s. In 1931, Erich Hückel devised the "4n+2" pi electron rule, valid for planar molecules with a single ring. Molecules having 4n+2 pi electrons (n >= 0) are expected to be aromatic.

Methods

Two types of transformation to aromatic representation are implemented in Marvin, both are based on the Hückel's 4n+2 rule.

All transformation methods work only in structures which are in non-aromatic representation. If the molecules are in partially aromatic form (containing any aromatic bond) the transformation method may fail.

Basic aromaticity detection

Process: locate the ring atoms in the molecule connected with single and double bonds respectively, sum the number of electrons and if the Hückel's rule is valid, then the ring is aromatic. Ring systems are also checked.

There are some exceptions:

General aromaticity detection

The method is the same as used by Daylight.
Locate the ring atoms in the molecule connected with single and double bonds respectively, sum the number of electrons and if the Hückel's rule is valid, then the ring is aromatic. Ring systems are also checked. The atoms at the generated ring system may not form a continuous ring like in benzo[de]chromene (SMILES: O1C=CC2=CC=CC3=CC=CC1=C23) where all atoms of the molecule are in the ring system generated from the three 6 membered ring.

There are some exceptions:

Loose aromaticity detection

The following ring systems are interpreted as aromatic:

Differences between the Basic and General methods

The two method approach the question differently. The general method tries to incorporate mesomeric, tautomeric rearrangement, as in 2-pyridone, while the basic method does not. In the basic method the external double bond breaks the formation of aromatic ring.

The 2-pirydone is aromatic due to its mesomeric rearrangement:
pyridone mesomer

The following molecules will give different results depending upon the method applied.

Molecule in aliphatic form Basic aromatization General aromatization
IUPAC name: pyridin-2(1H)-one
pyridone pyridone with ChemAxon aromatization pyridone with Daylight aromatization
IUPAC name: 2-thioxo-2,3-dihydropyrimidin-4(1H)-one
smi 2  with ChemAxon aromatization  with Daylight aromatization
IUPAC name: 2,4-dihydro-3H-1,2,4-triazol-3-thione
smi 3  with ChemAxon aromatization  with Daylight aromatization
IUPAC name: 9H-xanthen-9-one
smi 4  with ChemAxon aromatization  with Daylight aromatization
IUPAC name: 5-thioxo-1,2,4-triazolidin-3-one
smi 5  with ChemAxon aromatization  with Daylight aromatization
IUPAC name: imidazo[1,5-a]pyridine-3(2H)-thione
smi 6  with ChemAxon aromatization  with Daylight aromatization

Aromatization of query structures

Structures with Query atoms

A query structure which defines one or more molecules with atom lists or query atoms in one query structure is converted to aromatic form if any of the defined structures can be aromatized. Link nodes, R-atoms and query bonds are not aromatized. If these features are used, the use of single/aromatic or double/aromatic bond types is recommended for each of the ring bonds in order to ensure matching to aromatic systems.
Query structure Aromatized
smi 7 yes
smi 8 yes
smi 9 yes
smi 10 yes
smi 11 yes
smi 12 yes
smi 13 no

Structures with Query bonds

Query structures with ANY bond are aromatized if the ring or ring system containing the ANY bond can be aromatized supposing the ANY bond as aromatic. In this case the single and double bonds are converted to single_or_aromatic and double_or_aromatic bonds respectively.
Structures with query bonds may contain atoms with improper valence due to mixed aromatic-Kekule representation. These structures are accepted and aromatized as they are just an incomplete representation of an otherwise aromatic ring.
Query structure Aromatized form
[#6]1~[#6]C=CC=C1 query aromatized
*~1~*-*~*~*~*~1 query aromatized
mixed aromatic-Kekule representation 1 query aromatized
mixed aromatic-Kekule representation 2 query aromatized

Aromatization error detection

Possible aromatization errors may be detected with the use of Structure Checker. Guide to Aromaticity Error Checker.

References

[1] http://www.daylight.com/dayhtml_tutorials/languages/smiles/index.html
[2] Open discussion forum about aromatic forms
[3] MDL's Enhanced Stereochemical Representation