ARTICLE
OBC
www.rsc.org/obc
Quantum chemical topology (QCT) descriptors as substitutes for
appropriate Hammett constants
P. J. Smith and P. L. A. Popelier*
School of Chemistry, Faraday Building, Sackville Site, University of Manchester, Manchester,
M60 1QD, Great Britain. E-mail: pla@manchester.ac.uk; Fax: +44 (0)161 306 4559;
Tel: +44 (0)161 306 4511
Received 18th May 2005, Accepted 26th July 2005
First published as an Advance Article on the web 17th August 2005
A technique called quantum topological molecular similarity (QTMS) was recently proposed [J. Chem. Inf. Comput.
Sci., 2001, 41, 764] in order to construct a variety of medicinal, ecological and physical organic QSAR/QSPRs, based
on modern ab initio wave functions of geometry optimised molecules, in combination with quantum chemical
topology (QCT). The current abundance of computing power can be utilised to inject realistic descriptors into
QSAR/QSPRs. In previous work [J. Chem. Soc., Perkin Trans. 2, 2002, 1231] it was proven that a set of Hammett
constants (r
p
, r
m
, r
I
and r
0
p
) for a sizeable set of mono- and polysubstituted carboxylic acids can be replaced by QCT
bond descriptors. Using QTMS and proper statistical validation we examined seven data sets in total. The first three
sets (para-substituted phenols (r
−
), substituted toluenes (r
+
) and bromophenethylamines (r
+
)) corroborate that a
wider class of Hammett constants can also be replaced by QCT descriptors. A fourth set (benzyl radicals) focuses on
non-Hammett behaviour being superimposed on Hammett behaviour. QCT descriptors selectively correlate with
Hammett behaviour. The QTMS analysis of the last three sets (toxicity of benzyl alcohols, chromatographic capacity
factors of chalcones and herbicidal activity of 5-chloro-2,3-dicyanopyrazines) screens for false positives. This test is
successfully passed in that QCT descriptors fail when lipophilicity/hydrophobicity is in charge. Hence, overall, the
discriminatory capacity of QCT descriptors is established, in detecting Hammett behaviour and specifically replacing
the Hammett constants by more modern and non-empirical descriptors.
Introduction
The Hammett substituent constants, which quantify the electron
withdrawing or donating capabilities of a given substituent,
have enjoyed enormous success as structural descriptors in
quantitative structure–activity relationships (QSARs) and quan-
titative structure–property relationships (QSPRs).
1
The physical
chemical database of Hansch and co-workers contains over 8900
QSARs, of which 8875 are based on the r parameters.
2
It is
apparent that electronic effects are of considerable importance
in describing chemical reactions, and hence biological systems.
Some time ago
3
we used for the first time quantum chemical
descriptors defined by quantum chemical topology (QCT)
4–7
to set up a QSAR to predict r constants and, therefore, also
the acidity of substituted benzoic acids. Other than being a
generalisation of quantum mechanics to subspaces
8–10
QCT is
widely appreciated as a theory that extracts chemical insight
from modern wave functions.
11,12
The acronym QTMS (quantum
topological molecular similarity) covered the initial develop-
ment and intention of QCT to provide an economic alternative
to Carbo-like indices.
13
It was proven that superposition of
(complete molecular) electron densities is not necessary to set up
a successful QSAR. Instead, as explained below, special points
in 3D space sufficed.
Although QCT descriptors could have been fed into molecular
similarity indices, they immediately appeared in a “supervised”
regression context. In other words, models
14
were constructed
using partial least squares (PLS) for a variety of medicinal
15–17
and ecological
18–21
QSARs. The estimation of the pK
a
of
carboxylic acids, anilines and phenols
22
and prediction of r
p
,
r
m
, r
I
and r
0
p
parameters of mono-
3
and polysubstituted benzoic
acids, phenylacetic acids and bicyclo carboxylic acids
19
feature as
examples of physical organic properties. The QTMS approach
inspired work in other groups (e.g.ref. 23–24). A strong and
important feature of QTMS is that it is able to localise a part
in the molecule where the chemical change associated with
the observed activity actually happens. For example, the O–H
bond is highlighted as the active center in the deprotonation of
carboxylic acids if their acidity is studied.
QTMS typically uses so-called bond critical point (BCP)
properties, as in this study, but (integrated) atomic properties can
also feature. BCPs are 3D saddle points in the electron density,
located by computation
25
and appearing at the boundary
between two QCT atoms. Certain functions, such as the electron
density, are evaluated at each BCP, thereby characterising the
bond that the BCP represents. A BCP can be represented
by an arbitrarily high number of properties, which serve as
vector components locating the BCP as a “quantum chemical
fingerprint” in BCP-space.
3
In view of the ubiquity of the Hammett parameters in
formulating QSARs, coupled with the promising results ob-
tained so far in capturing substituent effects, we aimed at
extending the range of r constants investigated by QTMS. In
addition, the range of systems is hereby also increased, beyond
the domain of carboxylic acid datasets. We examined four
data sets: para-substituted phenols (r
−
), bromophenethylamines
(r
+
), substituted toluenes (r
+
), and benzyl radicals in order to
investigate non-Hammett behaviour. We will show that excellent
statistics are obtained for the first three sets, and that the active
centers highlighted make sense. The last set is a “false positive”
test, adopted to see whether QTMS still performs well when
conventional “polar effect” Hammett constants fail. We will
show that QTMS passes this test and hence operates successfully
for the right reasons.
Methods
The full details of how QTMS operates have been published
before
13
and repeated in the applications mentioned in the
Introduction. In the interest of making this paper self-contained
it is useful to summarise the procedure here.
Geometry-optimised wave functions were generated at HF/6-
31G*//HF/6-31G* level using the program GAUSSIAN98.
26
In previous QTMS work we varied the level of theory between
DOI: 10.1039/b507024d
This journal is ©
The Royal Society of Chemistry 2005 Org. Biomol. Chem. , 2005, 3 , 3399–3407 3399