Neural Network Based Temperature-Dependent Quantitative Structure Property
Relations (QSPRs) for Predicting Vapor Pressure of Hydrocarbons
Denise Yaffe and Yoram Cohen*
Department of Chemical Engineering, University of California, Los Angeles,
Los Angeles, California 90095-1592
Received September 28, 2000
A neural network based quantitative structure-property relationship (QSPR) was developed for the vapor
pressure-temperature behavior of hydrocarbons based on a data set for 274 compounds. The optimal QSPR
model was developed based on a 7-29-1 back-propagation neural network architecture using valance molecular
connectivity indices (
1
v
,
3
v
,
4
v
), molecular weight, and temperature as input parameters. The average
absolute errors in vapor pressure predictions for the test, validation, and overall data sets were 8.2% (0.036
log P units or 23.2 kPA), 9.2% (0.039 log P units or 26.8 kPA), and 10.7% (0.046 log P units or 31.1 kPA),
respectively. The performance of the QSPR for temperature-dependent vapor pressure, which was developed
from a simple set of molecular descriptors, displayed accuracy of better than or well within the range of
other available estimation methods.
INTRODUCTION
Vapor pressure is an important property in many practical
applications. Vapor pressures are commonly used for as-
sessing the mass distribution of chemicals in the environment,
designing chemical processes, and calculating other physi-
cochemical properties. With vapor pressure data, air-water
partition coefficients, enthalpy of vaporizations, rates of
evaporation, and flash points can be estimated. The greatest
difficulty and uncertainty arises when vapor pressures are
being determined for chemicals of low volatility (vapor
pressures below 1 Pa). Experimental vapor pressure data are
abundant for low molecular weight hydrocarbons. However,
reliable experimental vapor pressure data are scarce for most
compounds with normal boiling points over 200 °C.
As a complement to experimental vapor pressure data,
numerous correlations for estimating vapor pressures have
been proposed. Most vapor pressure estimation equations are
either empirical or are based on equations-of-state or on the
Clausius-Clapeyron equation. The Clapeyron, Lee-Kesler,
Riedel, Frost-Kalkwarf-Thodos, Riedel-Plank-Miller, and
Thek-Stiel are equations based on corresponding-state rela-
tionships developed from critical temperature and pressure
data. These equations are applicable for mostly light mo-
lecular weight, nonpolar compounds with vapor pressures
above 1.33 kPa.
1
The above approaches rely on knowledge
of at least one critical property, boiling point, or melting
point temperature. Unfortunately, boiling points and melting
points as well as critical properties are often lacking,
especially for heavy hydrocarbons with many being thermally
unstable in the critical region.
Quantitative structure-property relationship (QSPR) is an
alternative approach for estimating vapor pressure. The
premise of QSPR is that physicochemical properties can be
correlated with molecular structural characteristics (geometric
and electronic) expressed in terms of appropriate molecular
descriptors.
2
Various studies have reported on the use of
electronic (i.e., dipole moments, hydrogen bonding param-
eters), lipophilic (i.e., partition coefficients), and topological
(i.e., molecular connectivity indices and other geometric
parameters) descriptors as well as other molecular parameters
(e.g., molar volume, parachor, and molar refractivity) for
correlating structural parameters with physicochemical prop-
erties. QSPR development involves the selection of molecular
descriptors to satisfactorily characterize different sets of
compounds and the application of algorithms, such as partial
least-squares or artificial neural networks to build the QSPR
model.
3-15
Partial least-squares regression methods require
a priori specification of the analytical form of the QSPR
model. As an alternative, neural networks (NNs) have gained
popularity in recent years as a technique for developing
quantitative structure-property relationship (QSPR) models.
The advantage of NNs over the regression analysis methods
is their inherent ability to incorporate nonlinear relationships
between the structures of compounds and their physical
properties.
13-16
The primary goal of the present study is to investigate the
potential applicability of neural network/temperature-de-
pendent QSPR models to predict vapor pressure as a function
of temperature. Specifically, we demonstrate the approach
for hydrocarbons ranging from four to 12 carbon atoms. We
select a back-propagation neural network system since it is
especially suitable for mapping complex nonlinear relation-
ships that may exist between model output (i.e., physico-
chemical properties) and model inputs (i.e., molecular
descriptors).
Currently, vapor pressure QSPRs that have been proposed
are limited to predicting vapor pressure at a constant
temperature. For example, Katritzky et al.,
10
using a QSPR
approach, proposed a five-descriptor linear correlation model
for predicting vapor pressures of 411 compounds at 25 °C.
The above author reported a standard vapor pressure (atm)
* Corresponding author phone: (310)825-8766; fax: (310)477-3868;
e-mail: yoram@ucla.edu.
463 J. Chem. Inf. Comput. Sci. 2001, 41, 463-477
10.1021/ci000462w CCC: $20.00 © 2001 American Chemical Society
Published on Web 02/21/2001