Available online at www.sciencedirect.com Sensors and Actuators B 132 (2008) 13–19 Application of GA-MLR, GA-PLS and the DFT quantum mechanical (QM) calculations for the prediction of the selectivity coefficients of a histamine-selective electrode Siavash Riahi a,b, , Mohammad Reza Ganjali b , Parviz Norouzi b , Fatemeh Jafari b a Institute of Petroleum Engineering, Faculty of Engineering, University of Tehran, Tehran, Iran b Center of Excellence in Electrochemistry, Faculty of Chemistry, University of Tehran, P.O. Box 14155-6455, Tehran, Iran Received 23 September 2007; received in revised form 17 December 2007; accepted 3 January 2008 Available online 16 January 2008 Abstract Quantitative structure–property relationships (QSPRs) were developed using a genetic algorithm (GA), based on the variable-selection approach with topological descriptors. The selectivity coefficients of 26 molecules (drug, amino-acid and organic compound) of a histamine-selective electrode were efficiently estimated and predicted with the QSPR models. The most important descriptors were selected from a set of 74 topological descriptors to build the QSPR models, using the multiple linear regressions (MLRs) and the partial least squares (PLS) regression. The predictive quality of the QSPR models was tested for an external prediction set of 7 compounds, randomly chosen from 26 compounds. The PLS regression method was used to model the structure-selectivity coefficient relationships. However, the results surprisingly showed more or less the same quality for MLR and PLS modeling, according to the squared regression coefficients R 2 values, which were 0.918 and 0.915, respectively. In addition, the theoretical investigation on the interaction of the histamine and the other studied compounds with the ionophore was performed. The correlation between the interaction energies and the selectivity coefficients of the studied compounds was equal to 0.993, demonstrating the applicability of these results for the prediction of the selectivity coefficients. © 2008 Elsevier B.V. All rights reserved. Keywords: Selectivity coefficient; DFT calculation; Partial least squares (PLS); Genetic algorithm; QSPR; Chemometrics 1. Introduction The multiple linear regressions (MLRs) and the partial least squares (PLS) regression are the mostly used modeling methods in QSPRs (quantitative structure–property relationships) [1–3]. The MLR yields are simpler and easier to interpret than those of the PLS models, because these methods perform regression on the latent variables, not having physical meaning. However, due to the collinearity between the structural descriptors, MLR is not able to extract useful information from the structural data and, therefore, an overfitting problem is encountered. The appli- cation of these techniques usually requires variable selection for building well-fitted models. Corresponding author at: Institute of Petroleum Engineering, Faculty of Engineering, University of Tehran, Tehran, Iran. E-mail address: riahisv@khayam.ut.ac.ir (S. Riahi). Variable selection methods range from simple methods, such as stepwise selection [4], to more elaborate methods, such as simulated annealing [5], evolutionary programming [6] and genetic algorithms (GA) [7]. GA is a stochastic method to solve optimization problems, defined by fitness criteria applying the evolution hypothesis of Darwin and different genetic functions, i.e., crossover and mutation [8]. Compared with the traditional search and the optimization procedures, such as calculus-based and enumerative strategies, GA is robust, global and generally more straightforward to apply to situations where there is little or no a priori knowledge about the process to be controlled. Since GA does not require derivative information or a formal initial estimate of the solution region and because of the stochastic nature of the search mechanism, it is capable of searching the entire solution space with a greater likelihood of finding the global optimum. For literature on GA, the reader can refer to Goldberg [9]. Ion-selective electrodes (ISEs) are electrochemical sensors that respond selectively to the activity of ionic species [10]. In 0925-4005/$ – see front matter © 2008 Elsevier B.V. All rights reserved. doi:10.1016/j.snb.2008.01.009