Green Chemistry PAPER Cite this: Green Chem., 2016, 18, 4468 Received 5th April 2016, Accepted 15th July 2016 DOI: 10.1039/c6gc00945j www.rsc.org/greenchem Predicting skin permeation rate from nuclear magnetic resonance spectra Nan An, a John-Hanson Machado, b Yuechuan Tang, a Jakub Kostal b and Adelina Voutchkova-Kostal* a In order to systematically approach the design of chemicals with minimal toxicity we are in need of predictive tools that can be applied seamlessly during chemical synthesis and characterization. Our approach is to develop models that utilize spectroscopic data, called Quantitative Spectrometric Data-Activity Relationships, to predict bioavailability and toxicity. Such models do not require knowledge of chemical structure and can be applied to chemical mixtures. Here we report a predictive QSDAR for skin permeation rate (log K p ) of organic chemicals from NMR spectroscopic data and molecular weight. The model is trained on a large data set consisting of structurally diverse chemicals and has been thoroughly externally validated once with a withheld subset of the original data set, and once with a distinct set of complex biologically active compounds curated by Klimisch scoring (r 2 = 0.838, q ext1 2 = 0.837, q ext2 2 = 0.419). The model performs equally or better than prevailing struc- ture-based methods, and oers a number of advantages for facilitating rational design of safer chemicals. Introduction Estimation of skin permeation rates of chemicals is of para- mount importance to medicinal chemists, formulators and toxicologists. 1,2 Medicinal chemists consider skin permeability of dermal APIs in an eort to deliver the desired dose; formu- lators seek to understand permeability in order to create eective and safe cosmetic products; and toxicologists con- sider permeation when carrying out risk and alternatives assessments. 3 Furthermore, molecular designers seek to mini- mize skin permeation of commercial chemicals in order to reduce the probability of toxicity. Experimental methods for testing skin permeability include in vitro diusion chamber experiments, 4 biomonitoring experiments 5,6 and excised skin experiments, most commonly from rat and pig. 2 However, these methods are often cost-prohibitive and time-consuming, which makes the development of fast and accurate predictive methods highly desirable. Predictive methods are also required in order to rationally design new commercial chemi- cals with minimal skin permeation. 1 A number of predictive quantitative structureactivity relationships (QSARs) exist that successfully relate skin per- meation rates, log K p , to chemical structure. 2,716 If one exam- ines the QSARs that are not constrained to a single functional class (trained on more than 100 compounds), the coecient of determination of training set (r 2 ) ranges from 0.63 to 0.945. 2,710,1722 The salient properties driving existing QSAR models are hydrophobicity, reflected by octanolwater par- tition coecient (log P), 23 molecular size (or volume) and intermolecular interactions, such as hydrogen bonding. 24 As a result, many QSARs for skin permeation rate share the generic form, 10,17,22,24 log K p ¼ aðhydrophobicityÞþ bðmolecular sizeÞ þ cðinteractionsÞþ d The majority of these QSARs rely on physicochemical and structural descriptors derived from the 2D or 3D chemical structures, and many of them suer from limited applicability domains and low external predictivity. 22 The limitations of existing log K p models stem from two factors: first, the in- consistency in the measurement of K p , which depends on the carrier solvent, the mode and location of application to the skin, and biological factors; and second, the existence of numerous mechanisms by which a chemical can permeate through the epidermis, such as intracellular diusion, inter- cellular transport and follicular transport. 15,25 Herein we attempt to develop a quantitative model for log K p that does not use structure-based descriptors, but rather nuclear magnetic resonance (NMR) spectroscopic data. Such models are referred to as quantitative spectrometric data- activity relationships (QSDARs). 26 A handful of literature reports document successful attempts to integrate NMR spec- troscopic data into predictive models for physicochemical Electronic supplementary information (ESI) available. See DOI: 10.1039/c6gc00945j a Department of Chemistry, The George Washington University, Washington, DC, USA. E-mail: avoutchkova@email.gwu.edu; Fax: +202-994-5873; Tel: +202-994-6121 b Computational Biology Institute, The George Washington University, Washington, DC, USA 4468 | Green Chem. , 2016, 18, 44684474 This journal is © The Royal Society of Chemistry 2016 Published on 25 July 2016. Downloaded by George Washington University on 28/10/2016 19:31:30. View Article Online View Journal | View Issue