Contents lists available at ScienceDirect Food Research International journal homepage: www.elsevier.com/locate/foodres Combinations of graph invariants and attributes of simplied molecular input-line entry system (SMILES) to build up models for sweetness P.G.R. Achary a, , A.P. Toropova b , A.A. Toropov b a Department of Chemistry, Institute of Technical Education and Research (ITER), Siksha OAnusandhan deemed to be University, Bhubaneswar, Odisha 751030, India b Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di RicercheFarmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milan, Italy ARTICLE INFO Keywords: Sweetness potential QSAR OECD principles Monte Carlo method Index of Idelaity of correlation CORAL software ABSTRACT The quantitative structure activity relationships (QSARs) for sweetness value (log S) were built with a dataset of 315 molecules; following a novel criterion of Index of Ideality of Correlation(IIC)This criterion of IIC is available in the latest version of the CORAL software (www.insilico.eu/coral). The descriptor used in the model building for log S is a hybrid optimal descriptor; obtained by combining the two descriptors: (i) molecular graph based descriptor derived from correlation weights of molecular features and (ii) descriptor derived from the simplied molecular input-line entry system (SMILES) code of sweetener molecule. The data set of 315 mole- cules was divided into four random splits. The four QSAR models which were build for log S using the criterion of IIC were compared with four similar models built traditional protocoldescribed elsewhere. The comparison revealed that the models built using IIc were better with statistical performance. 1. Introduction Today, type 2 diabetes, cardiovascular diseases and obesity are becoming common diseases globally (Lustig, Schmidt, & Brindis, 2012). The well accepted reason for the increase in the number of patients in the above diseases is the change in lifestyle and the consumption of high calorie food. One of the major steps to minimize the risk and to decrease the calorie intake a good number of low-calorie sweeteners are coming in the market and becoming a part of the life style. The growing market of low-calorie sweeteners is not only due to the above patients but also huge consumption of people (non-patients) working in the philosophy of Prevention is better than cure. The sweeteners synthe- sized chemically or extracted from natural products are subject of dis- cussion as far as health and safety is concerned (Bassoli, Borgonovo, & Morini, 2011; Behrens, Meyerhof, Hellfritsch, & Hofmann, 2011; DuBois & Prakash, 2012). Literature review on SweetenersDBreveal that 316 sweeteners have the sweetness value(S) in between 0.200 and 2.25 × 10 5 , the denition of relative sweetness is presented in the data section. Due to wide distribution of the sweetness values it is better expressed as logarithm of sweetness value(log S).The sweetness potential is an im- portant data for chemistry, psychology, pharmacology, biochemistry, and the food industry (Chéron, Casciuc, Golebiowski, Antonczak, & Fiorucci, 2017).The denition of SP is not fast and cheap. Hence, the search for computational methods to estimate SP is actually a vital task. Quantitative structure activity relationships (QSARs) are a tool to build up predictive models of SP for substances, which were not studied experimentally (Goel, Gajula, Gupta, & Rai, 2018; Zhong, Chong, Nie, Yan, & Yuan, 2013).Among the various tools to design predictive models (QSAR) for dierent endpoints of various substances CORAL software is known for its uniqueness (Amata et al., 2017; Choi, Trinh, Yoon, Kim, & Byun, 2019; Fioressi, Bacelo, Rojas, Aranda, & Duchowicz, 2019; Ghaedi, 2015; Islam & Pillay, 2016; Rescina et al., 2017; Toropov, Achary, & Toropova, 2016; Toropov et al., 2019; Toropov & Toropova, 2018; Toropova & Toropov, 2019; Toropova, Toropov, Begum, & Achary, 2018; Velázquez-Libera, Caballero, Toropova, & Toropov, 2019).Thus, with a motivation that the CORAL software can also give good models for the sweetness value (log S); dierent possibilities, functionalities uniqueness available in CORAL was explored in this communication. Recently, a good number of papers used Index of Ideality of Correlation(IIC)as a novel criterion to build best predictive QSAR models (Toropov, Carbó-Dorca, & Toropova, 2018; Toropov & Toropova, 2017, 2018; Toropova & Toropov, 2017b, 2018). Factually, the majority of criteria of predictive ability QSPR/or QSAR is based o the distribution of dots in the plot experimental vs. calculated values of an endpoint. However, the IIC is a new approach which considers the use to both correlation coecient as well as the distribution of the https://doi.org/10.1016/j.foodres.2019.03.067 Received 1 January 2019; Received in revised form 9 March 2019; Accepted 28 March 2019 Corresponding author. E-mail address: pgrachary@soa.ac.in (P.G.R. Achary). Food Research International 122 (2019) 40–46 Available online 29 March 2019 0963-9969/ © 2019 Published by Elsevier Ltd. T