Contents lists available at ScienceDirect Computers in Biology and Medicine journal homepage: www.elsevier.com/locate/compbiomed Renement-based modeling of the ErbB signaling pathway Bogdan Iancu a,b,1 , Usman Sanwal a,b,1 , Cristian Gratie a,b , Ion Petre a,c,d,* a Computational Biomodeling Laboratory, Turku Centre for Computer Science, Finland b Department of Computer Science, Åbo Akademi University, Finland c Department of Mathematics and Statistics, University of Turku, Finland d National Institute for Research and Development in Biological Sciences, Romania ARTICLE INFO Keywords: Computational modeling Model construction Renement ErbB signaling pathway ODE-Based models Event-B Invariant ABSTRACT The construction of large scale biological models is a laborious task, which is often addressed by adopting iterative routines for model augmentation, adding certain details to an initial high level abstraction of the biological phenomenon of interest. Retting a model at every step of its development is time consuming and computationally intensive. The concept of model renement brings about an eective alternative by providing adequate parameter values that ensure the preservation of its quantitative t at every renement step. We demonstrate this approach by constructing the largest-ever renement-based biomodel, consisting of 421 species and 928 reactions. We start from an already t, relatively small literature model whose consistency we check formally. We then construct the nal model through an algorithmic step-by-step renement procedure that ensures the preservation of the model's t. 1. Introduction Mechanistic control of cellular activity is intricate and making predictions about its system-level behavior is highly dicult. Our ability to make such predictions can be essential not only in reversing the dynamics of cellular impairment, but also in directing cellular ac- tivity towards a more favorable behavior. Mathematical modeling is essential in making such predictions, but its use as a standard procedure in the eld of practical applications is severely limited due to large numbers of parameters that are required either to be xed or estimated, see Ref. [1]. A massive number of parameters to estimate requires the avail- ability of a large volume of data and makes model tting computa- tionally intensive. For this reason, we focus on renement-based model construction as an intermediary step in the model development cycle. Stepwise renement emerged from the eld of software engineering. It was introduced at rst as a concept in parallel computing and it ex- panded quickly, giving rise to the framework of renement calculus, where it is promoted as a renement method to ensure correctness preservation, see Ref. [2]. In the eld of systems biology, model renement becomes crucial in the model development cycle. Model t is greatly aected by changes in the number of reactants, reactions, modules, etc. The entire process of model tting for considerably large models is not only a tedious task for the modeler as such, but it is computationally intensive since most parameter estimation routines take considerable time to complete and require massive amounts of computational resources. Hence, an itera- tive approach which relies on the conventional reiteration of the entire model tting procedure is not feasible for large models. As an alter- native, we consider an approach which ensures model t preservation at every renement step. The approach was discussed in the literature for rule-based models, see Refs. [3,4]. For reaction-based models with a quantitative dynamic described by ODEs, the method was referred to as quantitative model renement, see Ref. [5] and then extended and called t-preserving data renement [6]. We discuss in this paper the implementation of the largest-ever model built through model renement, describing the ErbB signaling pathway. Our renement approach is based on data renement, where a nite set of subspecies of a given species in the initial model are sub- stituted in the rened model for their corresponding parentspecies in the initial model. We started with a model of the EGFR (ErbB1) sig- naling pathway proposed in Refs. [7,8]. Throughout the paper, the model from Ref. [7] is referred to as the basic model. We rened this model to include four dierent types of receptor tyrosine kinases, ErbB1 4, structurally related to the epidermal growth factor receptor, EGFR, and two types of ligands, EGF and HRG, and we compared the computational eort needed to build it with that of [9]. We used logic- based formal methods support based on modeling with Event-B [10] to https://doi.org/10.1016/j.compbiomed.2019.01.016 Received 10 October 2018; Received in revised form 18 January 2019; Accepted 19 January 2019 * Corresponding author. Computational Biomodeling Laboratory, Turku Centre for Computer Science, Finland. E-mail address: ion.petre@utu.(I. Petre). 1 Authors with equal contribution. Computers in Biology and Medicine 106 (2019) 91–96 0010-4825/ © 2019 Elsevier Ltd. All rights reserved. T