Effective Search of the Energy Landscape for Protein Folding Eugene Santos Jr. 1 , Keum Joo Kim 1 , and Eunice E. Santos 2 1 University of Connecticut, Storrs, CT 06269 {eugene,keumjoo}@engr.uconn.edu, 2 Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 santos@cs.vt.edu Abstract. We propose a new algorithmic approach for global optimiza- tion in protein folding. We use the information found in various local minima to direct the search for the global minimum. In this way, we explore the energy landscape efficiently by considering only the space of local minima instead of the whole feasible space of conformations. Our fundamental approach is to sample only the space of local minima and guide the sampling process by exploring protein structure building blocks found in sampled local minima. These building blocks form the basis of information in searching for the global minimum. In particular we employ an iterative algo- rithm that begins with an initial pool of local minima; construct a new pool of solutions by combining the various building blocks found in the original pool; take each solution and map them to their representative local minima; and, re- peat the process. Our procedure seems to share a great deal of commonality with evolutionary computing techniques. Indeed, we even employ genetic operators in our algorithm. However, unlike existing hybrid evolutionary computing algo- rithms where local minimization algorithms are simply used to “fine-tune” the solutions, we focus primarily on constructing local minima from previously ex- plored minima and only use genetic operators to assist in diversification. Hence, our total number of iterations/generations were demonstrated (empirically) to be quite low (50) whereas standard genetic algorithms and Monte Carlo are very high ranging from 150,000 to nearly 20,000,000 generations in order to pro- vide sufficient opportunity for these methods to converge and achieve their best solution. We applied our idea to several proteins from the Protein Data Bank (PDB) using the UNRES model[1]. We compared against Standard Genetic Al- gorithms(SGA) and Metropolis Monte Carlo(MMC) approaches. In all cases, our new approach computed the lowest energy conformation. Procedure LMBE begin t =0; initialize P (t) with local minima; while termination condition not satisfied do begin select individuals P sub new (t) from current pool P (t); recombine structures with selected individuals P sub new (t); E. Cant´ u-Paz et al. (Eds.): GECCO 2003, LNCS 2723, pp. 642–643, 2003. c Springer-Verlag Berlin Heidelberg 2003