Intelligent decision
support for protein
crystal growth
by I. Jurisica
P. Rogers
J. I. Glasgow
S. Fortier
J. R. Luft
J. R. Wolfley
M. A. Bianca
D. R. Weeks
G. T. DeTitta
Current structural genomics projects are likely
to produce hundreds of proteins a year for
structural analysis. The primary goal of our
research is to speed up the process of crystal
growth for proteins in order to enable the
determination of protein structure using single
crystal X-ray diffraction. We describe Max, a
working prototype that includes a high-
throughput crystallization and evaluation setup in
the wet laboratory and an intelligent software
system in the computer laboratory. A robotic
setup for crystal growth is able to prepare and
evaluate over 40 thousand crystallization
experiments a day. Images of the crystallization
outcomes captured with a digital camera are
processed by an image-analysis component that
uses the two-dimensional Fourier transform to
perform automated classification of the
experiment outcome. An information repository
component, which stores the data obtained from
crystallization experiments, was designed with an
emphasis on correctness, completeness, and
reproducibility. A case-based reasoning
component provides support for the design of
crystal growth experiments by retrieving previous
similar cases, and then adapting these in order
to create a solution for the problem at hand.
While work on Max is still in progress, we report
here on the implementation status of its
components, discuss how our work relates to
other research, and describe our plans for the
future.
P
roteins are involved in every biochemical pro-
cess that maintains life in a living organism. One
of the fundamental challenges of modern molecu-
lar biology is discovering the laws that control how
proteins evolve their three-dimensional structure.
Through an increased understanding of protein
structure we can gain insight into the functions of
these important molecules. Currently, the most pow-
erful method for determining protein structure is sin-
gle crystal X-ray diffraction.
A crystallography experiment begins with a crystal
that ideally diffracts X-rays to high resolution, i.e.,
it produces a high-quality diffraction pattern that re-
veals the crystal’s internal order. Crystals are reg-
ular, repeating arrays of atoms or molecules in three-
dimensional space. The basic building block of a crys-
tal is called a unit cell, the smallest unit of a lattice de-
fined by three axes and the three angles between them.
In order for a protein crystal to diffract at high reso-
lution, it should not have large unit cell dimensions.
Determining protein structure is often limited by the
difficulty of growing crystals suitable for diffraction.
This is partially due to the large number of param-
eters affecting the crystallization outcome (e.g., pu-
rity of proteins, intrinsic physico-chemical, biochem-
ical, biophysical, and biological parameters) and the
unknown dependencies between the variation of
these parameters and the propensity of a given mac-
romolecule to crystallize. The primary goal of the
research described in this paper is to develop a com-
prehensive repository of data from crystal growth ex-
periments (both successful and unsuccessful) and ap-
ply this knowledge in an intelligent decision-support
system for planning novel experiments.
Copyright 2001 by International Business Machines Corpora-
tion. Copying in printed form for private use is permitted with-
out payment of royalty provided that (1) each reproduction is done
without alteration and (2) the Journal reference and IBM copy-
right notice are included on the first page. The title and abstract,
but no other portions, of this paper may be copied or distributed
royalty free without further permission by computer-based and
other information-service systems. Permission to republish any
other portion of this paper must be obtained from the Editor.
JURISICA ET AL. 0018-8670/01/$5.00 © 2001 IBM IBM SYSTEMS JOURNAL, VOL 40, NO 2, 2001
394