38 SEPTEMBER • OCTOBER 2002 http://computer.org/internet/ 1089-7801/02/$17.00©2002 IEEE IEEE INTERNET COMPUTING Guest Editor’s Introduction Global Deployment of Data Centers Pankaj Mehra Hewlett-Packard NonStop Labs D ata centers ﬁrst arose from IT efforts to consolidate the server and storage resources of one or more enterprises, so they could be central- ly managed and shared among various departments and functions. IT architec- tures involving multiple data centers dis- tributed around the globe exist in part due to the evolutionary nature of enterprises and, in part, by design. By effectively utilizing distributed re- sources, enterprises seek to improve their services’ scalability, availability, and responsiveness. Scalability here denotes the ability to extend the resources avail- able to a service beyond a single data cen- ter’s boundaries in order to address grow- ing user demand. High availability means ensuring that users have continuous access to a service despite faults in sys- tems, networks, hardware, or software — even when caused by natural disasters or human error. Responsiveness refers to short and predictable turnaround times on service requests; it is usually improved using affinity. A site in Singapore, for instance, might field all service requests originating in Southeast Asia. Architects of global-scale services often achieve all three objectives using load bal- ancing to distribute the task of servicing Internet requests among servers at multi- ple sites. Load-balancing algorithms dynamically adjust the number of sites, and the number of servers used at each site, in response to changes in demand. They are programmed to work around unreachable sites, and they account for affinities between Internet requests and the sites available to service them. With large enterprises continuing to develop and deploy globally distributed content, it is important to understand the ecosystem for global-scale applications. At the base, a well-understood and widely adopted tiered architecture determines how to arrange servers and routers within a data center. At the top of the ecosystem are the applications, which today primarily include Web content and ﬁrst-generation Internet services that follow the client-server para- digm, such as mail and chat. The articles in this issue discuss the two broad approach- es generally used to deploy services glob- ally across multiple data centers. In the future, most applications will likely be Web services, which involve wide-area server- to-server peering. The Internet community