ISSN: 2055-530X International Journal of Latest Trends in Engineering, Science and Technology Volume 1, Issue 5 IJLTEST – Volume I, Issue 5 (MAY-JUNE 2014) 1 A Survey on Research Problems in Distributed Data Mining D.Lakshmi Srinivasulu Asst Prof Dept of CSE, Santhiram Engineering College, Nandyal-518501, India dlsrinivas@gmail.com B.Sunil Kumar Asst. Professor, Dept of CSE, Brindavan Institute of technology & Science, Kurnool-518218, India bangi.sunilkumar@gmail.com Dr V S Giridhar Akula Professor and Principal,Malla Reddy Institute of Engineering &Technology, Secunderabad- 501512, India akulagiri2002@yahoo.com Abstract In this paper we describe research problems in distributed mining and monitoring of intelligence data. We first review the basic architecture of such a system, and then outline research problems in multi-query optimization, online data mining, high-speed archiving, and foundations of stream computations, distributed data mining, and data mining model management. 1 Introduction The continuous developments in information and communication technology have recently led to the appearance of distributed computing environments, which comprise several, and different sources of large volumes of data and several computing units. The most prominent example of a distributed environment is the Internet, where increasingly more databases and data streams appear that deal with several areas, such as meteorology, oceanography, economy and others. In addition the Internet constitutes the communication medium for geographically distributed information systems, as for example the earth observing system of NASA (eos.gsfc.nasa.gov). Other examples of distributed environments that have been developed in the last few years are sensor networks for process monitoring and grids where a large number of computing and storage units are interconnected over a high-speed network. The global information sphere spans all participating government agencies, and mediates inter-agency collaboration. It addresses the sometimes conflicting requirements of allowing analysts from different government agencies to efficiently share information, hypotheses and evidence without violating applicable laws regarding privacy and civil rights. The requirements of the global information sphere present a number of research challenges, particularly in the areas of privacy- preserving data mining and information integration, an area in which we have a recent research focus but which is not the focus of this paper. Instead, we concentrate here on functional research problems within a local information sphere. Each local information sphere must continuously process high-speed data streams from a variety of sources, performing on-line data mining and trigger evaluation as well as supporting interactive data analysis for human analysts. Our goal is to design a distributed mining and monitoring system, which implements this vital functionality of a local information sphere. Furthermore the architecture should follow the plug-and-play paradigm in order to support easy integration of new data mining and analysis operators and extensibility with new functionality. The remainder of the paper discusses research problems in the design and