International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-6 Issue-3, July 2017 11 Published By: Blue Eyes Intelligence Engineering & Sciences Publication Pvt. Ltd. A Digital Resource System on HDFS Ebenezer Komla Gavua, Seth Okyere-Dankwa, Collinson Colin Agbesi Abstract: The issue of managing digital resources has become a major concern for educational and research institutions expanding with research laboratories, schools and departments. Some of these institutions have a major challenge as to how to manage huge quantities of educational resources required for research, teaching and learning. It is due to this challenge that this paper sought to design and implement a digital resource management system on the Hadoop Distributed File System (HDFS). The main model implemented in the development of the system was the incremental and iterative model. The development of the application took into consideration the user requirements and what the system sought to achieve after implementation. The system’s architecture is divided into various levels and these are the user, systems management, data storage and infrastructural levels. The development of this application was focused on the systems management level. The system integration was achieved by connecting MYSQL database server with HDFS through the utilization of sqoop. Security features were implemented on the system to protect the system from attack. The system was tested to ensure that all the modules created were communicating perfectly and the system was producing the expected results. Keywords: Digital library; Hadoop distributed file system, cloud storage. I. INTRODUCTION The management of digital resources is an important aspect of Information and Communication Technology which is currently given much consideration since almost every of aspect of modern life is controlled by information. Due to the important considerations given to institutional information, many organizations are willing to make funds and infrastructure available for the provision of quality information which can be accessed quickly and reliably. It is due to this premise, that most educational institutions have made the great effort to provide libraries for their teachers and students to promote quick access to quality information. Most educational institutions are making efforts to create digital libraries due to the flexibility provided by Information and Communication Technology. A digital library is a place where various forms of information material such as journals, monograph, visual materials, voice recorder and moving pictures can be retrieve via the Internet (Shiri,2003) Current digital libraries have well researched user interfaces, architectures that allow ease of use and permit various levels of interactivity including searching and browsing. They are aimed to help users to retrieve useful information easily and quickly (Salim et al.2008). Revised Version Manuscript Received on June 30, 2017. Ebenezer Komla Gavua, Koforidua Technical University, Ghana E- mail: mgavua@yahoo.com Seth Okyere-Dankwa, Koforidua Technical University, Ghana E-mail: Sokyeredankwa@yahoo.com Collinson Colin Agbesi, Koforidua Technical University, Ghana E- mail: Koliny3k@yahoo.com The benefits provided by these digital management systems are enormous and as such there are always opportunities to improve their performance through the adopting of current of technologies to promote scalability and interrupted operations. II. BASIC CONCEPT OF CLOUD STORAGE Cloud storage is one of the primary use of cloud computing. With the cloud storage, data is stored on multiple third party servers, rather than on the dedicated servers used in traditional networked data storage. When storing data, the user sees a virtual server that is, it appears as if the data is stored in a particular place with specific name. But that place does not exist in reality. It is just a pseudonym used to reference virtual space carved out of the cloud. In reality, the user’s data could be stored on any one or more of computers used to create the cloud (Liu et al.2009). The actual storage location may even differ from day to day or even minute to minute, as the cloud dynamically manages available storage space. But even though the location is virtual, user sees a static location for his data and can actually manage his storage space as if it were connected to his own pc. A typical cloud storage system architecture includes a master control server and several storage servers, as shown in figure 2. At its most basic level, a cloud storage system needs just one data server connected to the internet. A client sends copies of files over internet to the data server, which then records the information. Figure 1 Typical Cloud Storage System Architecture When client wishes to retrieve the information, he or she accesses the data server through a web based interface. The server then either sends the files back to the client or allows the client to access and manipulate the files on the server itself. Cloud computing is a current trend that considers the Internet as a platform providing on-demand computing and software as a service to anyone, anywhere, and at any time.