The International Conference on Computer and Mathematical Sciences 2010 29 June 2010, Yogyakarta, Indonesia 1 Statistical Analysis of the Open Source Software Projects from Sourceforge.net Andi Wahju Rahardjo Emanuel #1 , Retantyo Wardoyo *2 , Jazi Eko Istiyanto *3 , Khabib Mustofa *4 # Bachelor Informatics, Faculty of Information Technology, Maranatha Christian University Jl. Prof. Drg. Suria Sumantri 65 Bandung, INDONESIA 1 awreman@gmail.com * Department of Computer Science and Electronics, Gadjah Mada University Jl. Sekip Utara, Sleman, Yogyakarta, INDONESIA 2 rw@ugm.ac.id 3 jazi@ugm.ac.id 4 khabib@ugm.ac.id Abstract— This paper presents the statistical analysis of Open Source Software (OSS) Projects crawled from sourceforge.net portal. Among many available OSS project development portals, sourceforge.net is selected since it is considered the largest small-to-medium OSS Project development portal with total of more than 160 thousands registered projects during the period of observation (January 2010). There are total 134,549 OSS projects that are successfully recorded with more than 28 parameters such as project's name, audience, audience count, database environment, database count, description, developer, development status, development status count, number of download, filename, file size, operating system, operating system count, programming language, programming language count, review, thumb (up and down), topic, topic count, translation, translation count, user interface, and user interface count. The result of this statistical analysis should provide the snapshot of the current trends and progresses in OSS Project, especially those which are developed using development portal. KeywordsOpen Source, Open Source Software Project, statistical analysis, sourceforge.net, small-to-medium I. INTRODUCTION Open Source Software (OSS) nowadays becomes a familiar term in Information Technology and Computer Science. This “method” was initially started by academics to communicate and exchange ideas in the form of software applications in their spare time, and then it was slowly getting wider recognition as one of the mainstream movement challenging the already established Software Engineering methodology. OSS is a software development methodology that allows developers to collaboratively download, modify, and improve the source code via Internet as the medium of communication. These developers are usually a form of group called Open Source Communities and the development process is called OSS Project. The method of developing source code in OSS is mostly consisting of building new features and fixing bugs [2] which is called 'bazaar-like' compared to 'cathedral-like' in modern Software Engineering methodology [12]. Interestingly, this seemingly chaotic and 'bazaar-like' development strategy is finally proving itself to be able to produce high quality software applications such as Apache web server, Linux Operating System, Mozilla web browser, etc. Since the development of software in OSS seems to be chaotic in nature, the existence of OSS development portals may eliminate some of the problems. One of the largest OSS development portals is sourceforge.net which hosted more than 226K projects (observed in April 2010). This portal provides free web hosting space with many integrated tools that are useful to the project initiators and interested developers for developing their project collaboratively such as CVS for version control, bug tracking, mailing list, wiki, etc. This paper presents statistical analysis of OSS Projects hosted in sourceforge.net which was crawled during the month of January 2010 with total 134,549 out of 160,141 registered OSS projects. This statistical analysis should provide snapshot of the current trends and progresses of OSS Projects that is useful for future research in this area. II. THEORETICAL BACKGROUND ON OSS PROJECTS Open Source Software (OSS) is software developed using a methodology based on several distinct characteristics not found in commercial / proprietary software: The source code of the application is freely available for everybody to download, improve and [12]. People who contribute to the development of the Open Source projects is usually forming a group called Open Source Communities which is completely voluntary [2][3]. The primary concern of the developers in Open Source Software Projects are building features and fixing bugs [5]. During the development of OSS, the developers and communities depend on some collaborative tools. Some of these tools are CVS (Concurrent Version System) which is used as the repository and version control of the source code, Bug Database software such as Bugzilla or TRAC which is used to keep track of all the bugs during development, TODO List / Wish List to 978-967-5741-01-2 © 2010