Proceeding of IC-ITS 2015 e-ISBN:978-967-0850-07-8 International Conference on Information Technology & Society 8-9 June 2015, Kuala Lumpur, MALAYSIA 417 Optimal Clustering Using Modified Fuzzy C-Means Clustering Algorithm Ahamed Shafeeq B M Department of Computer Science & Engineering, Manipal Institute Of Technology, ManipalUniversity, Manipal -576104, India email: ahamed.shafeeq@manipal.edu Abstract. Fuzzy clustering has been widely studied and applied in a variety of applications and areas. In hard clustering, data is divided into distinct clusters, where each data element belongs to exactly one cluster. In fuzzy clustering, data elements can belong to more than one cluster, and associated with each element is a set of membership levels. These indicate the strength of the association between that data element and a particular cluster. Fuzzy clustering is a process of assigning these membership levels, and then using them to assign data elements to one or more clusters. The investigation is needed to reveal whether the optimal number of clusters can be found on the run based on the cluster quality measure. The silhouette coefficient is the one of measure used to measure the quality of clusters. In the practical scenario, it is very difficult to fix the number of clusters in advance. In this paper we propose anoptimal clustering of data with modified Fuzzy C-Means algorithm. The proposed method works for both the cases i.e. for known number of clusters in advance as well as unknown number of clusters. The user has the flexibility either to fix the number of clusters or input the minimum number (K=2) of clusters required. In the former case it works same as Fuzzy C-means algorithm. In the latter case the algorithm computes the quality of clusters for each set of clusters. The process is repeated by incrementing the cluster counter by one in each iteration until it satisfies the validity of cluster quality. It is observed that the modified Fuzzy C-means algorithm produces quality clusters compared to the Fuzzy C-means clustering. It assigns the data point to their appropriate class or cluster more effectively. Keywords: Fuzzy C-means clustering, Cluster quality, Silhouette coefficient. INTRODUCTION A fundamental problem that frequently arises in a great variety of fields such as data mining andknowledge discovery, and pattern classification is the clustering problem [1]. Nowadays, there is an urgent need for good quality clustering algorithms to address the huge amount of data. The quality of information play a very crucial role in decision makingof policy has attracted a great deal of attention in the information industry and in society as a whole. There isvery large amount of data availability in real world and it is very difficult to excess the useful informationfrom this huge database and provide the information to which it is needed within time limit and in requiredpattern. So data mining