Data Min Knowl Disc (2007) 15:349–381 DOI 10.1007/s10618-007-0073-y Tree-Traversing Ant Algorithm for term clustering based on featureless similarities Wilson Wong · Wei Liu · Mohammed Bennamoun Received: 20 February 2007 / Accepted: 13 April 2007 / Published online: 8 June 2007 Springer Science+Business Media, LLC 2007 Abstract Many conventional methods for concepts formation in ontology learning have relied on the use of predefined templates and rules, and static resources such as WordNet. Such approaches are not scalable, difficult to port between different domains and incapable of handling knowledge fluctuations. Their results are far from desirable, either. In this paper, we propose a new ant- based clustering algorithm, Tree-Traversing Ant (TTA), for concepts formation as part of an ontology learning system. With the help of Normalized Google Dis- tance (NGD) and n of Wikipedia (n W) as measures for similarity and distance between terms, we attempt to achieve an adaptable clustering method that is highly scalable and portable across domains. Evaluations with an seven datasets show promising results with an average lexical overlap of 97% and ontological improvement of 48%. At the same time, the evaluations demonstrated several advantages that are not simultaneously present in standard ant-based and other conventional clustering methods. Keywords Ontology learning · Text mining · Term clustering · Concept discovery · Cluster analysis · Featureless similarity measures Responsible editor: M. J. Zaki. W. Wong · W. Liu (B ) · M. Bennamoun School of Computer Science and Software Engineering, University of Western Australia, 35 Stirling Highway, Crawley, WA 6009, Australia e-mail: wei@csse.uwa.edu.au W. Wong e-mail: wilson@csse.uwa.edu.au M. Bennamoun e-mail: bennamou@csse.uwa.edu.au