DOI Number : 10.5614/itbj.eng.sci.2006.38.2.1
Hits : 19

Exploiting Homogeneity of Density in Incremental Hierarchical Clustering

Dwi H. Widiyantoro1

1School of Electrical Engineering and Informatics, Institute of Technology Bandung

Bandung, 40132  INDONESIA

dwi@if.itb.ac.id


Abstract. Hierarchical clustering is an important tool in many applications. As it involves a large data set that proliferates over time, reclustering the data set periodically is not an efficient process. Therefore, the ability to incorporate a new data set incrementally into an existing hierarchy becomes increasingly demanding. This article describes Homogen, a system that employs a new algorithm for generating a hierarchy of concepts and clusters incrementally from a stream of observations. The system aims to construct a hierarchy that satisfies the homogeneity and the monotonicity properties. Working in a bottom-up fashion, a new observation is placed in the hierarchy and a sequence of hierarchy restructuring processes is performed only in regions that have been affected by the presence of the new observation. Additionally, it combines multiple restructuring techniques that address different restructuring objectives to get a synergistic effect. The system has been tested on a variety of domains including structured and unstructured data sets. The experimental results reveal that the system is able to construct a concept hierarchy that is consistent regardless of the input data order and whose quality is comparable to the quality of those produced by non incremental clustering algorithms.

Keywords: Clustering; Conceptual Clustering; Incremental Hierarchical Clustering.

Download Article
 
Bahasa Indonesia | English
 
 
 

Notification:

Begin on 10 October 2014 this website is no longer activated for article process in Journal of Mathematical and Fundamental Sciences, Journal of Engineering and Technological Sciences, Journal of ICT Research and Applications and Journal of Visual Art and Design. The next process will be proceeded under new website at http://journals.itb.ac.id.

For detail information please contact us to: journal@lppm.itb.ac.id.

 
       
       
       ITB Journal Visitor Number #24274100       
       Jl. Tamansari 64, Bandung 40116, Indonesia Visitor IP Address #       
       Tel : +62-22-250 1759 ext. 121 © 2011 Institut Teknologi Bandung       
       Fax : +62-22-250 4010, +62-22-251 1215 XHTML + CSS + RSS       
       E-mail : journal@lppm.itb.ac.id or proceedings@lppm.itb.ac.id Developed by AVE