학술논문

Multi-objective Clustering Ensemble for Varying Number of Clusters
Document Type
Conference
Source
2018 14th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS) SITIS Signal-Image Technology & Internet-Based Systems (SITIS), 2018 14th International Conference on. :387-395 Nov, 2018
Subject
Computing and Processing
Signal Processing and Analysis
Clustering algorithms
Partitioning algorithms
Optimization
Measurement
Linear programming
Prediction algorithms
Bonding
Clustering
Ensemble
Adjusted Rand Index
Language
Abstract
Clustering ensemble aims to obtain final clustering combining multiple diverse clustering solutions. It has already been established as an effective tool to yield a robust, accurate and stable consensus from the input clustering solutions. So far, a spectrum of approaches has already been proposed over the years to generate final ensemble from multiple solutions. One major drawback of most of the existing cluster ensemble approaches is that they require the final number of clusters as an input. In this paper, we propose a multi-objective optimization based algorithm for cluster ensemble problem that optimizes two objective functions simultaneously. The first objective is to maximize the overall similarity of the reference clustering solution to the input solutions, whereas the second objective is to minimize the standard deviation of the similarity values to avoid any bias. Moreover, in this proposed model, there is no need to supply the number of clusters a apriori to apply the algorithm, which is missing in most of the state-of-the-art approaches. The effectiveness of the proposed technique over the existing approaches is demonstrated by applying it on eight real-life datasets.