🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉
Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.
"Subspace Clustering on Binary Attributes."
I am a beginner level professional in data mining and new to the topic of subspace clustering. I have a sample dataset which contains observations in terms of purchase orders and columns in terms of binary attributes (1/0) related to customization of same type of product.
The objective is to find whether there are any clusters present in this data. One of the approach is to use a PCA to convert binary to numerical scores and use these as input to k-means iterations.
However, I was trying to check if using hierarchical clustering on this data helps. I have used Jaccard dissimilarity metric and then dendrogram to find out the clusters. It seems no clear structure is present in the data, which the dendrogram containing few isolated clusters. This analysis was done in base R.
Later I came to know about subspace clustering. I am currently trying out an iteration in RapidMiner using subspace plugins, to be precise using the CLIQUE algorithm. However, it is being over an hour and no results have been obtained yet. I have set the tau and xi parameters as 0.1 and 2 respectively, which seem to be correct given the nature of dataset.
Would request comments/suggestions on improving the above situation. I am not sure on how the output of CLIQUE looks in RapidMiner, so would also appreciate some leads on this topic as well.