Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
K means group centroid and visualisation options
I am running a k means clustering in v6.0.008.
I am looking to visualise the results of the clustering as shown here (k means clustering graph): http://en.wikipedia.org/wiki/K-means_clustering#mediaviewer/File:ClusterAnalysis_Mouse.svg
Any suggestions on how to achieve this? I would be happy to use PCA before K Means clustering if that helps.
Also, as an aside, where is the 'cluster centroid' or the mean for each cluster? I have the centroids for each attribute in each cluster in the Cluster Model - cetroid table, but cannot find the cluster mean.
Thanks
I am looking to visualise the results of the clustering as shown here (k means clustering graph): http://en.wikipedia.org/wiki/K-means_clustering#mediaviewer/File:ClusterAnalysis_Mouse.svg
Any suggestions on how to achieve this? I would be happy to use PCA before K Means clustering if that helps.
Also, as an aside, where is the 'cluster centroid' or the mean for each cluster? I have the centroids for each attribute in each cluster in the Cluster Model - cetroid table, but cannot find the cluster mean.
Thanks
Tagged:
0
Answers
I used the following process to import the mouse data taken from here: http://elki.dbs.ifi.lmu.de/wiki/DataSets You can then simply use the Chart tab of the results to visualize this.
I'm not sure regarding your bonus question, I don't think there is an explicit option to see that, but I may be wrong there.
Regards,
Marco
First regarding the centeroids. If you take a look at the model itself, it has an "centeroid table" tab. There you can find your centeroids.
Furthermore there is a way to display the "boarders" of the cluster. Therfore you apply the clustering on random values in a given range. The result is the picture below:
I modified marco's process a bit so it creates this picture and connected the model:
Dortmund, Germany
what about a deviation plot? This way you could show in which attributes the cluster differ.
That would look like this for the sonar data set:
I would recommend the local normalization option
Edit: There is a similar plot for the centeroids in the model..
Dortmund, Germany
Dortmund, Germany
This table contains only values for each variable, not the mean group centroid - the mean group centroid is the value I am interested in. Any suggestions?