How to find the traits of each cluster?
Hi everyone,
Thanks in advance for checking my question out and providing help if you know how!
I am developing a customer segmentation model. My question is this: after performing the clustering I get a result that shows me the various clusters. How do I find out what these clusters represent? Said differently, I'm looking to discover the traits of customers who fall within a particular cluster. For example, whether the customers in a cluster tend to be frequent purchasers with a high volume of items per transaction who like to shop on Saturdays.
Is this information available?
Thanks again!
Matt
Answers
Hi,
Sure, that is easy. You can simply turn your "cluster" attribute into a label attribute with the operator "Set Role". Afterwards you can use any of the classification or weighting algorithms to tell you what the clusters are about. Attached below is an example where we first cluster the Iris data set into 3 clusters and then learn a decision tree to describe what the clusters are about.
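For readers working outside RapidMiner, the same idea can be roughly sketched in Python with scikit-learn. This is not the attached RapidMiner process, only an illustration under assumptions: k-means stands in for the clustering step, the cluster id is used as the label (the equivalent of "Set Role"), and a shallow decision tree describes the clusters. The choices `max_depth=3` and `random_state=0` are arbitrary.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
X, feature_names = iris.data, list(iris.feature_names)

# Step 1: cluster the Iris data into 3 clusters (k-means is an assumption here).
clusters = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# Step 2: treat the cluster id as the label and learn a shallow tree on it.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, clusters)

# Step 3: print the tree as readable rules; each leaf names the cluster it describes.
rules = export_text(tree, feature_names=feature_names)
print(rules)
```

Each root-to-leaf path in the printed rules reads as a description of a cluster in terms of the original attributes.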
Extra cool: You can even combine this with the new operator "Get Decision Tree Path" to enrich each data point with the explanation of why exactly it landed in this cluster. Check out this extended process here:
You can find a link which describes how to import those XML files in my footer below.
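As a rough analogue of enriching each data point with its tree path, here is a scikit-learn sketch. It is not the "Get Decision Tree Path" operator itself; the helper `explain_row` is a hypothetical name introduced for illustration, and the clustering and tree settings are assumptions.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

iris = load_iris()
X, feature_names = iris.data, list(iris.feature_names)
clusters = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, clusters)


def explain_row(row):
    """Hypothetical helper: list the split conditions a row satisfies on its way to a leaf."""
    # decision_path returns a sparse indicator matrix of the nodes the row visits.
    node_ids = tree.decision_path(row.reshape(1, -1)).indices
    conditions = []
    for node_id in node_ids:
        feature_idx = tree.tree_.feature[node_id]
        if feature_idx < 0:  # negative feature index marks a leaf node
            continue
        threshold = tree.tree_.threshold[node_id]
        op = "<=" if row[feature_idx] <= threshold else ">"
        conditions.append(f"{feature_names[feature_idx]} {op} {threshold:.2f}")
    return conditions


# One explanation string per data point, aligned with the cluster assignments.
paths = [" AND ".join(explain_row(row)) for row in X]
print(f"cluster_{clusters[0]} because {paths[0]}")
```

The `paths` list can be attached to the data as an extra column next to the cluster id.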
Hope this helps,
Ingo
Hi,
My personal favourite is to do what Ingo proposed in a 1-vs-All fashion. This way you get the answer to the question: what makes cluster_x different from the other clusters?
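A minimal scikit-learn sketch of the 1-vs-All variant, again under assumptions (k-means for the clustering, a depth-2 tree per cluster, Iris as example data): for each cluster, the label becomes "in this cluster" versus "in any other cluster", and a separate small tree is learned on that binary label.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
X, feature_names = iris.data, list(iris.feature_names)
clusters = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

one_vs_all_rules = {}
for cluster_id in sorted(set(clusters.tolist())):
    # Binary label: 1 if the row belongs to this cluster, 0 for all other clusters.
    is_member = (clusters == cluster_id).astype(int)
    tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, is_member)
    one_vs_all_rules[cluster_id] = export_text(tree, feature_names=feature_names)
    print(f"cluster_{cluster_id} vs. rest")
    print(one_vs_all_rules[cluster_id])
```

Each printed tree focuses only on what separates one cluster from everything else, which tends to give shorter rules than a single multi-class tree.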
Best,
Martin
Dortmund, Germany