
New to AutoModel, trying to use Clustering

tonyboy9 Member Posts: 113 Contributor II
edited August 2020 in Help
I see both k-Means and x-Means in my screenshot. How do I experiment with k-Means and x-Means? After I click Run, how will I know whether there is an optimal number for k?

Answers

  • jacobcybulski Member, University Professor Posts: 391 Unicorn
    edited August 2020
    Auto Model in general does not optimise k for k-Means; you need to specify the number of clusters yourself (the screen you attached shows the field for entering k). x-Means automatically selects the best k within a range, which in your case (the default) is between 2 and 20. You can see which number of clusters was selected by checking the x-Means summary report. Before running the clustering you can also request automatic feature selection, which is likely to improve clustering by removing highly correlated or linearly dependent attributes (this takes some time on large data sets). In general, you'd be better off doing the data pre-processing yourself, e.g. selecting independent attributes and normalising them before k-Means. Jacob
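
For anyone who wants to see the idea outside RapidMiner, here is a rough sketch in plain Python of "normalise first, then try a range of k and keep the best one". It is not how Auto Model or x-Means work internally: it scores each k with the mean silhouette width as a stand-in selection criterion, and it uses a simple farthest-first initialisation so the result is deterministic. The function names (`zscore`, `kmeans`, `silhouette`, `best_k`) are made up for this illustration; only the 2 to 20 range comes from the default mentioned above.

```python
import math
from statistics import mean, pstdev


def zscore(rows):
    """Normalise each attribute (column) to mean 0 and standard deviation 1."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [pstdev(c) or 1.0 for c in cols]  # constant columns: avoid dividing by 0
    return [tuple((v - m) / s for v, m, s in zip(r, mus, sds)) for r in rows]


def kmeans(rows, k, iters=100):
    """Lloyd's k-means with farthest-first initialisation; returns one label per row."""
    centres = [rows[0]]
    while len(centres) < k:
        # next centre: the row farthest from all centres chosen so far
        centres.append(max(rows, key=lambda r: min(math.dist(r, c) for c in centres)))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda j: math.dist(r, centres[j])) for r in rows]
        if new_labels == labels:  # assignments stopped changing: converged
            break
        labels = new_labels
        for j in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            if members:  # keep the old centre if a cluster went empty
                centres[j] = tuple(mean(c) for c in zip(*members))
    return labels


def silhouette(rows, labels):
    """Mean silhouette width; higher means tighter, better-separated clusters."""
    clusters = {}
    for r, lab in zip(rows, labels):
        clusters.setdefault(lab, []).append(r)
    if len(clusters) < 2:
        return -1.0
    scores = []
    for r, lab in zip(rows, labels):
        own = clusters[lab]
        if len(own) == 1:  # singleton clusters score 0 by convention
            scores.append(0.0)
            continue
        a = sum(math.dist(r, o) for o in own) / (len(own) - 1)
        b = min(mean(math.dist(r, o) for o in other)
                for key, other in clusters.items() if key != lab)
        scores.append((b - a) / (max(a, b) or 1.0))
    return mean(scores)


def best_k(rows, k_min=2, k_max=20):
    """Try every k in [k_min, k_max] on normalised data; return (best k, all scores)."""
    data = zscore(rows)
    k_max = min(k_max, len(data) - 1)  # cannot ask for more clusters than rows allow
    scored = {k: silhouette(data, kmeans(data, k)) for k in range(k_min, k_max + 1)}
    return max(scored, key=scored.get), scored


# Toy data: three small, well-separated groups of points (made up for illustration)
rows = [(0, 0), (1, 0), (0, 1), (1, 1),
        (10, 0), (11, 0), (10, 1), (11, 1),
        (5, 10), (6, 10), (5, 11), (6, 11)]
k, scores = best_k(rows)
print("selected k:", k)
```

`best_k(rows)` returns the chosen k together with the score for every k it tried, so you can check how clear-cut the choice was rather than trusting a single number.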