Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Cluster algortyms
what is the x-means and k- medoid ? And also what is the difference between k-means of this two algortym ?
Tagged:
0
Best Answer
-
rfuentealba RapidMiner Certified Analyst, Member, University Professor Posts: 568 UnicornHello,
Let's start with understanding k-Means. You set a number of clusters (k) and the algorithm determines what examples belong to that cluster by determining how far are they from that specific cluster. Then the centroids of each cluster are calculated by averaging the distances of all the examples that belong to that cluster to that cluster.
The k-Medoids algorithm is almost the same as the k-Means algorithm with one difference: the center of a cluster is moved to an example, rather than an imaginary number taken from the calculation specified above.
The x-Means algorithm is an improvement. You don't have to determine the number of clusters. Instead, someone said that there is a possibility of determining the correct number of clusters by running a quick heuristic (e.g. an algorithm that belongs to IA but not to Machine Learning). That heuristic determines how many K's are required for that specific example set, and then the algorithm is more or less the same as a k-Means.
There is a lot of n-dimensional geometry in explaining these algorithms. That is why you need to use these with numbers only.
Hope this helps.
All the best,
Rodrigo.
7
Answers
No, fuzzy clustering algorithms use a different type of function, called the "fuzzer" or "fuzzifier", to see if an algorithm belongs to certain cluster or not. While the idea of clustering remains the same, fuzzy clustering uses similarity, intensity and distance as the three stooges main points of analysis, and one example can potentially (though not commonly) belong to more than one cluster. That isn't possible with k-Means, k-Medoids and X-Means, because these are "hard labeled".
Fuzzy C means is available in the "Information Selection" plugin for RapidMiner. It's not part of the standard RapidMiner, BTW.
All the best,
Rodrigo.
kind regards
---------------------
Are you asking about taking the sum of volume column based on the cluster number? If so, you can use the Aggregate operator and group by based on cluster column.
If this is not the answer you are looking for. please explain a bit more about your requirement.
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
I have the same questions as Varun has. I will work on your problem tomorrow, I promise.
Rodrigo.
firtsly ı need say that ı am doing clustering with 4 attribute which include "volume"
and ı am doing this clustering in the warehouse(storage) so volume is very important for me so when ı cluster to items sum of each clusters has to be equal(volume) . if ı have to give an example
.in this data ı wanna that cluster 1 gonna be = 1-3-5 cluster 2 gonna be =2-4-6 because sum of volume of every cluster same that it is 60 .ı hope u got what ı mean .ıf u dont pls say it to me.ı am waiting for ur answer .
- item no volume
1 102 15
3 20
4 25
5 30
6 20
Kind Regards,