Options

"Clustering data"

mskhmskh Member Posts: 13 Learner I
edited May 2019 in Help
Hi,
I calculate standard deviation and average of my data set. I want to cluster my data set into 3 clusters which cluster_0 consists of data between 0 to average, cluster_1 between average and average+standard deviation and cluster_3 consists of data between 2*standard deviation to maximum value. which clustering technique i should use?
Thanks
Tagged:

Answers

  • Options
    Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    If you have discrete cluster identities in mind, then this really isn't an application of clustering. Clustering techniques are generally non-deterministic and unsupervised ML algorithms.
    But you can easily code your "clusters" manually in RapidMiner using some if/then logic within Generate Attributes, or Discretize by User Specification.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
Sign In or Register to comment.