k-means clustering how to indicate what attributes will be used?

caryknoopcaryknoop Member Posts: 4 Contributor I
edited November 2018 in Help

How do you indicate what columns to use for k-means clustering. 

 

Assume you have the following attributes:  BadlyDistributedID,  IdentifyingName, Col1, Col2, Col3.

 

Only Col1, Col2 and Col3 should be considered, how to do this?

 

 

Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 2,131  RM Data Scientist

    Hi caryknoop,

     

    for all learning task, all regular attributes will be used. Simply set the role of your Name and ID to something using Set Role. You can type anything except regular there. Roles need to be unique.


    ~Martin

    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
    andreas_camenzi
  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,230   Unicorn

    You can also use a "select attributes" operator if you want to limit clustering to only a subset of the regular attributes in the file, just place it immediately before your clustering operator.

     

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
  • andreas_camenziandreas_camenzi Member Posts: 1 Contributor I

    Hi Martin

     

    Thank you for your help.

     

    Do you also know a solution if the attributes are more than 7. We want to display a chart with 2 attributes but need the information of around 10 additional attributes.

     

    It is only possible to assign 1 additional attribute per role.

     

    Thanks & best regards

    Andreas

     

    If we just want to include 

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 2,131  RM Data Scientist

    Hi,

     

    you can simply type anything you want instead of "label","id" etc. just type att1, att2, att3 for the desired role.

     

     ~Martin

    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.