RapidMiner 9.7 is Now Available

Lots of amazing new improvements including true version control! Learn more about what's new here.

CLICK HERE TO DOWNLOAD

Information about Generate ID attributes

navnav Member Posts: 28  Maven
edited November 2018 in Help
Hello,

I want to know how work the special attribute ID. In particular when I add this attribute to an exampleSet with the generate id attribute, is it the same ID added by the k-means clustering operator? I ask this because my problem is the follow: i made a clustering with k-means using a subset of features, but after clustering I want to explore the examples in each cluster also for the features not used for clustering. So what i done is adding an id to the original dataset and then join the clusteredSet with the original dataset by the ID, but I want to be sure that the two ID are the same in the generated one and in the clusteredSet. Thank you to all.

Answers

  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869   Unicorn
    You should add the id to the dataset (using Generate Id) before removing the features and doing the clustering. In that case, the k-Means operator won't generate an own id attribute, but use the one already present in the dataset. That way you can be sure that the clustered example set and the original dataset have consistent ids, because it's basically the *same* id attribute.

    Cheers,
    Marius
Sign In or Register to comment.