Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Checking for repeating ID?
Hi Guys,
Sometimes I create an ID attribute from part numbers x,y locations etc and if there is ever a repeating sample in my large data set it would be nice to see. I am surprised that RM lets the ID attribute contain repeating values.
What is the quickest way to check if there are any repeating values (not necessarily nominal) for a particular attribute?
It would be cool if there was a meta data column for that.
-Gagi
Sometimes I create an ID attribute from part numbers x,y locations etc and if there is ever a repeating sample in my large data set it would be nice to see. I am surprised that RM lets the ID attribute contain repeating values.
What is the quickest way to check if there are any repeating values (not necessarily nominal) for a particular attribute?
It would be cool if there was a meta data column for that.
-Gagi
0
Answers
in fact there is a meta data value for this: Take a look at the mode, this is the most frequent nominal value. If this value occurs just once, then each value is unique.
Greetings,
Sebastian
Will mode only be computed for nominal attributes?
Thanks,
-Gagi
that's true: Mode is only calculated for nominal data. You could use Numerical to Polynomial to convert it.
Greetings,
Sebastian
-Gagi ;D