Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Checking for repeating ID?

dragoljubdragoljub Member Posts: 241 Contributor II
edited November 2018 in Help
Hi Guys,

Sometimes I create an ID attribute from part numbers x,y locations etc and if there is ever a repeating sample in my large data set it would be nice to see. I am surprised that RM lets the ID attribute contain repeating values.

What is the quickest way to check if there are any repeating values (not necessarily nominal) for a particular attribute?

It would be cool if there was a meta data column for that.

-Gagi

Answers

  • landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi Gagi,
    in fact there is a meta data value for this: Take a look at the mode, this is the most frequent nominal value. If this value occurs just once, then each value is unique.

    Greetings,
      Sebastian
  • dragoljubdragoljub Member Posts: 241 Contributor II
    I guess my problem is my ID is an integer but stored as a real.

    Will mode only be computed for nominal attributes?

    Thanks,
    -Gagi
  • landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi Gagi,
    that's true: Mode is only calculated for nominal data. You could use Numerical to Polynomial to convert it.

    Greetings,
      Sebastian
  • dragoljubdragoljub Member Posts: 241 Contributor II
    Ahh now that makes sense. Thanks!

    -Gagi  ;D
Sign In or Register to comment.