Checking for repeating ID?

dragoljub Member Posts: 241 Contributor II
edited November 2018 in Help
Hi Guys,

Sometimes I create an ID attribute from part numbers, x/y locations, etc., and if a sample ever repeats in my large data set, it would be nice to see that. I am surprised that RM lets the ID attribute contain repeating values.

What is the quickest way to check if there are any repeating values (not necessarily nominal) for a particular attribute?

It would be cool if there was a meta data column for that.

-Gagi

Answers

  • land RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi Gagi,
    in fact, there is a meta data value for this: take a look at the mode, which is the most frequent nominal value. If the mode occurs just once, then every value is unique.

    Greetings,
      Sebastian
  • dragoljub Member Posts: 241 Contributor II
    I guess my problem is my ID is an integer but stored as a real.

    Will mode only be computed for nominal attributes?

    Thanks,
    -Gagi
  • land RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi Gagi,
    that's true: the mode is only calculated for nominal data. You could use the Numerical to Polynominal operator to convert the attribute.

    Greetings,
      Sebastian
  • dragoljub Member Posts: 241 Contributor II
    Ahh now that makes sense. Thanks!

    -Gagi  ;D
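
For anyone who wants to run the same check outside RapidMiner, here is a minimal Python sketch of the idea from the answers above: turn each numeric ID into a nominal label, then look at how often the most frequent label occurs. It is not RapidMiner code, and the function names (nominal_label, mode_count, repeated_ids) and the sample IDs are made up for illustration.

```python
from collections import Counter


def nominal_label(value):
    """Render a value as a nominal label; integer-valued reals drop the '.0'."""
    if isinstance(value, float) and value.is_integer():
        return str(int(value))
    return str(value)


def mode_count(values):
    """Return (mode, count) for the most frequent label, or (None, 0) if empty."""
    counts = Counter(nominal_label(v) for v in values)
    if not counts:
        return None, 0
    return counts.most_common(1)[0]


def repeated_ids(values):
    """Return {label: count} for every label that occurs more than once."""
    counts = Counter(nominal_label(v) for v in values)
    return {label: n for label, n in counts.items() if n > 1}


if __name__ == "__main__":
    ids = [1001.0, 1002.0, 1003.0, 1002.0]  # hypothetical IDs stored as reals
    mode, count = mode_count(ids)
    print(mode, count)        # a count of 1 would mean every ID is unique
    print(repeated_ids(ids))  # lists only the IDs that repeat
```

If the count returned by mode_count is 1, no ID repeats; otherwise repeated_ids shows which ones do and how often.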