RapidMiner 9.7 is Now Available

Lots of amazing new improvements including true version control! Learn more about what's new here.

CLICK HERE TO DOWNLOAD

Rank examples by attribute including duplicates

dandresdandres Member Posts: 2 Contributor I
edited November 2018 in Help
Is there an operator or combination of operators that will compute the rank of each example for a selected attribute?

The Sort/Generate ID method won't work here because some of the examples have the same value and thus should have the same rank.  The preference for those "ties" is to compute the lowest rank, not the average rank.

For example:

Value  Rank
100     1
98       2
97       3
97       3
96       5

Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 2,463  RM Data Scientist
    Hi,

    i don't know a one operator solution.

    You can go for Sort, Generate ID and then use Aggregate and Group By your Value coloumn to calculate min(Id) and average(Value). Should work quite fine.


    Cheers,

    Martin
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • dandresdandres Member Posts: 2 Contributor I
    Thanks, Martin.

    I should have mentioned that I also need to maintain unique rows.  I guess I could use your method and then add one more step to Join back the original by value.
Sign In or Register to comment.