Balancing Data based on class

b00122599 Member Posts: 26 Contributor II
Hey folks,

I'm a bit lost here: I've been playing with the Sampling operators but I'm not getting anywhere. I have a record set of 150k entries with three classes; two of the classes are very small, less than 10k each. I would like to output a result with an equal number of examples from each of the three classes, so if I have 15k then I'll have 5k Class A, 5k Class B and 5k Class C. I will lose a lot of the largest class, but I want to compare all three classes in this way. Would anyone have any pointers? Thanks in advance.

Neil. 
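
For anyone landing on this thread, the kind of balancing asked about here (downsampling every class to the same size) can be sketched outside the tool in plain Python. This is only an illustration of the idea, not the operator setup that solved it in this thread: the function name `balance_by_class`, the `per_class` parameter and the toy class counts are all made up for the example.

```python
import random
from collections import Counter, defaultdict


def balance_by_class(rows, label_of, per_class=None, seed=0):
    """Downsample so that every class contributes the same number of rows.

    rows      -- list of records (any type)
    label_of  -- function returning the class label of a record
    per_class -- desired rows per class; capped at the smallest class size
    """
    # Group the records by their class label.
    groups = defaultdict(list)
    for row in rows:
        groups[label_of(row)].append(row)

    # No class can supply more rows than the smallest class holds.
    smallest = min(len(group) for group in groups.values())
    n = smallest if per_class is None else min(per_class, smallest)

    # Draw n rows without replacement from each class, then mix them.
    rng = random.Random(seed)
    balanced = []
    for label in sorted(groups):
        balanced.extend(rng.sample(groups[label], n))
    rng.shuffle(balanced)
    return balanced


# Hypothetical data: one large class and two small ones (150k rows in total).
rows = (
    [("A", i) for i in range(133_000)]
    + [("B", i) for i in range(9_000)]
    + [("C", i) for i in range(8_000)]
)
balanced = balance_by_class(rows, label_of=lambda r: r[0], per_class=5_000)
print(Counter(label for label, _ in balanced))
```

Sampling without replacement means the per-class size can never exceed the smallest class, which is why `per_class` is capped rather than honoured blindly.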

Best Answer

Answers

  • b00122599 Member Posts: 26 Contributor II
    Thanks very much, that did the trick! Neil.