Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Learning Imbalanced Data

dragonedisondragonedison Member Posts: 17 Contributor II
edited November 2018 in Help
Dear everyone,
I would like to know if there is any learning operator in RapidMiner(either supervised or unsupervised) that is suitable for imbalanced data learning.

Thanks,
Gary

Answers

  • landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    what exactly do you mean by "suitable"? All learning algorithms can cope with imbalanced data, some better some worse. But many support weighting of examples so that you can even the odds. There are also many sampling operators, that might help to train on an imbalanced data set.

    Greetings,
      Sebastian
  • dragoljubdragoljub Member Posts: 241 Contributor II
    I am also running into this problem, however I have had limited success with weighting the fewer example class with larger importance.

    I find that sub-sampling the larger class generally helps.

    I am trying to use LibSVM with the class weights, however they do not seem to do what is expected. How is the class weighting implemented in RM for LibSVM? It does not seem to be a standard option in the LibSVM C package?

    Thanks,
    -Gagi
Sign In or Register to comment.