
Balancing Data Problem: I got zero results

yogipajarillo Member Posts: 4 Contributor I
edited December 2018 in Help

Hi! I am using the credit card fraud dataset from Kaggle. I just want to balance the Class column (where 0 = no fraud and 1 = fraud).

imbalanced dataset spreadsheet.png

 

It's highly imbalanced:

imbalanced dataset bar chart.png

 

I tried this method:

sampling.png

But it got me nowhere:

result.png

 

What am I doing wrong?

Answers

  • lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn

    Hi @yogipajarillo,

     

    For me, it works with the following settings (chosen arbitrarily):

    Balanced_1.png

    and then the statistics are:

    Balanced_2.png

    Regards,

     

    Lionel

  • yogipajarillo Member Posts: 4 Contributor I

    Thank you @earmijo and @lionelderkrikor! I could swear I specified the Sample Size Per Class the first time around, but in any case, I tried again now and the process worked. If you guys had not replied, I would have abandoned the problem and moved on to looking for a different method. Thank you again for your time!

    result.png
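
For readers who want to see the idea outside RapidMiner, here is a minimal Python sketch of what specifying a sample size per class amounts to: draw a fixed number of rows from each class, so the result is balanced. This is an illustration only, not RapidMiner's implementation. The function name `sample_per_class`, its parameters, and the toy rows (990 of class 0, 10 of class 1) are all hypothetical stand-ins and not the Kaggle data.

```python
import random
from collections import Counter, defaultdict


def sample_per_class(rows, label_key, size_per_class, seed=0):
    """Draw a fixed number of rows from each class, without replacement.

    Hypothetical helper for illustration; not a RapidMiner API.
    """
    rng = random.Random(seed)

    # Group the rows by their class label.
    by_class = defaultdict(list)
    for row in rows:
        by_class[row[label_key]].append(row)

    sampled = []
    for label, size in size_per_class.items():
        pool = by_class.get(label, [])
        # A class cannot contribute more rows than it has.
        sampled.extend(rng.sample(pool, min(size, len(pool))))

    rng.shuffle(sampled)  # Avoid leaving the rows grouped by class.
    return sampled


# Toy stand-in for an imbalanced dataset (assumed values, not real data).
rows = [{"Amount": i, "Class": 0} for i in range(990)]
rows += [{"Amount": i, "Class": 1} for i in range(10)]

balanced = sample_per_class(rows, "Class", {0: 10, 1: 10})
counts = Counter(row["Class"] for row in balanced)
print(counts[0], counts[1])
```

If a class is left out of `size_per_class` in this sketch, no rows are drawn from it, so a missing or empty per-class setting yields few or no rows. Whether that matches what happened in the original process is not something the thread confirms.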

     
