Hi, I'm getting the following error when using SMOTE upsampling with decision trees:

20160041 Member Posts: 6 Newbie
Exception: java.lang.RuntimeException
Message: Cannot build average for different positive classes (At Risk/Good Standing).
Stack trace:
com.rapidminer.operator.performance.BinaryClassificationPerformance.buildSingleAverage(BinaryClassificationPerformance.java:395)
com.rapidminer.tools.math.Averagable.buildAverage(Averagable.java:137)
com.rapidminer.tools.math.AverageVector.buildAverages(AverageVector.java:131)
com.rapidminer.extension.concurrency.operator.validation.CrossValidationOperator.doWork(CrossValidationOperator.java:303)
com.rapidminer.operator.Operator.execute(Operator.java:1026)
com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:77)
com.rapidminer.operator.ExecutionUnit$2.run(ExecutionUnit.java:812)
com.rapidminer.operator.ExecutionUnit$2.run(ExecutionUnit.java:807)
java.security.AccessController.doPrivileged(Native Method)
com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:807)
com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:423)
com.rapidminer.operator.Operator.execute(Operator.java:1026)
com.rapidminer.Process.executeRoot(Process.java:1386)
com.rapidminer.Process.execute(Process.java:1327)
com.rapidminer.Process.run(Process.java:1300)
com.rapidminer.Process.run(Process.java:1186)
com.rapidminer.Process.run(Process.java:1139)
com.rapidminer.Process.run(Process.java:1134)
com.rapidminer.Process.run(Process.java:1124)
com.rapidminer.gui.ProcessThread.run(ProcessThread.java:65)

Please advise.
Thank you

Answers

  • mschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 1,979  RM Data Scientist
    Hi,
    Can you please check with a breakpoint whether the data set contains both positive and negative examples after upsampling?

    BR,
    Martin
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • 20160041 Member Posts: 6 Newbie
    No, there are no positive or negative samples. The target label is binominal and contains 2 classes, 'At Risk' and 'Good Standing'.
    Appreciate your support.
  • mschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 1,979  RM Data Scientist
    Hi,
    That's usually what is meant by "positive" and "negative". Did you check that you have enough At Risk and Good Standing examples?
    BR,
    Martin
  • 20160041 Member Posts: 6 Newbie
    The dataset consists of 2018 examples (rows)
  • 20160041 Member Posts: 6 Newbie
    Can you please let me know if there is a fix for the SMOTE operator?
  • sgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor Posts: 2,188  Community Manager
    Hi @20160041, can you please share your data set and XML? This is really the only way we can help you further. Thanks.

    Scott
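
For anyone following along, the check Martin suggests (do both classes still show up in each subset after resampling?) can be sketched outside RapidMiner. This is a minimal illustration in plain Python, not RapidMiner's API. The function names, the contiguous fold split, and the sample labels are all assumptions made up for the example, and the split does not reproduce how Cross Validation samples its folds.

```python
from collections import Counter


def class_counts_per_fold(labels, n_folds):
    """Split labels into n_folds contiguous folds and count each class per fold.

    Hypothetical helper: a plain contiguous split, used only for illustration.
    """
    if n_folds < 1:
        raise ValueError("n_folds must be at least 1")
    fold_size, remainder = divmod(len(labels), n_folds)
    counts = []
    start = 0
    for i in range(n_folds):
        # The first `remainder` folds receive one extra example.
        end = start + fold_size + (1 if i < remainder else 0)
        counts.append(Counter(labels[start:end]))
        start = end
    return counts


def folds_missing_a_class(labels, n_folds, classes):
    """Return the indices of folds in which at least one expected class never occurs."""
    return [
        i
        for i, fold_counts in enumerate(class_counts_per_fold(labels, n_folds))
        if any(fold_counts[cls] == 0 for cls in classes)
    ]


# Made-up labels for illustration only, not the poster's data.
labels = ["Good Standing"] * 8 + ["At Risk"] * 2
classes = ["At Risk", "Good Standing"]

for index, fold_counts in enumerate(class_counts_per_fold(labels, 5)):
    print(index, dict(fold_counts))
print("folds missing a class:", folds_missing_a_class(labels, 5, classes))
```

If any fold index is reported, that subset contains only one of the two label values, which is the situation the breakpoint check is meant to reveal. Whether that is what triggers the exception in this particular process cannot be confirmed without the data set and XML.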
