🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉

Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.


Provide bias to regression model to reduce total cost of mistakes

HaukeVHaukeV Member Posts: 5 Newbie

I am running a linear regression model to predict a monetary sum. If I miss the target by more than 15% however there is a fine imposed which is dependent on the direction in which I miss my target. So if I underestimate values by 15% or more, I have to pay a €500 fine, if I overestimate values by 15% or more, I have to pay a €1000 fine. Currently my model correctly predcits 3/4ths of the database. 
However I this leaves me with a significant cost, which is equally distributed on both predicting too high and predicting too low. I would like to modify the model to take this price differential into account to reduce the cost of the misses. This means I would rather undershoot than overshoot my target since that is cheaper.

Currently I have two columns generated, which define if I overshoot (true,False) or undershoot (true, false) the target by 15% or more. Is there any way I could do this? I know this will probably reduce the accuracy of the model, but getting total cost down is important from a business point of view.

kind regards,


  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,053  RM Data Scientist
    Hi @HaukeV ,
    you probably want to use thresholds for your classification model? Like if confidence(overshoot) needs to be only 30% and not 50% to call it an overshoot? that would be accomplished by Create and Apply threshold.

    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.