🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉
Download the latest version helping analytics teams accelerate timetovalue for streaming and IIOT use cases.
CLICK HERE TO DOWNLOAD
Is there a way to optimize the Cost / Benefit Matrix in Auto Model
Hello everyone, I am relatively new to the field and currently working on a university project.
I tried to experiment a bit with the Cost / Benefit Matrix of Auto Model and was able to get more precise models.
I
am now at the point where I ask myself if there is any way of
optimizing the matrix by for example calculating the distribution?
The labels are distributed with 49% in the first group, 18,5% in the second and third group and 14 % in the fourth group.
Thanks for help in advance
Best Answer

Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,625 Unicorn@Tyche I don't think so, because there can be large differences in cost that are totally independent of the distribution of classes. Think about cases like false positives vs false negatives for certain medical tests, or the cost of undetected fraud in financial institutions, etc. So I would say you should probably consult a domain expert if you want to use the cost matrix as the main model performance metric. Otherwise you can use something simpler like accuracy or ROC which does not require you to make different assumptions about the costs of different types of misclassification errors.
6
Answers
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
@Telcontar120 That's what I expected, since the domain not really has a cost assigned to the problem I was curious if there is an optimal way to configure the cost matrix based on the distribution of the attributes.