🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉
Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.
Random Forest (and other Trees) producing total bias result
Hello! I am predicted whether or not an example will be LOW and HIGH, based on several attributes.
The challenge I am seeing is that the classifier is always choosing to predict everyone as HIGH.
I have seen this before in other programs when the model is bias - sometimes based on a variable or algorithm setting. I have checked these out and can't find the issue.
Does anyone have experience with getting a total bias on the prediction and what might be causing it.... checked with other data sets and the same result is produced. Thanks!
UPDATE: I opened a new project, used the Titanic Training dataset directly into a default RFC and it predicts all examples will not survive. So it appears something is happening outside of the data or settings themselves...