The RapidMiner community is on read-only mode until further notice. Technical support via cases will continue to work as is. For any urgent licensing related requests from Students/Faculty members, please use the Altair academic forum here.
How do I get a higher confidence of predicting true?
Hello,
I am taking a dataset of 4000 rows of customers who bought an insurance policy and trying to find the best 1000 potential buyers of another dataset based on that first data set. I have used optimization with cross-validation and Naive Bayes inside and correctly predicted 112 potential buyers, however, I know there are still more. I have tried many different things but I end up either getting the same potential buyers or less as my confidence of true goes way down. Is there a specific operator or something to change in the optimization process that may get me better confidence or higher sensitivity for true when predicting this?
Thanks
I am taking a dataset of 4000 rows of customers who bought an insurance policy and trying to find the best 1000 potential buyers of another dataset based on that first data set. I have used optimization with cross-validation and Naive Bayes inside and correctly predicted 112 potential buyers, however, I know there are still more. I have tried many different things but I end up either getting the same potential buyers or less as my confidence of true goes way down. Is there a specific operator or something to change in the optimization process that may get me better confidence or higher sensitivity for true when predicting this?
Thanks
0
Answers
The prediction is just an additional attribute created by Apply Model. If you want the 1000 most likely buyers, just sort by the confidence(True) attribute descending and filter the example range 1 to 1000. Many of these will have a prediction of False, but still a higher likelihood than the other 3000.
Otherwise, Apply Threshold belongs *after* applying the model on the test set if I understand your process correctly.
Regards,
Balázs