How do I get a higher confidence of predicting true?

KatL18KatL18 Member Posts: 2 Newbie
I am taking a dataset of 4000 rows of customers who bought an insurance policy and trying to find the best 1000 potential buyers of another dataset based on that first data set. I have used optimization with cross-validation and Naive Bayes inside and correctly predicted 112 potential buyers, however, I know there are still more. I have tried many different things but I end up either getting the same potential buyers or less as my confidence of true goes way down. Is there a specific operator or something to change in the optimization process that may get me better confidence or higher sensitivity for true when predicting this?



  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 788 Unicorn

    The prediction is just an additional attribute created by Apply Model. If you want the 1000 most likely buyers, just sort by the confidence(True) attribute descending and filter the example range 1 to 1000. Many of these will have a prediction of False, but still a higher likelihood than the other 3000.

    Otherwise, Apply Threshold belongs *after* applying the model on the test set if I understand your process correctly.

Sign In or Register to comment.