Learning Curves: Cost Functions

HyramHyram Member Posts: 39 Contributor II

I am trying to assess bias and variance for my classification problem. As such, I am trying to plot the cost functions J(Theta) for testing and training vs. number of examples to determine if I have high variance or high bias. How can I do this in RapidMiner? Alternatively, are there operators that let me visualise bias vs. variance?


Best Answer

  • Options
    Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Solution Accepted
    There isn't anything automatic to produce this.  You can do it somewhat manually by using the Sample operator and using different values/percentages of your total dataset and then outputting the model performance results of each of these different scenarios.   You could automate this further by using a macro for sample value and putting it into a Loop.  But there isn't any built-in operator that would easily output this all for you. 
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts


Sign In or Register to comment.