Options

RM 9.1 feedback : Auto-Model limitation

lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
edited June 2019 in Help
Hi,

I work with a dataset containing 96 examples and thus I can't use Auto-Model because the new min number of examples is 100 !
Is there any reason to this new limitation ?


Regards,

Lionel
Tagged:

Best Answers

  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,507 RM Data Scientist
    Solution Accepted
    Hi @lionelderkrikor ,
    i guess the answer is that the new features would overfit too much? @IngoRM ?
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Options
    IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Solution Accepted
    Hi,
    Yes, indeed.  Plus we changed the validation approach a bit (see some of the other threads in the community - I will post answers there soon as well) to get to more robust estimations.  This unfortunately meant that we need more data for the validation part of the models which required to increase the limit from the 50 rows to 100. 
    We have looked into the statistics and it seemed that less than 3% of all AM runs have been on data sets of less than 100 rows and while we are sorry that we had to increase the limit (making the life harder for those 3% of the runs) we still believe that the improvements in validation and the addition of feature engineering justified this decision.
    Again, sorry for the inconvenience & best,
    Ingo

Answers

Sign In or Register to comment.