RM 9.1 feedback : Auto-Model limitation

lionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 948   Unicorn
edited June 2019 in Help
Hi,

I am working with a dataset containing 96 examples, so I can't use Auto-Model because the new minimum number of examples is 100!
Is there a reason for this new limitation?


Regards,

Lionel
Best Answers

  • mschmitz Posts: 2,262  RM Data Scientist
    Solution Accepted
    Hi @lionelderkrikor ,
    I guess the answer is that the new features would overfit too much? @IngoRM?
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • IngoRM Posts: 1,729  RM Founder
    Solution Accepted
    Hi,
    Yes, indeed. Plus, we changed the validation approach a bit (see some of the other threads in the community; I will post answers there soon as well) to get more robust estimates. Unfortunately, this meant that we need more data for the validation part of the models, which required increasing the limit from 50 rows to 100.
    We looked into the statistics, and it seems that fewer than 3% of all AM runs have been on data sets with fewer than 100 rows. While we are sorry that we had to increase the limit (making life harder for those 3% of runs), we still believe that the improvements in validation and the addition of feature engineering justify this decision.
    Again, sorry for the inconvenience & best,
    Ingo
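
To illustrate why a larger validation part gives more robust estimates, here is a minimal sketch using the binomial standard error of an accuracy estimate. The 40% hold-out fraction and the 0.8 accuracy are assumptions chosen for illustration only, not values confirmed in this thread, and the helper name is hypothetical.

```python
import math


def accuracy_standard_error(accuracy: float, n_validation: int) -> float:
    """Binomial standard error of an accuracy estimated on n_validation rows."""
    return math.sqrt(accuracy * (1.0 - accuracy) / n_validation)


# Assumptions for illustration only (not taken from Auto Model itself):
HOLDOUT_FRACTION = 0.4  # share of rows reserved for validation
ASSUMED_ACCURACY = 0.8  # true accuracy of the model being validated

for n_rows in (50, 100):
    n_validation = round(n_rows * HOLDOUT_FRACTION)
    se = accuracy_standard_error(ASSUMED_ACCURACY, n_validation)
    print(f"{n_rows} rows -> {n_validation} validation rows, standard error ~ {se:.3f}")
```

Under these assumptions, doubling the row limit doubles the validation rows and shrinks the standard error by a factor of about 1.41 (the square root of 2), so the reported performance fluctuates less from run to run.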
