The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

RM 9.1 feedback : Auto-Model limitation

lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
edited June 2019 in Help
Hi,

I work with a dataset containing 96 examples and thus I can't use Auto-Model because the new min number of examples is 100 !
Is there any reason to this new limitation ?


Regards,

Lionel
Tagged:

Best Answers

  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
    Solution Accepted
    Hi @lionelderkrikor ,
    i guess the answer is that the new features would overfit too much? @IngoRM ?
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Options
    IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Solution Accepted
    Hi,
    Yes, indeed.  Plus we changed the validation approach a bit (see some of the other threads in the community - I will post answers there soon as well) to get to more robust estimations.  This unfortunately meant that we need more data for the validation part of the models which required to increase the limit from the 50 rows to 100. 
    We have looked into the statistics and it seemed that less than 3% of all AM runs have been on data sets of less than 100 rows and while we are sorry that we had to increase the limit (making the life harder for those 3% of the runs) we still believe that the improvements in validation and the addition of feature engineering justified this decision.
    Again, sorry for the inconvenience & best,
    Ingo

Answers

Sign In or Register to comment.