RapidMiner 9.7 is Now Available
Lots of amazing new improvements including true version control! Learn more about what's new here.
Auto Model only take 500000 records for training? Missing ID for scoring output?
1) Auto Model - When opening process after Auto Model ran, I noticed Auto Model only take 500k records for training while my input data set have around 30 millions records but only several fields (we probably have close to 100M records but 30M ~ 900MB for starter). Is there anyway we can do so that Auto Model train with more data?
I've tried to open up the process and increase the size of sampling and it can run fine on 10M. I can only imagine data grow more and more and we eventually add more features in the mix. But I want to use Auto Model as much as possible if it serve the purpose in later stage like Deployment
2) Deployment - I had indicated ID-field in data to be scored but after the output was generated, the result didn't have ID field that I expect. Any explanation and how we can export the result with ID-fields and their predictions so that we can import to our datawarehouse ?