Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Automodel - Performance
I have 1.281 lines of data on my set. I launched 2 models in Auto-Model feature.
Both have based their performance on 512 lines, which is +/-40% of the lines, as Training Data.
Is this normal ratio? And how should I use the remaining 60% of lines to Validate my Data? After deployment?
Many thanks for your feedback.
Both have based their performance on 512 lines, which is +/-40% of the lines, as Training Data.
Is this normal ratio? And how should I use the remaining 60% of lines to Validate my Data? After deployment?
Many thanks for your feedback.
Tagged:
0
Best Answer
-
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 UnicornHi @Olus,
I think you are doing a little misunderstanding :
To sum up, Auto-Model is using :
- 60% of your initial dataset to train the model(s) : It's the training set
- the remaining 40% of your initial dataset to test the model(s) : It's the test set.
To be more accuracy, here an extract of the documentation of how is calculated the performance of a model in Auto-Model.
You can find this documentation by clicking in the "I" (at the top right of the screen) in the final screen of Auto-Model (results screen) :
Regards,
Lionel
6
Answers
Many thanks for your precious time and help on answering my question.
Regards,
Olus