Feature Request: Batch validation with optional fold numbers
I have a simple feature request for the Cross Validation operator. Currently, the "Batch Validation" option lets us define batches and creates the folds based on the number of batches. I am looking for an enhancement that lets us control the number of folds created from these batches.
For example, if I have data for 100 subjects with 10 samples each, there are 1000 samples in total. To do leave-one-subject-out cross-validation, I need to set 100 batch IDs (one per subject) and run a batch validation in the Cross Validation operator. If I then want to try only 5 batches with 20 subjects in each, I have to generate the attribute again with 5 batch IDs. Instead, there could be an option that uses the 100 batch IDs created first as an index and divides them into 5 subsets.
This would make it easy to switch between leave-one-batch-out and group k-fold (GroupKFold-style) validation.
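
To make the request concrete, here is a minimal Python sketch of the behaviour I have in mind. It is not how RapidMiner implements batch validation; the function name `batch_folds` and the round-robin assignment of batches to folds are my own assumptions for illustration. Leaving the fold count unset gives leave-one-batch-out, while setting it groups the existing batch IDs into that many folds without ever splitting a batch.

```python
def batch_folds(batch_ids, n_folds=None):
    """Return a list of (train_indices, test_indices) pairs.

    Hypothetical helper: all samples sharing a batch ID stay together.
    n_folds=None means one fold per batch (leave-one-batch-out).
    """
    # Unique batch IDs, in order of first appearance.
    unique_ids = list(dict.fromkeys(batch_ids))
    if n_folds is None:
        n_folds = len(unique_ids)
    if not 2 <= n_folds <= len(unique_ids):
        raise ValueError("n_folds must be between 2 and the number of batches")

    # Assumption: batches are assigned to folds round-robin.
    fold_of = {b: i % n_folds for i, b in enumerate(unique_ids)}

    splits = []
    for fold in range(n_folds):
        test = [i for i, b in enumerate(batch_ids) if fold_of[b] == fold]
        train = [i for i, b in enumerate(batch_ids) if fold_of[b] != fold]
        splits.append((train, test))
    return splits


# 100 subjects with 10 samples each: one batch ID per subject.
batch_ids = [subject for subject in range(100) for _ in range(10)]

leave_one_out = batch_folds(batch_ids)          # 100 folds, 1 subject each
five_folds = batch_folds(batch_ids, n_folds=5)  # 5 folds, 20 subjects each
```

With this, the same 100 batch IDs serve both cases, so the batch attribute never has to be regenerated. For comparison, scikit-learn exposes the two modes as `LeaveOneGroupOut` and `GroupKFold`.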