Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
How to provide seperate datasets for both training and testing ?
Iam a new user to rapidminer tool.i had watched a video regarding the training and testing dataset through cross validation.But it says a single dataset can be used for both training and testing.How to provide seperate datasets for both training and testing ? Can anyone please explain how to do that ?
0
Answers
Here is a comprehensive explanation by @sgenzer on cross-validation (CV).
https://community.rapidminer.com/discussion/55112/cross-validation-and-its-outputs-in-rm-studio
Simple understanding: When you connect dataset to CV operator, it divides the data into multiple sets based on the number of folds value. Every time it runs it will use one subset for testing and others for training.
EDIT: If you would like to provide separate datasets, then you need to connect the training data to the model and this model is connected to apply model operator and also the test dataset is connected to apply model operator. In this way, you can connect training and testing separately. sample XML below.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
According to the points that @varunm1 said if we have a data with label we dont need to separate dataset in to traning and testing. And also RM with cross validation is able to separte it automatically to the train and test parts And for the testing part it will not use the label like the training part.
Are these points correct?
Thank you
Once the auto model is done training, you can deploy the model and test it on new data. This is possible only after trainin automodel
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing