Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Performance of model with and without cross validation
Hello,
below are two models of random forest (RF). In the first one I used cross validation (CV) to validate my model. In the other one RF is used without any validation method directly on the training data. In both cases the same training data set, as well as the same real data set to apply the model is used. However the performance of both differs (slightly).
My question is: Shouldnt performance be the same?
If I use some other algorithm e.g. DT,NB, performance of both models (with and without CV) are the same.
Tagged:
0
Best Answer
-
MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data ScientistHi,did you fix random seeds for both of them to make them the same? I guess this is just the flucation you get from different seeds.Best,Martin- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany5
Answers