Auto Model: Performance is worse when automatic feature selection / generation is turned on?
Hello, I am new to the machine learning world and am teaching myself by playing around with RapidMiner Studio. I have just noticed something that doesn't make sense to me, and I am hoping someone can explain it.
I put the same data set into Auto Model and first ran it with 'automatic feature selection / generation' turned off, then ran it again with it turned on.
With 'automatic feature selection / generation' turned on, the performance of the model was worse than with it turned off. I am confused about why adding feature selection / generation could make a model worse. If none of the candidate features improve the model, wouldn't they just be rejected, leaving the original model? In that case the performance should only ever be the same or better.
Again, I am very new to this and a bit confused, so any help would be greatly appreciated!
Thank you
Answers
I was thinking it was picking the best model based on those results, so that must be where I was confused.
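For anyone landing here with the same question, here is a minimal sketch of the general effect in plain Python. It is not a description of what Auto Model does internally; the data, the nearest-centroid classifier, and the random-subset search are all made up for illustration. The idea it shows: a feature set chosen because it scores best on one validation split can never look worse than the original features on that split, but nothing guarantees the same ordering on a separate hold-out set.

```python
import random

def make_data(n, n_features, rng):
    """Synthetic binary data: only feature 0 carries signal, the rest are noise."""
    rows = []
    for i in range(n):
        y = i % 2  # alternate labels so both classes are always present
        x = [rng.gauss(float(y), 1.0)]
        x += [rng.gauss(0.0, 1.0) for _ in range(n_features - 1)]
        rows.append((x, y))
    return rows

def fit_centroids(rows, cols):
    """Hypothetical stand-in model: one centroid per class over the chosen columns."""
    centroids = {}
    for label in (0, 1):
        members = [x for x, y in rows if y == label]
        centroids[label] = [sum(x[c] for x in members) / len(members) for c in cols]
    return centroids

def accuracy(centroids, rows, cols):
    """Fraction of rows whose nearest centroid matches the true label."""
    hits = 0
    for x, y in rows:
        dist = {label: sum((x[c] - m) ** 2 for c, m in zip(cols, centre))
                for label, centre in centroids.items()}
        hits += int(min(dist, key=dist.get) == y)
    return hits / len(rows)

def select_features(train, valid, n_features, n_candidates, rng):
    """Pick the candidate subset with the best validation accuracy.

    The full feature set is always one of the candidates, so the winner
    is never worse than it on the validation split.
    """
    all_cols = list(range(n_features))
    candidates = [all_cols]
    for _ in range(n_candidates):
        k = rng.randint(1, n_features)
        candidates.append(sorted(rng.sample(all_cols, k)))
    return max(candidates,
               key=lambda cols: accuracy(fit_centroids(train, cols), valid, cols))

rng = random.Random(0)
N_FEATURES = 8
train = make_data(40, N_FEATURES, rng)
valid = make_data(40, N_FEATURES, rng)   # used to choose the features
test = make_data(200, N_FEATURES, rng)   # never seen during selection

all_cols = list(range(N_FEATURES))
chosen = select_features(train, valid, N_FEATURES, 50, rng)

for name, cols in (("all features", all_cols), ("selected", chosen)):
    model = fit_centroids(train, cols)
    print(name, cols,
          "validation:", accuracy(model, valid, cols),
          "test:", accuracy(model, test, cols))
```

Running this with different seeds shows the "selected" row matching or beating "all features" on the validation column every time, while the test column can go either way. So "same or better" only holds for the data the selection was scored on.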