RapidMiner

RapidMiner

Feature Selection

Contributor II

Feature Selection

Hello.
I'm doing a work where i need experiment several methods of Feature Selection in rapid miner and then compare them. I'm testing rapid miner to do this work but i don't be well succeed with some operators. One of them are ANOVA. What i have to do to use use this operator.  ???
For example in the operator CFSFeatureSetEvaluator i use this inside another operator (FeatureSelection) but with ANOVA i don't now what to do ???
Someone can help?

Regards,

André.
3 REPLIES
RMStaff

Re: Feature Selection

Hi,


For example in the operator CFSFeatureSetEvaluator i use this inside another operator (FeatureSelection) but with ANOVA i don't now what to do


The first part is perfectly fine - but I don't get why you would want to integrate the ANOVA operator inside this setting?

The ANOVA operator can only used to calculate if two performance vectors (which in fact could guide the feature selection process) significantly differ.

Cheers,
Ingo
Contributor II

Re: Feature Selection

My idea is not integrate the ANOVA operator inside these setting but test separately ANOVA operator and CFSFeatureSetEvaluator in a data and then observe and conclude how can be better using for example SimpleValidation. I don't now if i am explicit... Basically my question is what operator i can apply to my data before use ANOVA  to do work this operator...

Cheers,

André.
RMStaff

Re: Feature Selection

You cannot use ANOVA as a performance calculator which could then be used to guide the feature selection - this is simply not what ANOVA does. As I said before, ANOVA can only be used for comparing two performances with respect to the question if they significanctly differ. But ANOVA cannot be used to create a performance measure.

You could get an idea of what can be done with ANOVA by looking into the samples delivered together with RapidMiner. There is also a sample process showing how ANOVA can be used.

Cheers,
Ingo