"Attribute weighting and cross validation"

UsernameUsername Member Posts: 39  Guru
edited May 23 in Help
Hi,

I want to select e.g the 80% best attributes according to the information gain. The first inner operator of the WrapperXValidation operator must return AttributeWeights, so how can I modify my AttributeWeights returned from InfoGainWeighting that the weights of the 80% worst attributes are set to zero?

Thanks

Answers

  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,645  RM Founder
    Hi,

    since the info gain weighting would deliver always the same weights I do not see why to use the Wrapper validation for that. You could simply apply the operator AttributeWeightSelection for using the top k attributes. Here is an example:

    <operator name="Root" class="Process" expanded="yes">
        <operator name="ExampleSetGenerator" class="ExampleSetGenerator">
            <parameter key="number_examples" value="500"/>
            <parameter key="number_of_attributes" value="100"/>
            <parameter key="target_function" value="sum classification"/>
        </operator>
        <operator name="InfoGainWeighting" class="InfoGainWeighting">
        </operator>
        <operator name="AttributeWeightSelection" class="AttributeWeightSelection">
            <parameter key="k" value="80"/>
            <parameter key="weight_relation" value="top k"/>
        </operator>
    </operator>
    Hope that gives you the basic idea.

    Cheers,
    Ingo
Sign In or Register to comment.