"Input for FPGrowth"

svpriyan · December 2008

Hai
Thanks earmijo, land for the previous hints.
I want to ask here, i got an error when i apply FPGrowth & Association Analysis.
I have the *.csv file and i try to do the following
-example source
-nominaltobinary
-FPGrowth
-Association Analysis

My *.CSV file contains the answer of the query [quote author=earmijo link=topic=499.msg1929#msg1929 date=1228839575]

I did not understand anyway there exists to make it correct.
thanks

[attachment deleted by admin]

TobiasMalbrecht · December 2008

Hi Pryian,

one look into the operator info does help here ...

Please note that the given data set is only allowed to contain binominal attributes, i.e. nominal attributes with only two different values. Simply use the provided preprocessing operators in order to transform your data set. The necessary operators are the discretization operators for changing the value types of numerical attributes to nominal and the operator Nominal2Binominal for transforming nominal attributes into binominal / binary ones.

It seems that your transaction id is not nominal. However, the id should be not considered in the analysis at all. You should therefore mark it as special (use the [tt]ChangeAttributeRole[/tt] operator).

Regards,
Tobias

svpriyan · December 2008

Thanks for the hints, I did in this way


<operator name="Root" class="Process" expanded="yes">
    <operator name="CSVExampleSource" class="CSVExampleSource">
        <parameter key="filename"	value="C:\Book1.csv"/>
    </operator>
    <operator name="Numerical2Polynominal" class="Numerical2Polynominal">
    </operator>
    <operator name="Aggregation" class="Aggregation">
        <list key="aggregation_attributes">
          <parameter key="item"	value="count"/>
        </list>
        <parameter key="group_by_attributes"	value="tid|item"/>
    </operator>
    <operator name="Example2AttributePivoting" class="Example2AttributePivoting">
        <parameter key="group_attribute"	value="tid"/>
        <parameter key="index_attribute"	value="item"/>
    </operator>
    <operator name="ChangeAttributeRole" class="ChangeAttributeRole">
        <parameter key="name"	value="tid"/>
    </operator>
    <operator name="UserBasedDiscretization" class="UserBasedDiscretization">
        <list key="classes">
          <parameter key="yes"	value="1.0"/>
          <parameter key="no"	value="0.0"/>
        </list>
    </operator>
    <operator name="FPGrowth" class="FPGrowth">
        <parameter key="min_support"	value="0.2"/>
    </operator>
    <operator name="AssociationRuleGenerator" class="AssociationRuleGenerator">
        <parameter key="keep_frequent_item_sets"	value="true"/>
        <parameter key="min_confidence"	value="1.0"/>
    </operator>
</operator>

Do I have some mistakes here.
thanks

TobiasMalbrecht · December 2008

Hi,

well, is it working the way you did it?

Tobias

svpriyan · December 2008

Hai
Thanks for the reply
Yea, I got the answers but Can I Simplify more this code.
also, I have lot of Rules with me.. how can i reduce the rules , means until which level i can increase min_sup & min_conf. normally max 1. is it?

thanks
priyan

land · December 2008

Hi,
yes. 1 is the maximum. As stated in the parameter description.

Greetings,
Sebastian

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

"Input for FPGrowth"

Answers