Options

"Input for FPGrowth"

svpriyansvpriyan Member Posts: 29 Maven
edited June 2019 in Help
Hai
Thanks earmijo, land for the previous hints.
I want to ask here, i got an error when i apply FPGrowth & Association Analysis.
I have the *.csv file and i try to do the following
-example source
-nominaltobinary
-FPGrowth
-Association Analysis

My *.CSV file contains the answer of the query [quote author=earmijo link=topic=499.msg1929#msg1929 date=1228839575]


I did not understand anyway there exists to make it correct.
thanks



[attachment deleted by admin]

Answers

  • Options
    TobiasMalbrechtTobiasMalbrecht Moderator, Employee, Member Posts: 295 RM Product Management
    Hi Pryian,

    one look into the operator info does help here ...

    Please note that the given data set is only allowed to contain binominal attributes, i.e. nominal attributes with only two different values. Simply use the provided preprocessing operators in order to transform your data set. The necessary operators are the discretization operators for changing the value types of numerical attributes to nominal and the operator Nominal2Binominal for transforming nominal attributes into binominal / binary ones.
    It seems that your transaction id is not nominal. However, the id should be not considered in the analysis at all. You should therefore mark it as special (use the [tt]ChangeAttributeRole[/tt] operator).

    Regards,
    Tobias
  • Options
    svpriyansvpriyan Member Posts: 29 Maven
    Thanks for the hints, I did in this way

    <operator name="Root" class="Process" expanded="yes">
        <operator name="CSVExampleSource" class="CSVExampleSource">
            <parameter key="filename" value="C:\Book1.csv"/>
        </operator>
        <operator name="Numerical2Polynominal" class="Numerical2Polynominal">
        </operator>
        <operator name="Aggregation" class="Aggregation">
            <list key="aggregation_attributes">
              <parameter key="item" value="count"/>
            </list>
            <parameter key="group_by_attributes" value="tid|item"/>
        </operator>
        <operator name="Example2AttributePivoting" class="Example2AttributePivoting">
            <parameter key="group_attribute" value="tid"/>
            <parameter key="index_attribute" value="item"/>
        </operator>
        <operator name="ChangeAttributeRole" class="ChangeAttributeRole">
            <parameter key="name" value="tid"/>
        </operator>
        <operator name="UserBasedDiscretization" class="UserBasedDiscretization">
            <list key="classes">
              <parameter key="yes" value="1.0"/>
              <parameter key="no" value="0.0"/>
            </list>
        </operator>
        <operator name="FPGrowth" class="FPGrowth">
            <parameter key="min_support" value="0.2"/>
        </operator>
        <operator name="AssociationRuleGenerator" class="AssociationRuleGenerator">
            <parameter key="keep_frequent_item_sets" value="true"/>
            <parameter key="min_confidence" value="1.0"/>
        </operator>
    </operator>
    Do I have some mistakes here.
    thanks
  • Options
    TobiasMalbrechtTobiasMalbrecht Moderator, Employee, Member Posts: 295 RM Product Management
    Hi,

    well, is it working the way you did it?

    Tobias
  • Options
    svpriyansvpriyan Member Posts: 29 Maven
    Hai
    Thanks for the reply
    Yea, I got the answers but Can I Simplify more this code.
    also, I have lot of Rules with me.. how can i reduce the rules , means until which level i can increase min_sup & min_conf. normally max 1. is it?

    thanks
    priyan
  • Options
    landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    yes. 1 is the maximum. As stated in the parameter description.

    Greetings,
      Sebastian
Sign In or Register to comment.