Exploratory data analysis using the chi-square test

vishaldutt53 Member Posts: 1 Contributor I
edited December 2018 in Help

I'm having trouble running the chi-square test on my dataset. My dataset has real and polynominal values.

Here is the XML of my process:

<?xml version="1.0" encoding="UTF-8"?><process version="8.1.001">
<operator activated="true" class="retrieve" compatibility="8.1.001" expanded="true" height="68" name="Retrieve house_prices" width="90" x="45" y="34">
<parameter key="repository_entry" value="//Local Repository/house_prices"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.1.001">
<operator activated="true" class="weka:W-ChiSquaredAttributeEval" compatibility="7.3.000" expanded="true" height="82" name="W-ChiSquaredAttributeEval" width="90" x="313" y="34">
<parameter key="normalize_weights" value="false"/>
<parameter key="sort_weights" value="true"/>
<parameter key="sort_direction" value="ascending"/>
<parameter key="M" value="false"/>
<parameter key="B" value="false"/>
</operator>
</process>

Can anyone help me, please?

Answers

  • yyhuang Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 364 RM Data Scientist

    You can compute feature weights with a chi-squared test like this:

    <?xml version="1.0" encoding="UTF-8"?><process version="8.1.001">
    <operator activated="true" class="retrieve" compatibility="8.1.001" expanded="true" height="68" name="Retrieve Titanic Training" width="90" x="112" y="34">
    <parameter key="repository_entry" value="//Samples/data/Titanic Training"/>
    </operator>
    </process>
    <?xml version="1.0" encoding="UTF-8"?><process version="8.1.001">
    <operator activated="true" class="weka:W-ChiSquaredAttributeEval" compatibility="7.3.000" expanded="true" height="82" name="W-ChiSquaredAttributeEval" width="90" x="313" y="34">
    <parameter key="normalize_weights" value="false"/>
    <parameter key="sort_weights" value="true"/>
    <parameter key="sort_direction" value="ascending"/>
    <parameter key="M" value="false"/>
    <parameter key="B" value="false"/>
    </operator>
    </process>
    <?xml version="1.0" encoding="UTF-8"?><process version="8.1.001">
    <operator activated="true" class="weight_by_chi_squared_statistic" compatibility="8.1.001" expanded="true" height="82" name="Weight by Chi Squared Statistic" width="90" x="514" y="136">
    <parameter key="normalize_weights" value="false"/>
    <parameter key="sort_weights" value="true"/>
    <parameter key="sort_direction" value="ascending"/>
    <parameter key="number_of_bins" value="10"/>
    </operator>
    </process>
  • Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn

    You can also check out the Statistics extension in the marketplace, which offers a more general approach to calculating chi-square statistics.

     

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
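
For readers who want to see what a chi-squared attribute weight measures, here is a minimal Python sketch, not RapidMiner's actual implementation. It assumes a nominal label, and it assumes numeric attributes are first discretized into equal-width bins (mirroring the number_of_bins parameter in the process above; the exact binning RapidMiner uses is not stated in this thread). The function names and the toy data are hypothetical.

```python
from collections import Counter


def equal_width_bins(values, number_of_bins=10):
    """Map each numeric value to a bin index (assumed equal-width binning)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: everything lands in a single bin.
        return [0] * len(values)
    width = (hi - lo) / number_of_bins
    # The maximum value is clamped into the last bin.
    return [min(int((v - lo) / width), number_of_bins - 1) for v in values]


def chi_squared_statistic(attribute, label):
    """Pearson chi-squared statistic between two nominal sequences."""
    if len(attribute) != len(label):
        raise ValueError("attribute and label must have the same length")
    n = len(attribute)
    observed = Counter(zip(attribute, label))  # contingency table counts
    row_totals = Counter(attribute)
    col_totals = Counter(label)
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            # Expected count if attribute and label were independent.
            expected = row_total * col_total / n
            chi2 += (observed[(a, b)] - expected) ** 2 / expected
    return chi2


# Toy data (made up for illustration only).
label = ["no", "no", "yes", "yes"]
nominal_attr = ["male", "male", "female", "female"]
numeric_attr = [10.0, 12.0, 55.0, 60.0]

print(chi_squared_statistic(nominal_attr, label))
print(chi_squared_statistic(equal_width_bins(numeric_attr, 2), label))
```

A larger statistic means a stronger association between the attribute and the label. Real-valued attributes have to be binned first, and the label itself must be nominal, so a numeric label would need to be discretized before a test like this applies.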