Options

Supplied test set...

yogafireyogafire Member Posts: 43 Contributor II
edited November 2018 in Help
Hello, I am dimas from Indonesia.

I wonder how to test my model based on supplied test set at RapidMiner. I usually do such activity on WEKA. ::)
My purpose is at least to find out performance of my model when it's applied on testing set. such that, I can choose the best model having best accuracy on training set and test set. ;D

I apreciate all replies from you all. if you're so kind to share the XML process, of course it will be great! ;)

Thank You

Answers

  • Options
    landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    here's a RapidMiner 5.0 process which gives you an impression how easy it is to accomplish such tasks. On might say, RapidMiner is designed for exactly this :)
    <?xml version="1.0" encoding="UTF-8" standalone="no"?>
    <process version="5.0">
      <context>
        <input>
          <location/>
        </input>
        <output>
          <location/>
          <location/>
        </output>
        <macros/>
      </context>
      <operator activated="true" class="process" expanded="true" name="Process">
        <process expanded="true" height="395" width="790">
          <operator activated="true" class="generate_data" expanded="true" height="60" name="Generate Train" width="90" x="45" y="30">
            <parameter key="target_function" value="sum classification"/>
          </operator>
          <operator activated="true" class="linear_regression" expanded="true" height="76" name="Learn" width="90" x="179" y="30"/>
          <operator activated="true" class="generate_data" expanded="true" height="60" name="Generate Test" width="90" x="45" y="165">
            <parameter key="target_function" value="sum classification"/>
          </operator>
          <operator activated="true" class="apply_model" expanded="true" height="76" name="Apply Model" width="90" x="313" y="165">
            <list key="application_parameters"/>
          </operator>
          <operator activated="true" class="performance_classification" expanded="true" height="76" name="Performance" width="90" x="447" y="165">
            <parameter key="accuracy" value="true"/>
            <list key="class_weights"/>
          </operator>
          <connect from_op="Generate Train" from_port="output" to_op="Learn" to_port="training set"/>
          <connect from_op="Learn" from_port="model" to_op="Apply Model" to_port="model"/>
          <connect from_op="Generate Test" from_port="output" to_op="Apply Model" to_port="unlabelled data"/>
          <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
          <connect from_op="Performance" from_port="performance" to_port="result 1"/>
          <portSpacing port="source_input 1" spacing="0"/>
          <portSpacing port="sink_result 1" spacing="126"/>
          <portSpacing port="sink_result 2" spacing="18"/>
        </process>
      </operator>
    </process>
    Greetings,
      Sebastian
  • Options
    yogafireyogafire Member Posts: 43 Contributor II
    OK, Thanks Before.

    I'll try it.
    I'll keep you informed of the result
Sign In or Register to comment.