What is the preprocessing model output of some operators? [SOLVED]

vpmailvpmail Member Posts: 6 Contributor II
edited November 2018 in Help
I guess something but what is this output of some operators exactly? I searched lot without success. Any reference?
Many thanks

Answers

  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi,

    the preprocessing output of e.g. the Normalization operator provides the so-called pre-processing model. In the case of the normalization, it stores the properties of the normalization (e.g. the mean of the training data etc.). That way, you can apply the same normalization to the test data. Please see the attached process for a very simple example.

    Best,
    Marius
    <?xml version="1.0" encoding="UTF-8" standalone="no"?>
    <process version="5.3.000">
      <context>
        <input/>
        <output/>
        <macros/>
      </context>
      <operator activated="true" class="process" compatibility="5.3.000" expanded="true" name="Process">
        <process expanded="true" height="449" width="547">
          <operator activated="true" class="generate_data" compatibility="5.3.000" expanded="true" height="60" name="Generate Training Data" width="90" x="45" y="30"/>
          <operator activated="true" class="normalize" compatibility="5.3.000" expanded="true" height="94" name="Normalize" width="90" x="179" y="30"/>
          <operator activated="true" class="generate_data" compatibility="5.3.000" expanded="true" height="60" name="Generate Test Data" width="90" x="36" y="229"/>
          <operator activated="true" class="apply_model" compatibility="5.3.000" expanded="true" height="76" name="Apply Model" width="90" x="380" y="165">
            <list key="application_parameters"/>
          </operator>
          <connect from_op="Generate Training Data" from_port="output" to_op="Normalize" to_port="example set input"/>
          <connect from_op="Normalize" from_port="preprocessing model" to_op="Apply Model" to_port="model"/>
          <connect from_op="Generate Test Data" from_port="output" to_op="Apply Model" to_port="unlabelled data"/>
          <connect from_op="Apply Model" from_port="labelled data" to_port="result 1"/>
          <portSpacing port="source_input 1" spacing="0"/>
          <portSpacing port="sink_result 1" spacing="144"/>
          <portSpacing port="sink_result 2" spacing="0"/>
        </process>
      </operator>
    </process>
  • vpmailvpmail Member Posts: 6 Contributor II
    Hi Marius,
    thank you, it's clear now.
    Best regards,
    Peter
Sign In or Register to comment.