The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.

What is the preprocessing model output of some operators? [SOLVED]

vpmailvpmail Member Posts: 6 Contributor II
edited November 2018 in Help
I guess something but what is this output of some operators exactly? I searched lot without success. Any reference?
Many thanks


  • Options
    MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn

    the preprocessing output of e.g. the Normalization operator provides the so-called pre-processing model. In the case of the normalization, it stores the properties of the normalization (e.g. the mean of the training data etc.). That way, you can apply the same normalization to the test data. Please see the attached process for a very simple example.

    <?xml version="1.0" encoding="UTF-8" standalone="no"?>
    <process version="5.3.000">
      <operator activated="true" class="process" compatibility="5.3.000" expanded="true" name="Process">
        <process expanded="true" height="449" width="547">
          <operator activated="true" class="generate_data" compatibility="5.3.000" expanded="true" height="60" name="Generate Training Data" width="90" x="45" y="30"/>
          <operator activated="true" class="normalize" compatibility="5.3.000" expanded="true" height="94" name="Normalize" width="90" x="179" y="30"/>
          <operator activated="true" class="generate_data" compatibility="5.3.000" expanded="true" height="60" name="Generate Test Data" width="90" x="36" y="229"/>
          <operator activated="true" class="apply_model" compatibility="5.3.000" expanded="true" height="76" name="Apply Model" width="90" x="380" y="165">
            <list key="application_parameters"/>
          <connect from_op="Generate Training Data" from_port="output" to_op="Normalize" to_port="example set input"/>
          <connect from_op="Normalize" from_port="preprocessing model" to_op="Apply Model" to_port="model"/>
          <connect from_op="Generate Test Data" from_port="output" to_op="Apply Model" to_port="unlabelled data"/>
          <connect from_op="Apply Model" from_port="labelled data" to_port="result 1"/>
          <portSpacing port="source_input 1" spacing="0"/>
          <portSpacing port="sink_result 1" spacing="144"/>
          <portSpacing port="sink_result 2" spacing="0"/>
  • Options
    vpmailvpmail Member Posts: 6 Contributor II
    Hi Marius,
    thank you, it's clear now.
    Best regards,
Sign In or Register to comment.