🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉

Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.

CLICK HERE TO DOWNLOAD

Need Help Using STORE Operator, Getting Error Message

NS9NS9 Member Posts: 3 Newbie
edited February 24 in Help
Hello,

I'm trying to use the Store operator to store my model, just like the deprecated "Write Model" operator, so I can retrieve it in a different portion of the process. When I attempt to use the Store operator I get the following messages in my log.

Feb 24, 2021 9:38:49 AM WARNING: File reading error.
Feb 24, 2021 9:38:49 AM WARNING: Loading script failed from: with error java.io.IOException: (The filename, directory name, or volume label syntax is incorrect)

Below is my XML process. Please let me know if there's anything else needed to help me solve this issue.

<?xml version="1.0" encoding="UTF-8"?><process version="9.8.001">

  <context>

    <input/>

    <output/>

    <macros/>

  </context>

  <operator activated="true" class="process" compatibility="9.8.001" expanded="true" name="Process">

    <parameter key="logverbosity" value="init"/>

    <parameter key="random_seed" value="2001"/>

    <parameter key="send_mail" value="never"/>

    <parameter key="notification_email" value=""/>

    <parameter key="process_duration_for_mail" value="30"/>

    <parameter key="encoding" value="SYSTEM"/>

    <process expanded="true">

      <operator activated="true" class="read_excel" compatibility="9.8.001" expanded="true" height="68" name="Read Excel" width="90" x="45" y="34">

        <parameter key="excel_file" value="C:\Users\USERNAME\Desktop\GSPC Time Series Training Data.xls"/>

        <parameter key="sheet_selection" value="sheet number"/>

        <parameter key="sheet_number" value="1"/>

        <parameter key="imported_cell_range" value="A1"/>

        <parameter key="encoding" value="SYSTEM"/>

        <parameter key="first_row_as_names" value="true"/>

        <list key="annotations"/>

        <parameter key="date_format" value=""/>

        <parameter key="time_zone" value="SYSTEM"/>

        <parameter key="locale" value="English (United States)"/>

        <parameter key="read_all_values_as_polynominal" value="false"/>

        <list key="data_set_meta_data_information">

          <parameter key="0" value="Date.true.date.attribute"/>

          <parameter key="1" value="Open.true.real.attribute"/>

          <parameter key="2" value="High.true.real.attribute"/>

          <parameter key="3" value="Low.true.real.attribute"/>

          <parameter key="4" value="Close.true.real.attribute"/>

          <parameter key="5" value="Volume.true.real.attribute"/>

        </list>

        <parameter key="read_not_matching_values_as_missings" value="false"/>

        <parameter key="datamanagement" value="double_array"/>

        <parameter key="data_management" value="auto"/>

      </operator>

      <operator activated="true" class="set_role" compatibility="9.8.001" expanded="true" height="82" name="Set Role" width="90" x="179" y="34">

        <parameter key="attribute_name" value="Date"/>

        <parameter key="target_role" value="id"/>

        <list key="set_additional_roles"/>

      </operator>

      <operator activated="true" class="series:windowing" compatibility="7.4.000" expanded="true" height="82" name="Windowing" width="90" x="313" y="34">

        <parameter key="series_representation" value="encode_series_by_examples"/>

        <parameter key="window_size" value="1"/>

        <parameter key="step_size" value="1"/>

        <parameter key="create_single_attributes" value="true"/>

        <parameter key="create_label" value="true"/>

        <parameter key="select_label_by_dimension" value="false"/>

        <parameter key="label_attribute" value="Close"/>

        <parameter key="horizon" value="1"/>

        <parameter key="add_incomplete_windows" value="false"/>

        <parameter key="stop_on_too_small_dataset" value="false"/>

      </operator>

      <operator activated="true" class="series:sliding_window_validation" compatibility="7.4.000" expanded="true" height="124" name="Validation" width="90" x="447" y="34">

        <parameter key="create_complete_model" value="false"/>

        <parameter key="training_window_width" value="20"/>

        <parameter key="training_window_step_size" value="5"/>

        <parameter key="test_window_width" value="20"/>

        <parameter key="horizon" value="5"/>

        <parameter key="cumulative_training" value="false"/>

        <parameter key="average_performances_only" value="true"/>

        <process expanded="true">

          <operator activated="true" class="support_vector_machine" compatibility="9.8.001" expanded="true" height="124" name="SVM" width="90" x="179" y="34">

            <parameter key="kernel_type" value="dot"/>

            <parameter key="kernel_gamma" value="1.0"/>

            <parameter key="kernel_sigma1" value="1.0"/>

            <parameter key="kernel_sigma2" value="0.0"/>

            <parameter key="kernel_sigma3" value="2.0"/>

            <parameter key="kernel_shift" value="1.0"/>

            <parameter key="kernel_degree" value="2.0"/>

            <parameter key="kernel_a" value="1.0"/>

            <parameter key="kernel_b" value="0.0"/>

            <parameter key="kernel_cache" value="200"/>

            <parameter key="C" value="0.0"/>

            <parameter key="convergence_epsilon" value="0.001"/>

            <parameter key="max_iterations" value="100000"/>

            <parameter key="scale" value="true"/>

            <parameter key="calculate_weights" value="true"/>

            <parameter key="return_optimization_performance" value="true"/>

            <parameter key="L_pos" value="1.0"/>

            <parameter key="L_neg" value="1.0"/>

            <parameter key="epsilon" value="0.0"/>

            <parameter key="epsilon_plus" value="0.0"/>

            <parameter key="epsilon_minus" value="0.0"/>

            <parameter key="balance_cost" value="false"/>

            <parameter key="quadratic_loss_pos" value="false"/>

            <parameter key="quadratic_loss_neg" value="false"/>

            <parameter key="estimate_performance" value="false"/>

          </operator>

          <connect from_port="training" to_op="SVM" to_port="training set"/>

          <connect from_op="SVM" from_port="model" to_port="model"/>

          <portSpacing port="source_training" spacing="0"/>

          <portSpacing port="sink_model" spacing="0"/>

          <portSpacing port="sink_through 1" spacing="0"/>

        </process>

        <process expanded="true">

          <operator activated="true" class="apply_model" compatibility="9.8.001" expanded="true" height="82" name="Apply Model" width="90" x="112" y="34">

            <list key="application_parameters"/>

            <parameter key="create_view" value="false"/>

          </operator>

          <operator activated="true" class="series:forecasting_performance" compatibility="7.4.000" expanded="true" height="82" name="Performance" width="90" x="313" y="34">

            <parameter key="horizon" value="1"/>

            <parameter key="main_criterion" value="first"/>

            <parameter key="prediction_trend_accuracy" value="true"/>

            <parameter key="skip_undefined_labels" value="true"/>

            <parameter key="use_example_weights" value="true"/>

          </operator>

          <connect from_port="model" to_op="Apply Model" to_port="model"/>

          <connect from_port="test set" to_op="Apply Model" to_port="unlabelled data"/>

          <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>

          <connect from_op="Performance" from_port="performance" to_port="averagable 1"/>

          <portSpacing port="source_model" spacing="0"/>

          <portSpacing port="source_test set" spacing="0"/>

          <portSpacing port="source_through 1" spacing="0"/>

          <portSpacing port="sink_averagable 1" spacing="0"/>

          <portSpacing port="sink_averagable 2" spacing="0"/>

        </process>

      </operator>

      <operator activated="true" class="store" compatibility="9.8.001" expanded="true" height="68" name="Store" width="90" x="581" y="136">

        <parameter key="repository_entry" value="Data"/>

      </operator>

      <operator activated="true" class="read_excel" compatibility="9.8.001" expanded="true" height="68" name="Read Excel (2)" width="90" x="45" y="340">

        <parameter key="excel_file" value="C:/Users/USERNAME/Desktop/^GSPC.xls"/>

        <parameter key="sheet_selection" value="sheet number"/>

        <parameter key="sheet_number" value="1"/>

        <parameter key="imported_cell_range" value="A1"/>

        <parameter key="encoding" value="SYSTEM"/>

        <parameter key="first_row_as_names" value="true"/>

        <list key="annotations"/>

        <parameter key="date_format" value=""/>

        <parameter key="time_zone" value="SYSTEM"/>

        <parameter key="locale" value="English (United States)"/>

        <parameter key="read_all_values_as_polynominal" value="false"/>

        <list key="data_set_meta_data_information">

          <parameter key="0" value="Date.true.date.attribute"/>

          <parameter key="1" value="Open.true.real.attribute"/>

          <parameter key="2" value="High.true.real.attribute"/>

          <parameter key="3" value="Low.true.real.attribute"/>

          <parameter key="4" value="Close.true.real.attribute"/>

          <parameter key="5" value="Volume.true.real.attribute"/>

        </list>

        <parameter key="read_not_matching_values_as_missings" value="false"/>

        <parameter key="datamanagement" value="double_array"/>

        <parameter key="data_management" value="auto"/>

      </operator>

      <operator activated="true" class="set_role" compatibility="9.8.001" expanded="true" height="82" name="Set Role (2)" width="90" x="246" y="340">

        <parameter key="attribute_name" value="Date"/>

        <parameter key="target_role" value="id"/>

        <list key="set_additional_roles"/>

      </operator>

      <operator activated="true" class="series:windowing" compatibility="7.4.000" expanded="true" height="82" name="Windowing (2)" width="90" x="447" y="340">

        <parameter key="series_representation" value="encode_series_by_examples"/>

        <parameter key="window_size" value="1"/>

        <parameter key="step_size" value="1"/>

        <parameter key="create_single_attributes" value="true"/>

        <parameter key="create_label" value="false"/>

        <parameter key="select_label_by_dimension" value="false"/>

        <parameter key="label_attribute" value="Close"/>

        <parameter key="horizon" value="1"/>

        <parameter key="add_incomplete_windows" value="false"/>

        <parameter key="stop_on_too_small_dataset" value="false"/>

      </operator>

      <connect from_op="Read Excel" from_port="output" to_op="Set Role" to_port="example set input"/>

      <connect from_op="Set Role" from_port="example set output" to_op="Windowing" to_port="example set input"/>

      <connect from_op="Windowing" from_port="example set output" to_op="Validation" to_port="training"/>

      <connect from_op="Validation" from_port="model" to_op="Store" to_port="input"/>

      <connect from_op="Validation" from_port="training" to_port="result 1"/>

      <connect from_op="Validation" from_port="averagable 1" to_port="result 2"/>

      <connect from_op="Read Excel (2)" from_port="output" to_op="Set Role (2)" to_port="example set input"/>

      <connect from_op="Set Role (2)" from_port="example set output" to_op="Windowing (2)" to_port="example set input"/>

      <portSpacing port="source_input 1" spacing="0"/>

      <portSpacing port="sink_result 1" spacing="0"/>

      <portSpacing port="sink_result 2" spacing="0"/>

      <portSpacing port="sink_result 3" spacing="0"/>

    </process>

  </operator>

</process>


Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,089  RM Data Scientist
    Hi,
    Store expects a Repository location, not a disc location.

    Best,
    Martin
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
    NS9
  • NS9NS9 Member Posts: 3 Newbie
    Hello mschmitz,

    How can I create a repository location that is not on my local hard drive?
  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 692   Unicorn
    Hi @NS9,

    you can create a new repository on a shared drive/folder in your company, or use an AI Hub repository. 
    Select the "Show repository actions" button in the Repository panel to add a new repository. It would be a "local" repository for the shared folder option (even if it is on a remote computer), or an AI Hub repository.

    What problem are you trying to solve? Would you like to share your results (e. g. the model) with outer people using RapidMiner? Then both ways mentioned would work.
    Another way for this scenario is to use Git projects, either integrated into RapidMiner ("Connect to a Git project") or with external versioning using a different tool. 

    Regards,

    Balázs
    NS9
  • NS9NS9 Member Posts: 3 Newbie
    Thank you Balazs,

    I got it working somehow. all I did was create a new folder in my local repository and for some reason it allowed me to save it there.

    Kind regards,

    NS
    BalazsBaranymschmitz
Sign In or Register to comment.