Options

Need Help Using STORE Operator, Getting Error Message

NS9NS9 Member Posts: 3 Newbie
edited February 2021 in Help
Hello,

I'm trying to use the Store operator to store my model, just like the deprecated "Write Model" operator, so I can retrieve it in a different portion of the process. When I attempt to use the Store operator I get the following messages in my log.

Feb 24, 2021 9:38:49 AM WARNING: File reading error.
Feb 24, 2021 9:38:49 AM WARNING: Loading script failed from: with error java.io.IOException: (The filename, directory name, or volume label syntax is incorrect)

Below is my XML process. Please let me know if there's anything else needed to help me solve this issue.

<?xml version="1.0" encoding="UTF-8"?><process version="9.8.001">

  <context>

    <input/>

    <output/>

    <macros/>

  </context>

  <operator activated="true" class="process" compatibility="9.8.001" expanded="true" name="Process">

    <parameter key="logverbosity" value="init"/>

    <parameter key="random_seed" value="2001"/>

    <parameter key="send_mail" value="never"/>

    <parameter key="notification_email" value=""/>

    <parameter key="process_duration_for_mail" value="30"/>

    <parameter key="encoding" value="SYSTEM"/>

    <process expanded="true">

      <operator activated="true" class="read_excel" compatibility="9.8.001" expanded="true" height="68" name="Read Excel" width="90" x="45" y="34">

        <parameter key="excel_file" value="C:\Users\USERNAME\Desktop\GSPC Time Series Training Data.xls"/>

        <parameter key="sheet_selection" value="sheet number"/>

        <parameter key="sheet_number" value="1"/>

        <parameter key="imported_cell_range" value="A1"/>

        <parameter key="encoding" value="SYSTEM"/>

        <parameter key="first_row_as_names" value="true"/>

        <list key="annotations"/>

        <parameter key="date_format" value=""/>

        <parameter key="time_zone" value="SYSTEM"/>

        <parameter key="locale" value="English (United States)"/>

        <parameter key="read_all_values_as_polynominal" value="false"/>

        <list key="data_set_meta_data_information">

          <parameter key="0" value="Date.true.date.attribute"/>

          <parameter key="1" value="Open.true.real.attribute"/>

          <parameter key="2" value="High.true.real.attribute"/>

          <parameter key="3" value="Low.true.real.attribute"/>

          <parameter key="4" value="Close.true.real.attribute"/>

          <parameter key="5" value="Volume.true.real.attribute"/>

        </list>

        <parameter key="read_not_matching_values_as_missings" value="false"/>

        <parameter key="datamanagement" value="double_array"/>

        <parameter key="data_management" value="auto"/>

      </operator>

      <operator activated="true" class="set_role" compatibility="9.8.001" expanded="true" height="82" name="Set Role" width="90" x="179" y="34">

        <parameter key="attribute_name" value="Date"/>

        <parameter key="target_role" value="id"/>

        <list key="set_additional_roles"/>

      </operator>

      <operator activated="true" class="series:windowing" compatibility="7.4.000" expanded="true" height="82" name="Windowing" width="90" x="313" y="34">

        <parameter key="series_representation" value="encode_series_by_examples"/>

        <parameter key="window_size" value="1"/>

        <parameter key="step_size" value="1"/>

        <parameter key="create_single_attributes" value="true"/>

        <parameter key="create_label" value="true"/>

        <parameter key="select_label_by_dimension" value="false"/>

        <parameter key="label_attribute" value="Close"/>

        <parameter key="horizon" value="1"/>

        <parameter key="add_incomplete_windows" value="false"/>

        <parameter key="stop_on_too_small_dataset" value="false"/>

      </operator>

      <operator activated="true" class="series:sliding_window_validation" compatibility="7.4.000" expanded="true" height="124" name="Validation" width="90" x="447" y="34">

        <parameter key="create_complete_model" value="false"/>

        <parameter key="training_window_width" value="20"/>

        <parameter key="training_window_step_size" value="5"/>

        <parameter key="test_window_width" value="20"/>

        <parameter key="horizon" value="5"/>

        <parameter key="cumulative_training" value="false"/>

        <parameter key="average_performances_only" value="true"/>

        <process expanded="true">

          <operator activated="true" class="support_vector_machine" compatibility="9.8.001" expanded="true" height="124" name="SVM" width="90" x="179" y="34">

            <parameter key="kernel_type" value="dot"/>

            <parameter key="kernel_gamma" value="1.0"/>

            <parameter key="kernel_sigma1" value="1.0"/>

            <parameter key="kernel_sigma2" value="0.0"/>

            <parameter key="kernel_sigma3" value="2.0"/>

            <parameter key="kernel_shift" value="1.0"/>

            <parameter key="kernel_degree" value="2.0"/>

            <parameter key="kernel_a" value="1.0"/>

            <parameter key="kernel_b" value="0.0"/>

            <parameter key="kernel_cache" value="200"/>

            <parameter key="C" value="0.0"/>

            <parameter key="convergence_epsilon" value="0.001"/>

            <parameter key="max_iterations" value="100000"/>

            <parameter key="scale" value="true"/>

            <parameter key="calculate_weights" value="true"/>

            <parameter key="return_optimization_performance" value="true"/>

            <parameter key="L_pos" value="1.0"/>

            <parameter key="L_neg" value="1.0"/>

            <parameter key="epsilon" value="0.0"/>

            <parameter key="epsilon_plus" value="0.0"/>

            <parameter key="epsilon_minus" value="0.0"/>

            <parameter key="balance_cost" value="false"/>

            <parameter key="quadratic_loss_pos" value="false"/>

            <parameter key="quadratic_loss_neg" value="false"/>

            <parameter key="estimate_performance" value="false"/>

          </operator>

          <connect from_port="training" to_op="SVM" to_port="training set"/>

          <connect from_op="SVM" from_port="model" to_port="model"/>

          <portSpacing port="source_training" spacing="0"/>

          <portSpacing port="sink_model" spacing="0"/>

          <portSpacing port="sink_through 1" spacing="0"/>

        </process>

        <process expanded="true">

          <operator activated="true" class="apply_model" compatibility="9.8.001" expanded="true" height="82" name="Apply Model" width="90" x="112" y="34">

            <list key="application_parameters"/>

            <parameter key="create_view" value="false"/>

          </operator>

          <operator activated="true" class="series:forecasting_performance" compatibility="7.4.000" expanded="true" height="82" name="Performance" width="90" x="313" y="34">

            <parameter key="horizon" value="1"/>

            <parameter key="main_criterion" value="first"/>

            <parameter key="prediction_trend_accuracy" value="true"/>

            <parameter key="skip_undefined_labels" value="true"/>

            <parameter key="use_example_weights" value="true"/>

          </operator>

          <connect from_port="model" to_op="Apply Model" to_port="model"/>

          <connect from_port="test set" to_op="Apply Model" to_port="unlabelled data"/>

          <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>

          <connect from_op="Performance" from_port="performance" to_port="averagable 1"/>

          <portSpacing port="source_model" spacing="0"/>

          <portSpacing port="source_test set" spacing="0"/>

          <portSpacing port="source_through 1" spacing="0"/>

          <portSpacing port="sink_averagable 1" spacing="0"/>

          <portSpacing port="sink_averagable 2" spacing="0"/>

        </process>

      </operator>

      <operator activated="true" class="store" compatibility="9.8.001" expanded="true" height="68" name="Store" width="90" x="581" y="136">

        <parameter key="repository_entry" value="Data"/>

      </operator>

      <operator activated="true" class="read_excel" compatibility="9.8.001" expanded="true" height="68" name="Read Excel (2)" width="90" x="45" y="340">

        <parameter key="excel_file" value="C:/Users/USERNAME/Desktop/^GSPC.xls"/>

        <parameter key="sheet_selection" value="sheet number"/>

        <parameter key="sheet_number" value="1"/>

        <parameter key="imported_cell_range" value="A1"/>

        <parameter key="encoding" value="SYSTEM"/>

        <parameter key="first_row_as_names" value="true"/>

        <list key="annotations"/>

        <parameter key="date_format" value=""/>

        <parameter key="time_zone" value="SYSTEM"/>

        <parameter key="locale" value="English (United States)"/>

        <parameter key="read_all_values_as_polynominal" value="false"/>

        <list key="data_set_meta_data_information">

          <parameter key="0" value="Date.true.date.attribute"/>

          <parameter key="1" value="Open.true.real.attribute"/>

          <parameter key="2" value="High.true.real.attribute"/>

          <parameter key="3" value="Low.true.real.attribute"/>

          <parameter key="4" value="Close.true.real.attribute"/>

          <parameter key="5" value="Volume.true.real.attribute"/>

        </list>

        <parameter key="read_not_matching_values_as_missings" value="false"/>

        <parameter key="datamanagement" value="double_array"/>

        <parameter key="data_management" value="auto"/>

      </operator>

      <operator activated="true" class="set_role" compatibility="9.8.001" expanded="true" height="82" name="Set Role (2)" width="90" x="246" y="340">

        <parameter key="attribute_name" value="Date"/>

        <parameter key="target_role" value="id"/>

        <list key="set_additional_roles"/>

      </operator>

      <operator activated="true" class="series:windowing" compatibility="7.4.000" expanded="true" height="82" name="Windowing (2)" width="90" x="447" y="340">

        <parameter key="series_representation" value="encode_series_by_examples"/>

        <parameter key="window_size" value="1"/>

        <parameter key="step_size" value="1"/>

        <parameter key="create_single_attributes" value="true"/>

        <parameter key="create_label" value="false"/>

        <parameter key="select_label_by_dimension" value="false"/>

        <parameter key="label_attribute" value="Close"/>

        <parameter key="horizon" value="1"/>

        <parameter key="add_incomplete_windows" value="false"/>

        <parameter key="stop_on_too_small_dataset" value="false"/>

      </operator>

      <connect from_op="Read Excel" from_port="output" to_op="Set Role" to_port="example set input"/>

      <connect from_op="Set Role" from_port="example set output" to_op="Windowing" to_port="example set input"/>

      <connect from_op="Windowing" from_port="example set output" to_op="Validation" to_port="training"/>

      <connect from_op="Validation" from_port="model" to_op="Store" to_port="input"/>

      <connect from_op="Validation" from_port="training" to_port="result 1"/>

      <connect from_op="Validation" from_port="averagable 1" to_port="result 2"/>

      <connect from_op="Read Excel (2)" from_port="output" to_op="Set Role (2)" to_port="example set input"/>

      <connect from_op="Set Role (2)" from_port="example set output" to_op="Windowing (2)" to_port="example set input"/>

      <portSpacing port="source_input 1" spacing="0"/>

      <portSpacing port="sink_result 1" spacing="0"/>

      <portSpacing port="sink_result 2" spacing="0"/>

      <portSpacing port="sink_result 3" spacing="0"/>

    </process>

  </operator>

</process>


Answers

  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,517 RM Data Scientist
    Hi,
    Store expects a Repository location, not a disc location.

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Options
    NS9NS9 Member Posts: 3 Newbie
    Hello mschmitz,

    How can I create a repository location that is not on my local hard drive?
  • Options
    BalazsBaranyBalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi @NS9,

    you can create a new repository on a shared drive/folder in your company, or use an AI Hub repository. 
    Select the "Show repository actions" button in the Repository panel to add a new repository. It would be a "local" repository for the shared folder option (even if it is on a remote computer), or an AI Hub repository.

    What problem are you trying to solve? Would you like to share your results (e. g. the model) with outer people using RapidMiner? Then both ways mentioned would work.
    Another way for this scenario is to use Git projects, either integrated into RapidMiner ("Connect to a Git project") or with external versioning using a different tool. 

    Regards,

    Balázs
  • Options
    NS9NS9 Member Posts: 3 Newbie
    Thank you Balazs,

    I got it working somehow. all I did was create a new folder in my local repository and for some reason it allowed me to save it there.

    Kind regards,

    NS
Sign In or Register to comment.