I'm new to RapidMiner. Can someone help me with this Kaggle challenge? I only got 69% right.

Xandesaraiva_ Member Posts: 2 Contributor I


<context>
    <input>
      <location>//Local Repository/data/1º DESAFIO</location>
    </input>
    <output/>
    <macros/>
  </context>
  <operator activated="true" class="process" compatibility="9.3.001" expanded="true" name="Process">
    <parameter key="logverbosity" value="init"/>
    <parameter key="random_seed" value="2001"/>
    <parameter key="send_mail" value="never"/>
    <parameter key="notification_email" value=""/>
    <parameter key="process_duration_for_mail" value="30"/>
    <parameter key="encoding" value="SYSTEM"/>
    <process expanded="true">
      <operator activated="true" class="retrieve" compatibility="9.3.001" expanded="true" height="68" name="Retrieve 1º DESAFIO" width="90" x="45" y="34">
        <parameter key="repository_entry" value="//Local Repository/data/1º DESAFIO"/>
      </operator>
      <operator activated="true" class="naive_bayes" compatibility="9.3.001" expanded="true" height="82" name="Naive Bayes" width="90" x="179" y="34">
        <parameter key="laplace_correction" value="true"/>
      </operator>
      <operator activated="true" class="apply_model" compatibility="9.3.001" expanded="true" height="82" name="Apply Model" width="90" x="313" y="34">
        <list key="application_parameters"/>
        <parameter key="create_view" value="false"/>
      </operator>
      <operator activated="true" class="performance" compatibility="9.3.001" expanded="true" height="82" name="Performance" width="90" x="447" y="34">
        <parameter key="use_example_weights" value="true"/>
      </operator>
      <connect from_op="Retrieve 1º DESAFIO" from_port="output" to_op="Naive Bayes" to_port="training set"/>
      <connect from_op="Naive Bayes" from_port="model" to_op="Apply Model" to_port="model"/>
      <connect from_op="Naive Bayes" from_port="exampleSet" to_op="Apply Model" to_port="unlabelled data"/>
      <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
      <connect from_op="Performance" from_port="performance" to_port="result 2"/>
      <connect from_op="Performance" from_port="example set" to_port="result 1"/>
      <portSpacing port="source_input 1" spacing="0"/>
      <portSpacing port="sink_result 1" spacing="0"/>
      <portSpacing port="sink_result 2" spacing="0"/>
      <portSpacing port="sink_result 3" spacing="0"/>
    </process>
  </operator>
</process>



Answers

  • Xandesaraiva_ Member Posts: 2 Contributor I
    The goal is to find out (starting at number 892) whether each person survived or not. Databases (train)
  • sgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
    Hi @Xandesaraiva_, great to have you here! Could you please give us a bit more info? Which Kaggle challenge is this? There are thousands. Also, the XML you posted is cut off. Could you please try editing your message or posting a new reply with a clean copy?

    Scott
  • varunm1 Moderator, Member Posts: 1,207 Unicorn
    Hello @Xandesaraiva_

    From your .rmp file, I can see that you are training and testing on the same dataset, so the accuracy you measured does not show how the model performs on data it has not seen. If you are interested in learning good validation methods or how RapidMiner processes work, you can go to the RapidMiner Academy, which has easy-to-understand tutorial videos.

    https://academy.rapidminer.com/
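
    As a rough sketch of one such validation method, the modelling operators can be nested inside a Cross Validation operator so that each fold is scored on rows the model did not see during training. The fragment below is an illustration under assumptions, not the original poster's process: the operator class `concurrency:cross_validation`, the `number_of_folds` key, and the inner port names are written from memory of RapidMiner 9.x and should be checked by building the same layout in the Design view. It also assumes the survival column already has the label role, and it is a fragment meant to sit inside the outer `<process expanded="true">` element in place of the Naive Bayes, Apply Model and Performance operators.

    ```xml
    <!-- Sketch only: class, parameter and port names are assumptions to verify in RapidMiner Studio -->
    <operator activated="true" class="concurrency:cross_validation" compatibility="9.3.001" expanded="true" name="Cross Validation">
      <!-- 10 folds is an illustrative value, not one taken from the thread -->
      <parameter key="number_of_folds" value="10"/>
      <!-- Training subprocess: runs once per fold on that fold's training partition -->
      <process expanded="true">
        <operator activated="true" class="naive_bayes" compatibility="9.3.001" expanded="true" name="Naive Bayes">
          <parameter key="laplace_correction" value="true"/>
        </operator>
        <connect from_port="training set" to_op="Naive Bayes" to_port="training set"/>
        <connect from_op="Naive Bayes" from_port="model" to_port="model"/>
      </process>
      <!-- Testing subprocess: the fold's held-out rows are scored here -->
      <process expanded="true">
        <operator activated="true" class="apply_model" compatibility="9.3.001" expanded="true" name="Apply Model">
          <list key="application_parameters"/>
        </operator>
        <operator activated="true" class="performance" compatibility="9.3.001" expanded="true" name="Performance"/>
        <connect from_port="model" to_op="Apply Model" to_port="model"/>
        <connect from_port="test set" to_op="Apply Model" to_port="unlabelled data"/>
        <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
        <connect from_op="Performance" from_port="performance" to_port="performance 1"/>
      </process>
    </operator>
    <!-- Outer wiring: feed the retrieved data in, send the averaged performance to the results -->
    <connect from_op="Retrieve 1º DESAFIO" from_port="output" to_op="Cross Validation" to_port="example set"/>
    <connect from_op="Cross Validation" from_port="performance 1" to_port="result 1"/>
    ```

    With a layout like this, the reported accuracy is averaged over the held-out folds. It is often lower than the figure obtained by scoring the training rows, but it is a more honest estimate of how the model would do on new data.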

    If you need more information or any help with the processes we are happy to help as well.

    Thanks
    Regards,
    Varun
    https://www.varunmandalapu.com/

