How do I introduce random/new data into a Neural Net model?

FrancoisR · October 2023

I have built a (supervised training) Neural Net model to predict instances of Diabetes. How do I introduce new patient data to the model to get predictions on these patients?

Image: https://us.v-cdn.net/6030995/uploads/editor/tf/7032iqc4a35e.png

MartinLiebig · October 2023

Hi,

what you did was using a Feature Selection, whose task it is to give you the best possible columns to build a model on, by using a model. You would need to use another ANN on this and then the Apply Model operator to score unknown data.

Have you seen the academy? academy.rapidminer.com should help you a lot.

Best,

Martin

CKönig · October 2023

Hello @FrancoisR,

in this case, you would need to rebuild the model on the whole dataset (old and new combined) to create a better model. This is the usual way to handle new data. This also helps you to discard possibly outdated information from your training dataset, depending on your actual use case.

However, if the algorithm for creating a machine learning model supports it, we do have the option to update pre-trained models with new data. This capability can be checked in the Operator Information:

An example for an updatable model would be a Naive Bayes model. Updating an existing model trained somewhen earlier can be done by the Update Model (surprise!) operator:

Image: https://us.v-cdn.net/6030995/uploads/editor/rq/gp68dxnf1sqj.png

See this example process (copy the whole xml code, click in the background of an empty process and press "Ctrl+V" to paste it there):

<div><?xml version="1.0" encoding="UTF-8"?><process version="10.2.000"></div><div>&nbsp; <context></div><div>&nbsp; &nbsp; <input/></div><div>&nbsp; &nbsp; <output/></div><div>&nbsp; &nbsp; <macros/></div><div>&nbsp; </context></div><div>&nbsp; <operator activated="true" class="process" compatibility="10.2.000" expanded="true" name="Process"></div><div>&nbsp; &nbsp; <parameter key="logverbosity" value="init"/></div><div>&nbsp; &nbsp; <parameter key="random_seed" value="2001"/></div><div>&nbsp; &nbsp; <parameter key="send_mail" value="never"/></div><div>&nbsp; &nbsp; <parameter key="notification_email" value=""/></div><div>&nbsp; &nbsp; <parameter key="process_duration_for_mail" value="30"/></div><div>&nbsp; &nbsp; <parameter key="encoding" value="SYSTEM"/></div><div>&nbsp; &nbsp; <process expanded="true"></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="retrieve" compatibility="10.2.000" expanded="true" height="68" name="Retrieve Deals" width="90" x="45" y="391"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="repository_entry" value="//Samples/data/Deals"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">Old data</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="naive_bayes" compatibility="10.2.000" expanded="true" height="82" name="Naive Bayes" width="90" x="179" y="391"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="laplace_correction" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">create a model based on &amp;quot;old data&amp;quot;</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="retrieve" compatibility="10.2.000" expanded="true" height="68" name="Retrieve Deals-Testset" width="90" x="179" y="187"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="repository_entry" value="//Samples/data/Deals-Testset"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">New data</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="multiply" compatibility="10.2.000" expanded="true" height="103" name="Multiply" width="90" x="313" y="187"/></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="apply_model" compatibility="10.2.000" expanded="true" height="82" name="Apply Old Model" width="90" x="447" y="391"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <list key="application_parameters"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">apply old model on new, unseen data</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="performance_classification" compatibility="10.2.000" expanded="true" height="82" name="Performance Old" width="90" x="849" y="391"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="main_criterion" value="first"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="accuracy" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="classification_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="kappa" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="weighted_mean_recall" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="weighted_mean_precision" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="spearman_rho" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="kendall_tau" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="absolute_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="relative_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="relative_error_lenient" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="relative_error_strict" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="normalized_absolute_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="root_mean_squared_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="root_relative_squared_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="squared_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="correlation" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="squared_correlation" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="cross-entropy" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="margin" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="soft_margin_loss" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="logistic_loss" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="skip_undefined_labels" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="use_example_weights" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <list key="class_weights"/></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="update_model" compatibility="10.2.000" expanded="true" height="82" name="Update Model" width="90" x="581" y="187"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">update the model with new data</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="apply_model" compatibility="10.2.000" expanded="true" height="82" name="Apply New Model" width="90" x="715" y="187"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <list key="application_parameters"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">apply the updated model on the new data</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <operator activated="true" class="performance_classification" compatibility="10.2.000" expanded="true" height="82" name="Performance New" width="90" x="849" y="187"></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="main_criterion" value="first"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="accuracy" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="classification_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="kappa" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="weighted_mean_recall" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="weighted_mean_precision" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="spearman_rho" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="kendall_tau" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="absolute_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="relative_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="relative_error_lenient" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="relative_error_strict" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="normalized_absolute_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="root_mean_squared_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="root_relative_squared_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="squared_error" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="correlation" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="squared_correlation" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="cross-entropy" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="margin" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="soft_margin_loss" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="logistic_loss" value="false"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="skip_undefined_labels" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <parameter key="use_example_weights" value="true"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <list key="class_weights"/></div><div>&nbsp; &nbsp; &nbsp; &nbsp; <description align="center" color="transparent" colored="false" width="126">Increased performance on new dataset</description></div><div>&nbsp; &nbsp; &nbsp; </operator></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Retrieve Deals" from_port="output" to_op="Naive Bayes" to_port="training set"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Naive Bayes" from_port="model" to_op="Apply Old Model" to_port="model"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Retrieve Deals-Testset" from_port="output" to_op="Multiply" to_port="input"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Multiply" from_port="output 1" to_op="Update Model" to_port="example set"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Multiply" from_port="output 2" to_op="Apply Old Model" to_port="unlabelled data"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Apply Old Model" from_port="labelled data" to_op="Performance Old" to_port="labelled data"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Apply Old Model" from_port="model" to_op="Update Model" to_port="model"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Update Model" from_port="example set" to_op="Apply New Model" to_port="unlabelled data"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Update Model" from_port="model" to_op="Apply New Model" to_port="model"/></div><div>&nbsp; &nbsp; &nbsp; <connect from_op="Apply New Model" from_port="labelled data" to_op="Performance New" to_port="labelled data"/></div><div>&nbsp; &nbsp; &nbsp; <portSpacing port="source_input 1" spacing="0"/></div><div>&nbsp; &nbsp; &nbsp; <portSpacing port="sink_result 1" spacing="0"/></div><div>&nbsp; &nbsp; </process></div><div>&nbsp; </operator></div><div></process></div><div></div>

Kind regards,

Christian

FrancoisR · October 2023

Excellent, thank you very much

FrancoisR · October 2023

Hi Christian, this is excellent feedback. Thank you very much for assisting me! Kind regards

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

How do I introduce random/new data into a Neural Net model?

Best Answers

Answers