Hello, I have a RapidMiner homework assignment. Can anybody help me?

deledele Member Posts: 4 Newbie

You will use the following process:

1. Based on the training dataset, create a training sample and a validation sample by splitting the data into two groups. Steps 2-5 below will then be performed on the training and the validation data.

2. Set up the dependent variable.

3. Make a preliminary assessment of the relative importance of the explanatory variables using visualization tools and simple descriptive statistics.

4. Estimate the classification model using the training data, and interpret the results.

5. Assess the accuracy of classification with the validation sample, possibly repeating steps 2-5 a few times, changing the classifier in different ways to increase performance.

6. Finally, score each observation of the scoring dataset and determine the list of applicants with a good credit risk (probability equal to or higher than 0.80) that the marketing department will be able to contact.
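
To make the steps above concrete outside of RapidMiner, here is a rough sketch in plain Python. It is only an illustration under several assumptions: the data is synthetic, the feature names `income` and `debt` and the applicant ids are made up, and logistic regression is used as the classifier even though the assignment does not name one. The function names (`split_rows`, `fit_logistic`, `select_good_risks`) are hypothetical helpers, not anything from RapidMiner.

```python
import math
import random


def split_rows(rows, train_fraction=0.7, seed=0):
    """Shuffle a copy of the rows and split it into (training, validation)."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(weights, features):
    """Estimated probability that an applicant is a good credit risk."""
    z = weights[0] + sum(w * x for w, x in zip(weights[1:], features))
    return sigmoid(z)


def fit_logistic(rows, epochs=1000, learning_rate=0.5):
    """Fit logistic regression by batch gradient descent.

    rows is a list of (features, label) pairs with label 1 = good, 0 = bad.
    weights[0] is the intercept.
    """
    n_features = len(rows[0][0])
    weights = [0.0] * (n_features + 1)
    for _ in range(epochs):
        grad = [0.0] * (n_features + 1)
        for features, label in rows:
            error = predict_proba(weights, features) - label
            grad[0] += error
            for j, x in enumerate(features):
                grad[j + 1] += error * x
        weights = [w - learning_rate * g / len(rows)
                   for w, g in zip(weights, grad)]
    return weights


def accuracy(weights, rows, cutoff=0.5):
    """Share of rows whose predicted class matches the label."""
    hits = sum((predict_proba(weights, f) >= cutoff) == bool(y)
               for f, y in rows)
    return hits / len(rows)


def select_good_risks(weights, applicants, threshold=0.80):
    """Return ids of applicants whose P(good) is at least the threshold."""
    return [applicant_id for applicant_id, features in applicants
            if predict_proba(weights, features) >= threshold]


# Synthetic stand-in for the labeled training spreadsheet (assumption):
# two features scaled to [0, 1], label 1 when income exceeds debt.
rng = random.Random(42)
labeled = []
for _ in range(200):
    income, debt = rng.uniform(0, 1), rng.uniform(0, 1)
    labeled.append(([income, debt], 1 if income > debt else 0))

# Step 1: split; step 4: estimate; step 5: validate.
train, valid = split_rows(labeled)
weights = fit_logistic(train)
print("validation accuracy:", accuracy(weights, valid))

# Step 6: score unlabeled applicants (made-up ids and values) at 0.80.
scoring = [("A1", [0.9, 0.1]), ("A2", [0.2, 0.8]), ("A3", [0.55, 0.45])]
contact_list = select_good_risks(weights, scoring)
print("applicants to contact:", contact_list)
```

The 0.80 cutoff is applied to the predicted probability rather than to the predicted class, which is why scoring uses `predict_proba` directly instead of the 0.5 cutoff used for the accuracy check.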


  • deledele Member Posts: 4 Newbie
    I have Excel files with the training data and the scoring data.
  • NeuralMarket Member Posts: 13 Contributor II
    I'm sure you can use one of the templates in Studio to build out your process.