Options

# Predicting Numbers

Member Posts: 2 Contributor I
edited November 2018 in Help
Hi All,

I am looking for any operator which can predict numbers. Specialy real numbers. Input is like

Name | Factor1 | Factor2 |  Factor3  |  Factor4   |  Rate
===========================================
A        |   1         |   7         |        6      |     3           | 100
B        |   5         |   3         |        7      |     9           | 150
C        |   3         |   9         |       11     |     2           |  75

I want an operator which could predict Rate if we have values of factors.

• Options
RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
Hi,
we call these type of learning algorithms "regression" and there is a really great number of learners able to do regression. Take a look on the operator info on the learner capabilities. Numerical_label indicates that the learner can predict numbers.
One example for a regression learner is the linear regression.

Greetings,
Sebastian
• Options
Member Posts: 2 Contributor I
Thanks Sebastian,

I tried using linear regression but I am not getting expected result. What I am getting is some sort of mean/ average

Name | Factor1 | Factor2 |  Factor3  |  Factor4  |  Rate  | Predicted*
======================================================
A        |  1        |  7        |        6      |    3          | 100    |  129.23
B        |  5        |  3        |        7      |    9          | 150    |  129.23
C        |  3        |  9        |      11    |    2          |  75    |  129.23

In the output I am getting only one value. I am exceting 100/150/75. Am I doing something wrong.
• Options
Member Posts: 849 Maven
Hi,

In general when people do not spell out what they are doing it is difficult to tell if they are making a mistake, but not in this case...

If I make this into a csv....
ID,          F1,         F2,              F3,          F4,           LA        ,   1         ,   7         ,        6      ,     3           , 100   B        ,   5         ,   3         ,        7      ,     9           , 150   C        ,   3         ,   9         ,       11     ,     2           ,  75
and set things up like this....
<operator name="Root" class="Process" expanded="yes">    <operator name="CSVExampleSource" class="CSVExampleSource">        <parameter key="filename"	value="C:\Users\CJFP\Documents\rm_workspace\joke.csv"/>        <parameter key="label_name"	value="L"/>        <parameter key="id_name"	value="ID"/>    </operator>    <operator name="LinearRegression" class="LinearRegression">        <parameter key="keep_example_set"	value="true"/>    </operator>    <operator name="ModelApplier" class="ModelApplier">        <list key="application_parameters">        </list>    </operator></operator>
Then I get the answers you are expecting