
# Linear Regression Prediction - wrong calculation?

Hi, I've used Linear Regression to analyze the data below:

| Row | interest_rate (label) | credit_score |
|---|---|---|
| 1 | 7.31 | 500.0 |
| 2 | 6.7 | 600.0 |
| 3 | 5.95 | 700.0 |
| 4 | 6.4 | 700.0 |
| 5 | 5.4 | 800.0 |
| 6 | 5.7 | 800.0 |
| 7 | 5.9 | 750.0 |
| 8 | 7.0 | 550.0 |
| 9 | 6.5 | 650.0 |
| 10 | 5.7 | 825.0 |

The Linear Regression model shown under "Description" is:

\- 0.005 * credit_score + 10.000

When I applied this model to the same data set, I got the output below:

| Row No | interest_rate | prediction(interest_rate) | credit_score |
|---|---|---|---|
| 1 | 7.31 | 7.277008403361254 | 500.0 |
| 2 | 6.7 | 6.732470588235252 | 600.0 |
| 3 | 5.95 | 6.18793277310925 | 700.0 |
| 4 | 6.4 | 6.18793277310925 | 700.0 |
| 5 | 5.4 | 5.643394957983249 | 800.0 |
| 6 | 5.7 | 5.643394957983249 | 800.0 |
| 7 | 5.9 | 5.915663865546249 | 750.0 |
| 8 | 7.0 | 7.004739495798253 | 550.0 |
| 9 | 6.5 | 6.460201680672251 | 650.0 |
| 10 | 5.7 | 5.507260504201748 | 825.0 |

Using the linear model, the predicted interest_rate for credit_score 500 (row 1) should be:

-0.005 * 500 + 10 = 7.5

Please share any thoughts on the discrepancy in the prediction (7.5 vs. 7.2770 in row 1).

Thank you


## Answers

The results are correct.

The apparent discrepancy comes from how RapidMiner displays the results.

In the result view, all numbers are rounded for display (to three decimal places in the description above). If you copy and paste the coefficients into another editor, you will see that the actual values are:

-0.0054453781512600165 and 9.999697478991262. With these numbers you get:

-0.0054453781512600165 * 500 + 9.999697478991262 = 7.277008
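The same arithmetic can be checked for every row outside RapidMiner. The snippet below is only a sketch in plain Python: the `predict` helper and the constant names are made up for illustration, and the coefficients are simply the displayed and the full-precision values quoted in this thread.

```python
# Coefficients as displayed in the model description (rounded for display).
SHOWN_SLOPE, SHOWN_INTERCEPT = -0.005, 10.000
# Full-precision coefficients quoted in this answer.
FULL_SLOPE, FULL_INTERCEPT = -0.0054453781512600165, 9.999697478991262


def predict(credit_score, slope, intercept):
    """Evaluate the one-variable linear model: slope * credit_score + intercept."""
    return slope * credit_score + intercept


# Credit scores from the ten rows of the question.
credit_scores = [500, 600, 700, 700, 800, 800, 750, 550, 650, 825]

for score in credit_scores:
    rounded = predict(score, SHOWN_SLOPE, SHOWN_INTERCEPT)
    full = predict(score, FULL_SLOPE, FULL_INTERCEPT)
    # The gap grows with the credit score because the slope is the rounded term
    # that gets multiplied by it.
    print(f"{score:>4}  rounded={rounded:.4f}  full={full:.6f}  gap={rounded - full:.4f}")
```

For credit_score 500 this prints 7.5000 for the rounded coefficients and 7.277008 for the full-precision ones, which are exactly the two numbers being compared in the question.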

You can set the number of digits displayed under Settings -> Preferences -> General.
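If you want to confirm where the unrounded coefficients come from, you can refit the ten rows yourself. The sketch below uses the textbook closed-form least-squares formulas in plain Python. It assumes a plain ordinary least-squares fit, which is not necessarily exactly what the Linear Regression operator does internally (it has options such as a small ridge term), so expect agreement to many decimal places rather than bit-for-bit equality. The function name `fit_simple_ols` is hypothetical.

```python
def fit_simple_ols(xs, ys):
    """Closed-form least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# The ten rows from the question.
credit_score = [500, 600, 700, 700, 800, 800, 750, 550, 650, 825]
interest_rate = [7.31, 6.7, 5.95, 6.4, 5.4, 5.7, 5.9, 7.0, 6.5, 5.7]

slope, intercept = fit_simple_ols(credit_score, interest_rate)

print("full precision:", slope, intercept)
# Rounding to three decimals reproduces what the model description displays.
print("as displayed:  ", round(slope, 3), round(intercept, 3))
print("row 1 prediction:", slope * 500 + intercept)
```

Rounded to three decimals, the fitted values collapse to -0.005 and 10.0, which is why the description looks like a different model than the one that produced the predictions.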

Best,

David
