Question regarding linear regression model output
Hi RapidMiner Community
I built a linear regression model and tested its performance with cross validation. The output is a linear function:
If I insert the values from Row No. 12 (a distance of 48 and a WTG quantity of 1) into the output function, the result is 48,381.73. However, the model predicts 60,651.
Does anyone know how the 'predict' column in cross validation is computed from the input variables, and why it differs from the result of the linear regression model?
Thanks in advance for taking your time to read my question.
Kind regards
Aksel
Hi @akselerator,
This is because during the 10-fold cross validation, RapidMiner produces 10 different models, each one trained on the data without one of the folds.
However, the model delivered at the output is built with the entire dataset.
Thus the models of each cross validation fold are different from the "production" model (the equation you showed).
That's why you cannot reproduce the predictions of the cross validation models with the equation of the "production" model.
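To make this concrete, here is a rough sketch in plain Python (not RapidMiner itself). It assumes a made-up dataset with a single input variable and a hand-written least-squares fit, so the names `fit_line`, `xs` and `ys` are purely illustrative and the numbers have nothing to do with your data. It only shows the general idea: each row is predicted by a model that never saw that row, while the final equation comes from a model trained on all rows.

```python
import random

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b

random.seed(0)

# Synthetic data (an assumption for illustration only).
xs = [float(i) for i in range(1, 51)]
ys = [1000.0 * x + random.gauss(0, 5000) for x in xs]

# Split the row indices into 10 folds.
k = 10
indices = list(range(len(xs)))
random.shuffle(indices)
folds = [indices[i::k] for i in range(k)]

fold_models = []
cv_predictions = {}
for fold in folds:
    held_out = set(fold)
    # Train on every row that is NOT in the current fold.
    train_x = [xs[i] for i in indices if i not in held_out]
    train_y = [ys[i] for i in indices if i not in held_out]
    a, b = fit_line(train_x, train_y)
    fold_models.append((a, b))
    # The held-out rows are predicted by this fold's model only.
    for i in fold:
        cv_predictions[i] = a + b * xs[i]

# The "production" model is trained on the entire dataset.
final_a, final_b = fit_line(xs, ys)
final_predictions = {i: final_a + final_b * xs[i] for i in indices}

# Largest gap between a cross validation prediction and the final equation.
max_gap = max(abs(cv_predictions[i] - final_predictions[i]) for i in indices)

print("final model:", final_a, final_b)
for n, (a, b) in enumerate(fold_models, start=1):
    print("fold", n, "model:", a, b)
print("largest prediction gap:", max_gap)
```

Each fold prints slightly different coefficients, so the value in the prediction column for a given row will generally not match what you get by plugging that row into the final equation.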
I hope it is clear
Regards,
Lionel
Thank you so much. That makes a lot of sense.
Kind regards,
Aksel