Correlation - need help getting started
Folks- I've just started using Rapid Miner and am trying to calculate a correlation coefficient as a first test.
My data set includes three columns, the first is the key, so the role is set to "id". The second column would be my dependent variable, so I set role to "label". Finally, the third column I set to "regular".
In the Designer, I piped a "retrieve" of the data to a "Correlation Matrix" operator.
When I run the process, the results perspective does not show correlation in the Meta Data View. Under the Statistics column, it only shows avg = 1235 +/- 123, as well as a Range column.
Can anyone tell me how to get it to calculate and display a Pearson correlation coefficient?
Thanks,
RCL1
My data set includes three columns, the first is the key, so the role is set to "id". The second column would be my dependent variable, so I set role to "label". Finally, the third column I set to "regular".
In the Designer, I piped a "retrieve" of the data to a "Correlation Matrix" operator.
When I run the process, the results perspective does not show correlation in the Meta Data View. Under the Statistics column, it only shows avg = 1235 +/- 123, as well as a Range column.
Can anyone tell me how to get it to calculate and display a Pearson correlation coefficient?
Thanks,
RCL1
Tagged:
0
Answers
Four columns: IP ID (Integer, ID); Income (integer, weight); Expenditure (integer, attribute); WC QT (integer, attribute)
Now when I pipe this data retrieve to the correlation matrix, I only get average and range for statistics under Meta Data View.
Where do I calculate a correlation coefficient?
Thanks,
RL
Best,
Marius
This is my process: