Correlate data against criteria
Hi guys - I am a newbie looking for some advice on a specific analysis I would like to perform.
I have a pile of fluid sample data that a laboratory has graded as 'Normal', 'Abnormal' or 'Caution'. I would like to correlate the remaining data against those grades, with a view to understanding the reason for each grading.
So, I have given up trying to use text analysis as an input to the correlation matrix - that would have been too good!
I have achieved some results by making three columns of zeros and ones. For example, a 'Caution' column where all 'Caution' rows are populated with a one and all other rows with a zero, with similar columns for 'Abnormal' and 'Normal'.
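To make the zeros-and-ones approach concrete outside RapidMiner, here is a rough Python sketch of what I am doing. The measurement names (iron_ppm, viscosity) and all of the values are made up purely for illustration and are not my real data, and the helper names pearson and indicator are my own. It builds one indicator column per grade and computes a plain Pearson correlation between each indicator and each measurement.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists.
    Raises ZeroDivisionError if either list is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def indicator(grades, label):
    """1 where the grade matches the label (ignoring case), else 0."""
    return [1 if g.lower() == label.lower() else 0 for g in grades]

# Hypothetical sample data, invented for this sketch only.
grades = ["Normal", "Normal", "Caution", "Abnormal", "Caution", "Normal"]
measurements = {
    "iron_ppm": [5.0, 6.0, 20.0, 45.0, 22.0, 4.0],
    "viscosity": [100.0, 101.0, 99.0, 80.0, 98.0, 102.0],
}

labels = ["Normal", "Caution", "Abnormal"]

# One row per grade, one correlation per measurement column.
correlations = {
    label: {
        name: pearson(indicator(grades, label), values)
        for name, values in measurements.items()
    }
    for label in labels
}

for label, row in correlations.items():
    for name, r in row.items():
        print(f"{label:>8} vs {name:<10} r = {r:+.2f}")
```

Each number only says how strongly a single measurement moves with a single grade, one pair at a time, which is part of why I suspect this is not the best way to explain the gradings.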
While the above has yielded an interesting result, I am certain I could be doing this in a better way?
Any assistance appreciated, thanks