How can I compare statistically two datasets in RapidMiner
data:image/s3,"s3://crabby-images/47bc7/47bc714bae8a0bcaf51cae3ce4c40b4a7130e657" alt="hapaydin"
Hi,
I have two data sets. First measured values, second calculated using generate attributes. I would like to compare these data sets statistically. I prefer Leave-one-out Cross Validation [https://en.wikipedia.org/wiki/Cross-validation_(statistics)], but usage of Cross Validation is different in RapidMiner (Divide datasets as training and validation).
Any suggestion?
Data:
3 independent variable( > 200 examples)
1 dependent variable
1 predicted variable
Answers
-
0
-
-
-
Hi @hapaydin,
You have directly in RapidMiner access to the staistics of Runoff(M) and Runoff(Pre) in the Statistics
panel of the Results and then compare their statistics :
or you can use the Charts panels to represent your two datasets (Here an example using histograms) :
If it's not enough, can you precise what you want .
Regards,
Lionel
1 -
Hi again @hapaydin,
To go further et to complete my last reply, you can explore the different operators
of the Statistics Extension (to download and install from the Marketplace).
I hope it will be helpful.
Regards,
Lionel
1 -
Thank you for your kind interst. I will examine Statistics Extension
1