[SOLVED] Standard Deviation
I'm aware of operators for average, min, max.. etc.. But I see none for standard deviation. I'm trying to filter data to include only values that are within 3 standard deviations from the mean for a given attribute. How is the best way to go about this in Rapidminer?
Thanks,
Josh
Thanks,
Josh
0
Best Answer
-
dan_agape Member Posts: 106
Maven
Hi Josh,
Use Generate Attribute to make a copy of the given attribute (assume C is the new attribute), then use Normalize to modify the values of C using the Z-transformation method, and then use Filter Examples to keep only the rows for which the values of C are between -3 and 3 in the dataset. Finally you can discard the attribute C.
Dan1
Answers
u can use generate attribute operator to implement the STD formula and then use filter examples operator.
Or try the operator Weight by deviation.
Thanks,
Josh