RapidMiner 9.7 is Now Available
Lots of amazing new improvements including true version control! Learn more about what's new here.
Setting Ranges in a Declare Missing Value Operator
I have an attribute in my data set containing millions of examples (observations). This attribute, Crime_Instances, has no missing values, but has about 25 distinct outlier values comprising less than 1% of the total examples. I would like to use a Declare Missing Values operator to submit an expression such as Crime_Instances > 8, so that all examples that have a value of more than 8 in the Crime_Instances attribute would be set to missing. It wouldn't matter if one example had 15 in this attribute, while another had 25 and another had 90, they would all be set to missing using a single expression in a single operator. This feature would be very useful to me.
As a side note, Sebastian suggested a method for accomplishing this and it did work, though it required six extra operators to achieve my desired result, so it's not as if I can't do it, I just can't do it very easily right now.