How to filter out one class of the label?

Miner_MarkusMiner_Markus Member Posts: 3 Newbie
edited July 2019 in Help

Hi together,

I want to analyze some data in regard to a label that has 3 classes. Due to bad data quality in one of the classes and because it is also not of interest, I want to focus only on 2 classes of the label. I used "Filter Example" to delete all examples, that are labeled with the label, that is not of interest. But in my model (decision tree) it is still displayed when I click on the leafs, for examples as (25/8/0) or (6/32/0) etc. I would like to eliminate the 0 in those brackets.

Somebody has an idea? :)


Best Answer

  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    edited July 2019 Solution Accepted
    Hello @Miner_Markus

    Once you filter the examples, then connect the "exa" port of filter examples operator to another operator called "Guess Types". In the parameters of guess types operator select attribute filter type "Single" and in attribute select your desired label attribute. Also, you should tick "include special attributes". This will remove that class from your tree.

    This trick from @IngoRM works well. I scratched my head so hard to remember this operator name.

    Be Safe. Follow precautions and Maintain Social Distancing


  • Miner_MarkusMiner_Markus Member Posts: 3 Newbie
    Perfect, thank you!
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,485 RM Data Scientist

    while this works, the usual way to do this is the Remove Unused Values operator. I think guess types calls this in the backend, but also does a few more things.

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.