Filtering a collection by criteria

pblack476 Member Posts: 83 Maven
edited February 2020 in Help
I am on a quest to retrieve useful data from PDF.

I already conquered the first battle with the Table Extraction extension. I am now faced with another challenge:

How do I filter a collection? Let's say I want to ignore example sets with fewer than 10 examples in a collection and output a collection of all the other example sets. How can I go about doing that?

Best Answer


  • BalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi @pblack476,

    Try the Loop Collection operator.
    In each loop execution you'll get one example set. You can then use, for example, Extract Macro to determine the number of examples, and conditionally return the example set or not.
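
For readers who think more easily in code, here is a minimal sketch of the same logic in Python. It is not a RapidMiner process and does not use any RapidMiner API: the name `filter_collection`, the `min_examples` parameter, and the modelling of an example set as a plain list of rows are all assumptions made for illustration. The comments note which operator each step loosely corresponds to.

```python
from typing import Dict, List

# Assumption: an "example set" is modelled as a list of row dictionaries,
# and a "collection" as a list of such example sets.
ExampleSet = List[Dict[str, object]]


def filter_collection(collection: List[ExampleSet],
                      min_examples: int = 10) -> List[ExampleSet]:
    """Return only the example sets with at least `min_examples` rows."""
    kept: List[ExampleSet] = []
    for example_set in collection:        # Loop Collection: one example set per iteration
        n_examples = len(example_set)     # Extract Macro: read the number of examples
        if n_examples >= min_examples:    # Branch: test the condition
            kept.append(example_set)      # "then" side: pass the example set through
        # "else" side is left empty, so nothing is returned for this iteration
    return kept


# Usage: one small set, one large set, and one empty set.
small = [{"id": i} for i in range(3)]
large = [{"id": i} for i in range(12)]
result = filter_collection([small, large, []])
```

Leaving the "else" side empty is what keeps the loop simple: iterations that fail the condition contribute nothing to the output collection.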


  • pblack476 Member Posts: 83 Maven
    Oh wow. I did not know the Branch operator could have empty connections inside. That solves it. I was trying to do just that, but filtering out the "else" examples was getting really complex.

    Thanks to both of you!