"Filter Top K samples"
Hello everybody,
I have a question regarding the filtering of samples. I would like to keep only the samples in the top 10% of attribute X. I know it is possible to use the "Filter Examples" operator, but as far as I know it can only filter against a static value, such as X>=1.
Does anybody know a way to tackle my problem?
Thanks in advance
Answers
You can use a combination of Sort, Generate ID, and Filter Examples to extract the top k examples by attribute X: sort by X in descending order, generate IDs so that each example gets its rank, then filter on the ID. If you want the top k % instead, you simply need to provide the sample size, or extract it using Aggregate and Extract Macro.
Attached is an example process that selects the top 3 values of att1 in the Iris dataset.
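For readers who want to see the logic outside of RapidMiner, here is a minimal sketch of the same sort / generate ID / filter idea in plain Python. It is only an illustration, not the attached process: the function names `top_k` and `top_percent`, the sample data, and the choice to round the percentage up to a whole number of rows are all assumptions of mine.

```python
import math


def top_k(rows, attribute, k):
    """Keep the k rows with the largest value of `attribute`."""
    # Step 1 (Sort): order the rows by the attribute, largest first.
    ranked = sorted(rows, key=lambda row: row[attribute], reverse=True)
    # Step 2 (Generate ID): number the sorted rows 1, 2, 3, ...
    with_ids = [{**row, "id": i} for i, row in enumerate(ranked, start=1)]
    # Step 3 (Filter Examples): a static filter on the id now selects the top k.
    return [row for row in with_ids if row["id"] <= k]


def top_percent(rows, attribute, percent):
    """Keep the top `percent` % of rows by `attribute`."""
    # Derive k from the sample size (assumption: round up to a whole row).
    k = math.ceil(len(rows) * percent / 100)
    return top_k(rows, attribute, k)


# Hypothetical data: ten samples with a single attribute X.
samples = [{"X": value} for value in [5, 1, 9, 3, 7, 2, 8, 4, 6, 10]]

top3 = top_k(samples, "X", 3)
top10pct = top_percent(samples, "X", 10)
```

The point of the ID step is that it turns a relative question ("top 10%") into a static one ("id <= k"), which is exactly the kind of condition a fixed filter can handle.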