Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
"Filter Top K samples"
Hello everybody,
I have a question regarding the filtering of samples. I would like to filter my samples like the Top 10% of attribute X. I know it is possible to use the "Filter Examples" operator but as far as I know it can only use a static value as filter like X>=1.
Does anybody know a way to tackle my problem?
Thanks in advance
I have a question regarding the filtering of samples. I would like to filter my samples like the Top 10% of attribute X. I know it is possible to use the "Filter Examples" operator but as far as I know it can only use a static value as filter like X>=1.
Does anybody know a way to tackle my problem?
Thanks in advance
0
Answers
you can use a combination of sort, generate ID and a Filter examples to extract the top k in attribute X. If you want to have the top k % you simply need to provide the sample Size or extract it using aggregate and extract macro
Attached is a example process to select the top 3 values of att1 in the iris dataset
Dortmund, Germany