Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
How can I count the number of the missing values in a row?
I'm a beginner, and I would like to know that the data lack in a row is repairable or not. It depends on the number o the missing values in a row therefore I would like to know how can I calculate it.
Tagged:
0
Answers
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
[edit: I just realized that this is totally wrong ;-). The operator does not calculate this attribute, it is just created in the tutorial process of the operator. The tutorial process also uses loop attributes as @Telcontar120 is using. So I don't know of a better method. You may still want to have a look into the operator if you want to filter Examples depending on the number of missings per Example.]
The "Filter Examples with Missing Values" operator from the Operator Toolbox extension does the trick. The actual goal of the operator is to remove Examples (=rows) with more than a specific number of missing attributes (this can be configured). But it also adds a new attribute with the number of missing attributes for each row. If you set the filter method to "keep all" (and maybe include special attributes, if you want to count them too), it is exactly what you are looking for.
Best regards,
Fabian