How can we implement dropna() in the rapidminer?

AnushaAnusha Member Posts: 19 Maven
Hi All!

I have a dataset that has NAs, N/A, null, NULL, and multiple spaces in different cells. I just want to remove those particular rows.
Can anyone guide me.

Source Data:

C1                  C2                      C3                C4

12                ADNF                   NCJK               NA
34                HDDW                  CNJ                        -(single space ) 
38               CNJKD                  JIC                  N/A
78                NJDS                    NCSW            NULL
90                 CJNEK                 C JDSK          12NJDNC
08             DNCJS                      CSKJ               null
13                           -(tab space)  bdjf                ndf097

Desired Data:

C1                  C2                    C3                 C4

90                 CJNEK               C JDSK          12NJDNC

Thanks in Advance!

Best Answer

  • Options
    ceaperezceaperez Member Posts: 522 Unicorn
    Solution Accepted
    Hi @Anusha

    Into the Select Attributes operator you have many alternatives to carry out the filtration of your dataset, for example remove the missing values, or work with regular expressions. 



  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,507 RM Data Scientist
    First you use declare missing values to make it a missing, then you can use filter examples with 'is not missing' to remove it.

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.