
How can we implement dropna() in RapidMiner?

Anusha Member Posts: 19 Maven
Hi All!

I have a dataset that contains NA, N/A, null, NULL, and whitespace-only values (single spaces, tabs) in different cells. I want to remove every row that contains one of these values.
Can anyone guide me?

Source Data:

C1    C2             C3        C4

12    ADNF           NCJK      NA
34    HDDW           CNJ       (single space)
38    CNJKD          JIC       N/A
78    NJDS           NCSW      NULL
90    CJNEK          C JDSK    12NJDNC
08    DNCJS          CSKJ      null
13    (tab space)    bdjf      ndf097

Desired Data:

C1    C2             C3        C4

90    CJNEK          C JDSK    12NJDNC

Thanks in Advance!

Best Answer

  • ceaperez Member Posts: 522 Unicorn
    Solution Accepted
    Hi @Anusha

    The Select Attributes operator gives you several ways to filter your dataset, for example by removing missing values or by working with regular expressions.

    Best
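
For readers who think in code, the regular-expression idea can be sketched outside RapidMiner in plain Python. This is only an illustration under assumptions: the pattern, the names `is_placeholder` and `drop_placeholder_rows`, and the hard-coded rows (copied from the question) are hypothetical and are not RapidMiner operators or settings.

```python
import re

# Assumed placeholder pattern: an empty or whitespace-only cell,
# or exactly one of the tokens NA, N/A, null, NULL.
PLACEHOLDER = re.compile(r"\s*|NA|N/A|null|NULL")

def is_placeholder(cell):
    """Return True if the cell is None or consists only of a placeholder."""
    return cell is None or PLACEHOLDER.fullmatch(cell) is not None

def drop_placeholder_rows(rows):
    """Keep only the rows in which no cell is a placeholder."""
    return [row for row in rows if not any(is_placeholder(c) for c in row)]

# Sample rows from the question, all values kept as strings.
rows = [
    ["12", "ADNF", "NCJK", "NA"],
    ["34", "HDDW", "CNJ", " "],
    ["38", "CNJKD", "JIC", "N/A"],
    ["78", "NJDS", "NCSW", "NULL"],
    ["90", "CJNEK", "C JDSK", "12NJDNC"],
    ["08", "DNCJS", "CSKJ", "null"],
    ["13", "\t", "bdjf", "ndf097"],
]

clean_rows = drop_placeholder_rows(rows)
print(clean_rows)  # [['90', 'CJNEK', 'C JDSK', '12NJDNC']]
```

Because the pattern is applied with `fullmatch`, a value such as "12NJDNC" or "C JDSK" is kept even though it contains the letters "N" or a space; only cells that consist entirely of a placeholder cause the row to be dropped.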

Answers

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,510 RM Data Scientist
    Hi,
    First, use Declare Missing Value to turn those placeholder entries into real missing values; then use Filter Examples with an 'is not missing' condition to remove the affected rows.

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
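
Since the question refers to pandas' dropna(), the same two-step idea (declare the placeholders as missing, then filter out rows with missing values) might look roughly like this in pandas. It is a minimal sketch, not the RapidMiner process itself: the DataFrame is rebuilt by hand from the question's sample, and the `placeholders` list is an assumption about which strings should count as missing.

```python
import numpy as np
import pandas as pd

# Sample data from the question; every value is kept as a string.
df = pd.DataFrame(
    {
        "C1": ["12", "34", "38", "78", "90", "08", "13"],
        "C2": ["ADNF", "HDDW", "CNJKD", "NJDS", "CJNEK", "DNCJS", "\t"],
        "C3": ["NCJK", "CNJ", "JIC", "NCSW", "C JDSK", "CSKJ", "bdjf"],
        "C4": ["NA", " ", "N/A", "NULL", "12NJDNC", "null", "ndf097"],
    }
)

# Step 1: "declare" the placeholder strings as missing (assumed list).
placeholders = ["NA", "N/A", "null", "NULL"]
declared = df.replace(placeholders, np.nan)

# Also treat empty or whitespace-only cells (spaces, tabs) as missing.
declared = declared.replace(r"^\s*$", np.nan, regex=True)

# Step 2: keep only the rows that have no missing value in any column.
cleaned = declared.dropna()
print(cleaned)
```

When the data is loaded from a file, `pandas.read_csv` also accepts an `na_values` argument, so strings like these can be declared as missing at read time instead of afterwards.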