
How can we implement dropna() in RapidMiner?

Anusha Member Posts: 19 Maven
Hi All!

I have a dataset that contains NA, N/A, null, NULL, and whitespace-only values (spaces or tabs) in different cells. I just want to remove the rows that contain any of them.
Can anyone guide me?

Source Data:

C1    C2             C3        C4

12    ADNF           NCJK      NA
34    HDDW           CNJ       (single space)
38    CNJKD          JIC       N/A
78    NJDS           NCSW      NULL
90    CJNEK          C JDSK    12NJDNC
08    DNCJS          CSKJ      null
13    (tab space)    bdjf      ndf097

Desired Data:

C1    C2             C3        C4

90    CJNEK          C JDSK    12NJDNC

Thanks in Advance!

Best Answer

  • ceaperez Member Posts: 541 Unicorn
    Solution Accepted
    Hi @Anusha

    In the Select Attributes operator you have several options for filtering your dataset, for example removing missing values or working with regular expressions.

    Best

Answers

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
    Hi,
    First, use Declare Missing Value to turn those placeholder entries into real missing values; then use Filter Examples with 'is not missing' to remove the rows that contain them.
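
For readers coming from pandas, the same two-step idea (declare the placeholders as missing, then keep only rows with nothing missing) might look roughly like the sketch below. This is an illustration in pandas, not a RapidMiner process; the sample rows are copied from the question, and the `MISSING_TOKENS` list and variable names are assumptions made for the example.

```python
import pandas as pd

# Sample rows mirroring the question's source data (all values kept as strings).
rows = [
    ("12", "ADNF", "NCJK", "NA"),
    ("34", "HDDW", "CNJ", " "),            # single space
    ("38", "CNJKD", "JIC", "N/A"),
    ("78", "NJDS", "NCSW", "NULL"),
    ("90", "CJNEK", "C JDSK", "12NJDNC"),
    ("08", "DNCJS", "CSKJ", "null"),
    ("13", "\t", "bdjf", "ndf097"),        # tab
]
df = pd.DataFrame(rows, columns=["C1", "C2", "C3", "C4"])

# Assumed placeholder list, taken from the question; "" covers cells that
# are empty once surrounding whitespace has been stripped.
MISSING_TOKENS = ["", "NA", "N/A", "NULL", "null"]

# Step 1 ("declare missing"): flag every cell whose stripped text is a
# placeholder, then blank those cells out as real missing values.
is_placeholder = df.apply(lambda col: col.str.strip().isin(MISSING_TOKENS))
declared = df.mask(is_placeholder)

# Step 2 ("filter, is not missing"): keep only rows with no missing cell.
cleaned = declared.dropna(how="any").reset_index(drop=True)
print(cleaned)
```

With the sample data above, only the row with C1 = 90 remains, which matches the desired output in the question. Matching on the whole stripped cell (rather than a substring) is what keeps values such as "C JDSK" from being treated as missing.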

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany