Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

First steps of cleansing

WenkyWongWenkyWong Member Posts: 1 Learner I
edited November 2019 in Help
Hi,

For a schoolproject I have to build a model that achieves the highest accuracy as possible. I have a retrieved a train set and test set from the professor, but I am wondering what would be the first steps of the data cleansing. 

I've attached the train_set and test_set as well. The label variable will be the price_category.

Anyone?

Comments

  • lionelderkrikorlionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    Hi @WenkyWong,

    As a starting point, I would use Turbo Prep

    Click on CLEANSE --> AUTO CLEANSING

    Turbo prep will clean automatically your dataset for you.
    Then you can open the process RapidMiner has builded for you and see the different operations RapidMiner has performed on your initial dataset.

    Then, I would use Auto-Model to train several models with your cleaned training dataset.

    Hope this helps,

    Regards,

    Lionel 
Sign In or Register to comment.