First steps of cleansing
Hi,
For a school project I have to build a model that achieves the highest possible accuracy. I have received a train set and a test set from the professor, but I am wondering what the first steps of data cleansing should be.
I've attached the train_set and test_set as well. The label variable will be the price_category.
Anyone?
Comments
As a starting point, I would use Turbo Prep:
Click on CLEANSE --> AUTO CLEANSING
Turbo Prep will automatically clean your dataset for you.
Then you can open the process RapidMiner has built for you and see the different operations RapidMiner has performed on your initial dataset.
Then, I would use Auto-Model to train several models with your cleaned training dataset.
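To give an idea of what typical first cleansing steps look like outside the GUI, here is a minimal Python sketch. It is not what Turbo Prep generates. It only illustrates three common operations: dropping columns that are mostly empty or constant, filling missing numeric values with the median, and filling missing nominal values with the most frequent value. The rules are learned on the train set and then applied unchanged to the test set. Apart from price_category, everything here is an assumption made for illustration: the column names (rooms, city, notes), the missing-value markers, and the 50% threshold.

```python
from collections import Counter
from statistics import median

# Assumed markers for missing values; adjust to what the real files contain.
MISSING_MARKERS = {"", "?", "na", "nan"}


def is_missing(value):
    return value is None or str(value).strip().lower() in MISSING_MARKERS


def to_float(value):
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def fit_cleanser(rows, label, max_missing_ratio=0.5):
    """Learn cleansing rules from the training rows (a list of dicts)."""
    plan = {}
    for col in rows[0]:
        if col == label:
            continue
        present = [r.get(col) for r in rows if not is_missing(r.get(col))]
        missing_ratio = 1 - len(present) / len(rows)
        # Skip columns that are mostly empty or hold a single value.
        if missing_ratio > max_missing_ratio or len(set(present)) <= 1:
            continue
        numbers = [to_float(v) for v in present]
        if all(n is not None for n in numbers):
            plan[col] = ("numeric", median(numbers))
        else:
            plan[col] = ("nominal", Counter(present).most_common(1)[0][0])
    return plan


def apply_cleanser(rows, plan, label):
    """Apply the learned rules; works for train and test rows alike."""
    cleaned = []
    for r in rows:
        out = {}
        for col, (kind, fill) in plan.items():
            value = r.get(col)
            if kind == "numeric":
                number = None if is_missing(value) else to_float(value)
                out[col] = fill if number is None else number
            else:
                out[col] = fill if is_missing(value) else value
        if label in r:  # the test set may come without the label
            out[label] = r[label]
        cleaned.append(out)
    return cleaned


# Tiny made-up example; the real data would be read from the attached files.
train = [
    {"rooms": "3", "city": "Gent", "notes": "", "price_category": "high"},
    {"rooms": "", "city": "Gent", "notes": "", "price_category": "low"},
    {"rooms": "5", "city": "?", "notes": "x", "price_category": "high"},
    {"rooms": "4", "city": "Brugge", "notes": "", "price_category": "low"},
]
test = [{"rooms": "?", "city": "", "notes": "y"}]

plan = fit_cleanser(train, label="price_category")
clean_train = apply_cleanser(train, plan, label="price_category")
clean_test = apply_cleanser(test, plan, label="price_category")
```

Learning the fill values on the train set only keeps information from the test set out of the model, which matters when the test set is used to judge accuracy.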
Hope this helps,
Regards,
Lionel