Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

how to select important variables for my model

ed1305ed1305 Member Posts: 8 Learner III
I have a large data set .I want to reduce the attribute count and keep only the important ones.So how do i do that in rapidminer using the whole data set as input

Best Answer

Answers

  • varunm1varunm1 Member Posts: 1,207 Unicorn
    edited February 2019
    Hi @ed1305

    You can use feature selection techniques like "Optimize selection" operator or any other relevant operator provided by RapidMiner.  You can find how to use them in a tutorial of that operator.

    Sometimes, I just run automodel to check which attributes are in red, yellow and green and select only attributes that are in green and yellow for model training.
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • Brian_WellsBrian_Wells Member Posts: 6 Contributor II
    If you haven't already seen Ingo's feature engineering webinar, I would highly recommend taking a few minutes to watch it (link below).  His explanation is extremely intuitive and easy to connect with and will leave you with a gut level (verses academic) understanding of how it works and its true power to transform a dataset so that it can be represented by a wider array of models.  This could drastically reduce training time, make implementation easier, and allow the model to be more easily explained to the traditional skeptics of more advanced "machine learning" techniques.




Sign In or Register to comment.