how to select important variables for my model

ed1305ed1305 Member Posts: 8 Contributor I
I have a large data set .I want to reduce the attribute count and keep only the important ones.So how do i do that in rapidminer using the whole data set as input

Best Answer


  • Options
    varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    edited February 2019
    Hi @ed1305

    You can use feature selection techniques like "Optimize selection" operator or any other relevant operator provided by RapidMiner.  You can find how to use them in a tutorial of that operator.

    Sometimes, I just run automodel to check which attributes are in red, yellow and green and select only attributes that are in green and yellow for model training.

    Be Safe. Follow precautions and Maintain Social Distancing

  • Options
    Brian_WellsBrian_Wells Member Posts: 6 Contributor II
    If you haven't already seen Ingo's feature engineering webinar, I would highly recommend taking a few minutes to watch it (link below).  His explanation is extremely intuitive and easy to connect with and will leave you with a gut level (verses academic) understanding of how it works and its true power to transform a dataset so that it can be represented by a wider array of models.  This could drastically reduce training time, make implementation easier, and allow the model to be more easily explained to the traditional skeptics of more advanced "machine learning" techniques.

Sign In or Register to comment.