Options

When AutoModel asks you to "Deselect" red or yellow attributes, is AutoModel doing feature selection

tonyboy9tonyboy9 Member Posts: 113 Contributor II

Best Answers

  • Options
    rjones13rjones13 Member Posts: 168 Unicorn
    Solution Accepted
    Hi @tonyboy9,

    The colouring of different attributes is based off their measured quality, the results of which you can see on the right hand side columns. For example, the top 2 columns appear to be marked yellow given their zero correlation with your target. When AutoModel is asking you to deselect, it's asking you if you'd like to remove lower quality features based off the measures. Of course, there will be instances where you want to keep certain features anyway, and other times when it would be inappropriate to keep features labelled green/good. 

    Hope this helps,

    Best,
    Roland
  • Options
    rjones13rjones13 Member Posts: 168 Unicorn
    Solution Accepted
    Hi @tonyboy9,

    It comes down to how in depth you want to improve the input data to your model to improve predictive power. The AutoModel stage which you showed is a set of statistical measures where we grade each variable and then select or deselect based on quality - this is a first step of feature selection. What you'll find is that later on when you use AutoModel, you get the opportunity to do advanced Feature Selection and Feature generation. This would be termed as a topic "Feature Engineering". I'd recommend this video as a nice starting point on why we do this: https://academy.rapidminer.com/learn/video/feature-engineering-intro

    Within RapidMiner Studio, there's a number of different ways we can approach it, in order to produce the best outcome. 

    Best,
    Roland

Answers

  • Options
    tonyboy9tonyboy9 Member Posts: 113 Contributor II
    Okay, Roland, thanks for that. Perhaps I can take this one step further. What good are all the feature selection operators available to build a process? I tried to follow a response explaining the roles of these operators, which just reads overly complicated. Why not just rely on AutoModel decisions?
Sign In or Register to comment.