🦉 🎤   RapidMiner Wisdom 2020 - CALL FOR SPEAKERS DEADLINE IS NOVEMBER 15   🦉 🎤

CLICK HERE TO GO TO ENTRY FORM

Feature Selection in Multiple Linear Regression

AD2019AD2019 Member Posts: 7 Newbie
edited October 29 in Help
I selected "t-test" as the feature selection method in a multiple regression model, and set alpha to 0.01.  Why is the regression output including an independent variable with a p-value of 0.05? Only when I reduce the alpha to 0.001 does this variable go away.  If I set alpha = 0.01, shouldn't the selected model only show me independent variables whose p-value is less than 0.01?
thank you in advance for answers.
AD

Answers

  • sgenzersgenzer 12Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,581  Community Manager
    hmm @AD2019 let's see if someone like @mschmitz @David_A @IngoRM can help here.

    Scott
    ----------------------
    Don't forget to submit your great ideas for Wisdom 2020! Deadline is November 15.

    Wisdom 2020 – Call for Speakers Form 
  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,267   Unicorn
    Can you be more specific about what you are experiencing or provide sample xml or output?
    Do you have any attributes that are generated from recoding numerical to nominal?  I think RapidMiner might keep those as a group if so which means some of the individual values have p values greater than the threshold but the overall attribute does not, although it would be good for one of the RM staffers to confirm that.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
Sign In or Register to comment.