Options

[SOLVED] Weights in a weka random forest model

misanthropic789misanthropic789 Member Posts: 8 Contributor II
edited November 2018 in Help
First off, apologies if any of this sounds ignorant.  I'd appreciate any insight, as I am making the transition from working with SPSS doing logistic regression prediction models to exploring other types of models in rapidminer.

In a logistic regression model you are given the coefficients, which can be used to approximate the importance of each element in the model.  You can use them to tell whether, for example, predictor A is 3x as important as predictor B.  My management has gotten used to seeing that type of information.

I am now working with rapidminer and have hit upon a weka random forest model that is substantially better at predicting (per 10-fold  x-validation testing) than my original logistic regression model.  However my management is asking for some comparable indicator of comparative importance.  Can anyone suggest an appropriate way to get that type of information?  What objects?  What should I be looking for?

Rebecca

Answers

  • Options
    MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi Rebecca,

    a big disantvantage of Random Forests is infact that it does not create an easily interpretable model (with respect to information about the data). It basically creates a number of decision trees on different random subsets of the input data. On application it applies all trees and returns the result of a majority vote of all trees. Thus is it de-facto impossible to learn anything from the model, even if it often produces good prediction results.

    Best, Marius
  • Options
    misanthropic789misanthropic789 Member Posts: 8 Contributor II
    Thanks for getting back to me Marius.

    My current plan is to include a Weight by Information Gain step in the run to get some comparative information for that purpose, even if it isn't quite the same as the final model.  That should satisfy management while still giving me a chance to use the best model for my project.
Sign In or Register to comment.