Options

Inspecting the examples indicated by confusion matrix--possible?

tmurraytmurray Member Posts: 2 Contributor I
edited November 2018 in Help
Hi everyone,

Loving getting to know RapidMiner and I've gotten myself deep into it.  I am doing multi-class classification using several methods (random forest at the moment) and looking at the confusion matrix in the Performance output.  The cells of the confusion matrix tell me where the model is in error and give hints about new attributes I might try.  My question is:

Is there an easy way to click on or indicate a cell in the matrix and see a list of the examples that it refers to?  (e.g. to answer a question like "what is it about all these examples in category-8 that get misclassified into category-7?).

[As a side question, I have 8 categories, 6000 examples, and about 50 attributes.  Initial messing around shows random forest seems best (with certain parameters), gives about 50% accuracy, which is a pretty good information gain over random guess.  Does anyone have sage advice about what types of models work well with so many attributes and categories?]

--many thanks, Tom

Answers

  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,510 RM Data Scientist
    Hi tom,

    if you do a X-Prediction instead of a X-Validation, you get the scored example set. Afterwards you can use a Filter examples or the filter in the example set result view to get only the misclassified.
    Of course you might be careful not to overtrain by hand doing that.

    Cheers,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Options
    tmurraytmurray Member Posts: 2 Contributor I
    thanks Martin, will try!
Sign In or Register to comment.