Inspecting the examples indicated by the confusion matrix -- possible?
Hi everyone,
I'm loving getting to know RapidMiner and have gotten deep into it. I am doing multi-class classification using several methods (random forest at the moment) and looking at the confusion matrix in the Performance output. The cells of the confusion matrix tell me where the model is in error and give hints about new attributes I might try. My question is:
Is there an easy way to click on or indicate a cell in the matrix and see a list of the examples that it refers to? (E.g., to answer a question like "what is it about all these examples in category-8 that get misclassified into category-7?")
[As a side question, I have 8 categories, 6000 examples, and about 50 attributes. Initial experimentation suggests random forest works best (with certain parameters), giving about 50% accuracy, which is a pretty good information gain over a random guess. Does anyone have sage advice about what types of models work well with so many attributes and categories?]
--many thanks, Tom
Answers
If you do an X-Prediction instead of an X-Validation, you get the scored example set. Afterwards you can use a Filter Examples operator, or the filter in the example set result view, to keep only the misclassified examples.
Of course, be careful not to overfit by tuning the model by hand based on what you see there.
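Outside RapidMiner, the same workflow (score every example with cross-validation, then filter a specific confusion-matrix cell) can be sketched in Python with scikit-learn. This is a minimal illustration on synthetic data standing in for the 8-class, 50-attribute problem described above, not the poster's actual dataset:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_predict

# Synthetic stand-in for the dataset described in the question.
X, y = make_classification(n_samples=6000, n_features=50,
                           n_informative=20, n_classes=8,
                           n_clusters_per_class=1, random_state=0)

# cross_val_predict plays the role of X-Prediction: each example is
# predicted by a model that was not trained on it.
pred = cross_val_predict(RandomForestClassifier(random_state=0), X, y, cv=5)

# All misclassified examples (the filter step in the answer above).
misclassified = np.where(y != pred)[0]

# A single confusion-matrix cell, e.g. true class 7 predicted as 6
# (labels are 0-based here); inspect these rows for common patterns.
cell_rows = np.where((y == 7) & (pred == 6))[0]
```

From there, `X[cell_rows]` gives exactly the examples behind one cell of the confusion matrix, ready for plotting or summary statistics.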
Cheers,
Martin
Dortmund, Germany