Need a working example of Find Threshold (Meta) operator in RapidMiner
I've been working with text classification processes in RapidMiner and I can't figure out the proper way to use the Find Threshold (Meta) operator for multiclass classification. It seems to be the closest multiclass counterpart to the Threshold family of operators used for binary classification.
I am using k-NN models with 11 different classes and a corpus of about 300-500 text documents as the test dataset.
Specifically, I don't see any impact from putting a learner inside the operator: the performance values are always the same, whether or not I assign weights to the classes. Moreover, there's no explanation of what the class weights are. I also don't see any way to extract the (possibly) generated thresholds as an output of this operator in order to apply them to the model. And there's no RapidMiner documentation entry for this operator at all.
Does anyone have a working example of the Find Threshold (Meta) operator?
Answers
I have never used the meta one. What is the reason not to use the standard one?
Best,
Martin
Dortmund, Germany
Instead, I use Optimize Parameters combined with Create Threshold to test various options for the threshold and select the one that delivers the best performance.
Here is a short version of what I use:
I think your process is dangerous because it does not use cross-validation (X-Validation) to ensure quality. This will tend to overestimate your performance.
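To illustrate the concern outside RapidMiner, here is a minimal Python sketch (not a RapidMiner process). It assumes a binary problem where each example has a confidence for the positive class, and it compares a threshold tuned and scored on the same data with a threshold tuned on the training folds and scored on the held-out fold. All function names and the toy data are made up for illustration.

```python
def accuracy_at(threshold, scores, labels):
    """Accuracy when an example is predicted positive if score > threshold."""
    correct = sum((s > threshold) == y for s, y in zip(scores, labels))
    return correct / len(labels)


def best_threshold(scores, labels, candidates):
    """Return the candidate with the highest accuracy (first one wins ties)."""
    return max(candidates, key=lambda t: accuracy_at(t, scores, labels))


def cross_validated_accuracy(scores, labels, k, candidates):
    """Tune the threshold on k-1 folds, score it on the held-out fold, average."""
    n = len(scores)
    fold_accuracies = []
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        test = [i for i in range(n) if i % k == fold]
        t = best_threshold([scores[i] for i in train],
                           [labels[i] for i in train], candidates)
        fold_accuracies.append(accuracy_at(t, [scores[i] for i in test],
                                           [labels[i] for i in test]))
    return sum(fold_accuracies) / k


# Toy data (hypothetical): confidences for the positive class and true labels.
scores = [0.1, 0.2, 0.3, 0.7, 0.8, 0.9]
labels = [False, False, False, True, True, True]
candidates = [0.25, 0.5, 0.75]

# Optimistic: threshold chosen and evaluated on the same examples.
optimistic = accuracy_at(best_threshold(scores, labels, candidates), scores, labels)
# Held-out: threshold chosen on training folds only.
held_out = cross_validated_accuracy(scores, labels, 3, candidates)
print(optimistic, held_out)
```

On this toy data the held-out estimate comes out lower than the optimistic one, which is the effect described above.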
~Martin
Dortmund, Germany
I removed all the X-Validations, the number formatting, and some other stuff. It's just a demo of the use of Create Threshold.
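For readers who want the idea behind a threshold without the process itself, a minimal Python sketch follows. It assumes a threshold is simply a cut-off on the confidence of one class in a binary problem. The function and parameter names are hypothetical and are not RapidMiner's API, and the confidences are made up.

```python
def apply_threshold(confidences, threshold, first_class, second_class):
    """Label an example second_class if its confidence exceeds the threshold."""
    return [second_class if c > threshold else first_class for c in confidences]


# Made-up confidences for the "positive" class.
confidences = [0.15, 0.40, 0.55, 0.90]

# A higher cut-off makes the "positive" prediction stricter.
labels_default = apply_threshold(confidences, 0.5, "negative", "positive")
labels_strict = apply_threshold(confidences, 0.8, "negative", "positive")
print(labels_default)
print(labels_strict)
```

Trying several cut-offs and keeping the one with the best performance is the part that the parameter optimization handles.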