Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
How to scale up underrepresented classes from Dataset?
hi,
I have 5000 data exemplars with 3 classes... but distribution of the classes (1 , 2, 3) are about 50/30/20 ... classification of 2 and 3 is rather poor compared to 1.. to achieve better classification results for them I used MetaCost for weighting class 2 and 3 in the testing / prediction process....
but I would also like to do weighting of the classes 2 or 3 in the learning process of the model... before testing, how can I do that? is there a way to scale up my underrepresented classes with an operator? or what are good weighting functions for learning models?
0
Answers
There a few ways to do that. One is the use the Sample operator and toggle on "Balance Data." From there you can set the class size you want. Of course, you can use the Sample by Bootstrapping operator as well.
If you weight your classes, you can use the Select By Weights operator as well. Many different ways to do this in Studio. Good luck.