[SOLVED] Balancing data - is "pull with undelete" possible?
Hi,
Say we have data with 1000 examples of class 0 and 50 examples of class 1.
Using the "Sample" operator I can downsample class 0 to, e.g., 800.
BUT I would also like to resample class 1 up to, e.g., 100, so I have to blow it up somehow,
which is also called "pull with undelete" (i.e. sampling with replacement).
Is this possible?
Thx, Frank
Answers
I'm using the Golf dataset that comes with RapidMiner. It has two classes: yes (9 observations) and no (5 observations). I end up with a new dataset that has yes (8 observations) and no (8 observations), which is exactly what you want.
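For readers who want to see the idea outside RapidMiner, here is a minimal sketch in plain Python of per-class resampling on the Golf class counts (9 yes, 5 no, target 8 each). It is not the RapidMiner process from this thread. The function name `resample_per_class`, the `target` parameter, and the rule "draw without replacement when a class is large enough, with replacement otherwise" are assumptions made for illustration.

```python
import random
from collections import Counter


def resample_per_class(labels, target, seed=0):
    """Return row indices so that every class appears exactly `target` times.

    Hypothetical helper for illustration only, not a RapidMiner API.
    """
    rng = random.Random(seed)

    # Group row indices by their class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    chosen = []
    for label, indices in by_class.items():
        if len(indices) >= target:
            # Enough rows: draw without replacement (downsampling).
            chosen.extend(rng.sample(indices, target))
        else:
            # Too few rows: draw with replacement, so some rows repeat
            # (this is the "pull with undelete" case).
            chosen.extend(rng.choices(indices, k=target))
    return chosen


# Class counts taken from the Golf example: 9 x yes, 5 x no.
labels = ["yes"] * 9 + ["no"] * 5
balanced = resample_per_class(labels, target=8)
counts = Counter(labels[i] for i in balanced)
print(counts)
```

With `target=8`, both classes end up with 8 rows: "yes" loses one row, while "no" necessarily contains duplicates because 8 draws are taken from only 5 distinct rows.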