Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

"Sampling to an known distribution of labes"

pickmaypickmay Member Posts: 2 Contributor I
edited May 2019 in Help
Hi all.

I have a large data set, and I want to to produce a small sample in which there will be the same amount of examples from each label. is there any Rapid function that does that?

thanks
Yishai
Tagged:

Answers

  • TobiasMalbrechtTobiasMalbrecht Moderator, Employee, Member Posts: 295 RM Product Management
    Hi,

    that is already on our todo list, but unfortunately we have not managed to implement such a sample operator yet, since there is plenty of other things to do at the moment. One thing RM does already have is an operator to distribute weights among the examples giving every class the same sum of weights. The operator is called [tt]EqualLabelWeighting[/tt].

    Regards,
    Tobias
Sign In or Register to comment.