The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

"Sampling to an known distribution of labes"

pickmaypickmay Member Posts: 2 Contributor I
edited May 2019 in Help
Hi all.

I have a large data set, and I want to to produce a small sample in which there will be the same amount of examples from each label. is there any Rapid function that does that?

thanks
Yishai
Tagged:

Answers

  • Options
    TobiasMalbrechtTobiasMalbrecht Moderator, Employee, Member Posts: 295 RM Product Management
    Hi,

    that is already on our todo list, but unfortunately we have not managed to implement such a sample operator yet, since there is plenty of other things to do at the moment. One thing RM does already have is an operator to distribute weights among the examples giving every class the same sum of weights. The operator is called [tt]EqualLabelWeighting[/tt].

    Regards,
    Tobias
Sign In or Register to comment.