The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

"[SOLVED] Attribute weighting for unbalanced data"

makakmakak Member Posts: 13 Contributor II
edited June 2019 in Help
Hi,

I am trying to experimet with various techniques for attribute weighting for my dataset which is quite unbalanced. I am subsampling the majority class when I am training classifier. My question is, when I am applying "Weight by ..." operators, should I apply them for original (unbalanced) dataset, or for balanced dataset? Intuitively for balanced I'm just not sure.

Thank you.
Tagged:

Answers

  • Options
    MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi,

    yes, in general you should use the balanced data set for any kind of data mining and data analysis, at least as long as you are in the training process.

    Performance measurements can also be taken on the original class distributions, depending on the desired output and interpretation of the performance values.

    Best regards,
    Marius
  • Options
    makakmakak Member Posts: 13 Contributor II
    Thank you.
Sign In or Register to comment.