[SOLVED] Information Gain Feature Selection with Integer Values

rvaz Member Posts: 7 Contributor II
edited November 2018 in Help

I'm using Information Gain as a method for feature selection.
The features represent the number of occurrences of a term in a document, tf(t,d).
My question is: when calculating the IG, does the method implemented in RapidMiner take into consideration all the possible values of a feature, or only the presence or absence of a feature in a given example?

Thanks in Advance,

Rui Vaz


    michaelhecht Member Posts: 89 Maven
    Can you give an example of the features?
    MariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    RapidMiner considers all values, since it has no concept of "absence" (leaving aside missing values, but I don't think you have any).
    rvaz Member Posts: 7 Contributor II
    Ok thanks Marius.
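
To make the distinction in the question concrete, here is a minimal Python sketch that computes information gain for one term twice: once using the raw integer counts, and once using only presence or absence. It is not RapidMiner's implementation, and the thread does not say how RapidMiner treats numeric attributes internally; the sketch simply assumes each distinct count is its own branch. The toy data and the function names `entropy` and `information_gain` are made up for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(values, labels):
    """IG of a feature, treating every distinct feature value as one branch.

    Assumption for illustration only: a real tool may instead search for
    numeric split points or discretize the counts first.
    """
    total = len(labels)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the labels after splitting on the feature
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy data: tf(t, d) for a single term, plus class labels
tf = [0, 0, 1, 1, 3, 3]
labels = ["a", "a", "a", "b", "b", "b"]

ig_counts = information_gain(tf, labels)                           # all values
ig_presence = information_gain([int(v > 0) for v in tf], labels)   # 0/1 only

print(f"IG using counts:   {ig_counts:.3f}")
print(f"IG using presence: {ig_presence:.3f}")
```

On this toy data the count-based version gives a higher gain than the presence-based one, because the count 3 separates the classes where mere presence does not. If only presence or absence should matter, binarizing the attributes before weighting is one way to get that behaviour.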