Select Terms

gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
edited November 2018 in Help
Hi, Everybody

I don't if the topic is in correct place. Anyway...

Is it possible to make a filtering of the terms as follows in RapidMiner?

Supose:

NA - Number of occurrence of a word in Class A
NB - Number of occurrence of a word in Class B
NC - Number of occurrence of a word in Class C
Total = NA + NB + NC

They remain the terms that meet the following criteria:

(NA / Total) * 100%> X% or (NB / Total) * 100%> Y% or (CN / Total) * 100%> Z%

Is it possible?

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,503 RM Data Scientist
    Sure,

    tokenize, generalte attributes and filter example should do that.
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    Classes are defined by the value of "label1" which are respectively -1, 0, 1. Suppose I want to remain the only terms which meet the following criteria:

    ([occurrence of words with label1] = -1) / total)> 0.7
    ([occurrence of words with label1] = -0) / total)> 0.7
    ([occurrence of words with label1] = 1) / total)> 0.7

    Total =[ Occurrence in label1= -1] + [ Occurrence in label1= 0] +[ Occurrence in label1= 1]


    What should I write on the fields inside "Generate Attribuite" and "Filter Examples"?



    I cannot sendo the image to show you better. But there are fields to be filled inside "Generate Attribuite" and "Filter Examples".
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    Do the people here abandoned me?
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,503 RM Data Scientist
    no,

    but you posted at x differnt threads got different answers and i do not know what the status is.

    ~Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    I did not know the correct place this is a reason I had done it.
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    What should I do now?
Sign In or Register to comment.