Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Select Terms

gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
edited November 2018 in Help
Hi, Everybody

I don't if the topic is in correct place. Anyway...

Is it possible to make a filtering of the terms as follows in RapidMiner?

Supose:

NA - Number of occurrence of a word in Class A
NB - Number of occurrence of a word in Class B
NC - Number of occurrence of a word in Class C
Total = NA + NB + NC

They remain the terms that meet the following criteria:

(NA / Total) * 100%> X% or (NB / Total) * 100%> Y% or (CN / Total) * 100%> Z%

Is it possible?

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,529 RM Data Scientist
    Sure,

    tokenize, generalte attributes and filter example should do that.
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    Classes are defined by the value of "label1" which are respectively -1, 0, 1. Suppose I want to remain the only terms which meet the following criteria:

    ([occurrence of words with label1] = -1) / total)> 0.7
    ([occurrence of words with label1] = -0) / total)> 0.7
    ([occurrence of words with label1] = 1) / total)> 0.7

    Total =[ Occurrence in label1= -1] + [ Occurrence in label1= 0] +[ Occurrence in label1= 1]


    What should I write on the fields inside "Generate Attribuite" and "Filter Examples"?



    I cannot sendo the image to show you better. But there are fields to be filled inside "Generate Attribuite" and "Filter Examples".
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    Do the people here abandoned me?
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,529 RM Data Scientist
    no,

    but you posted at x differnt threads got different answers and i do not know what the status is.

    ~Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    I did not know the correct place this is a reason I had done it.
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor II
    What should I do now?
Sign In or Register to comment.