image

🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉

Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.

CLICK HERE TO DOWNLOAD

Select Terms

gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor I
edited November 2018 in Help
Hi, Everybody

I don't if the topic is in correct place. Anyway...

Is it possible to make a filtering of the terms as follows in RapidMiner?

Supose:

NA - Number of occurrence of a word in Class A
NB - Number of occurrence of a word in Class B
NC - Number of occurrence of a word in Class C
Total = NA + NB + NC

They remain the terms that meet the following criteria:

(NA / Total) * 100%> X% or (NB / Total) * 100%> Y% or (CN / Total) * 100%> Z%

Is it possible?

Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,053  RM Data Scientist
    Sure,

    tokenize, generalte attributes and filter example should do that.
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor I
    Classes are defined by the value of "label1" which are respectively -1, 0, 1. Suppose I want to remain the only terms which meet the following criteria:

    ([occurrence of words with label1] = -1) / total)> 0.7
    ([occurrence of words with label1] = -0) / total)> 0.7
    ([occurrence of words with label1] = 1) / total)> 0.7

    Total =[ Occurrence in label1= -1] + [ Occurrence in label1= 0] +[ Occurrence in label1= 1]


    What should I write on the fields inside "Generate Attribuite" and "Filter Examples"?



    I cannot sendo the image to show you better. But there are fields to be filled inside "Generate Attribuite" and "Filter Examples".
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor I
    Do the people here abandoned me?
  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,053  RM Data Scientist
    no,

    but you posted at x differnt threads got different answers and i do not know what the status is.

    ~Martin
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor I
    I did not know the correct place this is a reason I had done it.
  • gustavo_medeirogustavo_medeiro Member Posts: 12 Contributor I
    What should I do now?
Sign In or Register to comment.