Selecting Tokenized Attributes

gn582gn582 Member Posts: 2 Contributor I
Hey guys,

Using Rapidminer for the first time for one of my university subjects. I'm trying to make a decision tree that uses the least_square criterion as it has a numerical attribute as the label.

The data deals with Youtube attributes and I am trying to tokenize the tag column (which I can successfully do) and then select each tag individually using the "Select Attribute" operator so that I can input the data into the decision tree and have the number of likes column be the label for the decision tree. 

The problem is when I process the tag data and tokenize it, I can only select the attribute "text" rather than each token individually. Is there a way around this or a way I can select each token to input into the data set?

Sorry if I worded it confusingly,

Thanks for the help!

Best Answer

  • lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    Solution Accepted
    Hi @gn582,

    If I good understand, you want select only the token attributes and exclude the "text" attribute ?

    In this case : 
     - select attribute filter type = single
     - select your "text" attribute
    AND
     - check invert selection



    Regards,

    Lionel

Answers

  • gn582gn582 Member Posts: 2 Contributor I
    Exactly what I wanted! Thanks heaps Lionel, was stuck on that problem for a while and couldn't figure out why it wouldn't work. Saved me so much time!

    Thanks again 
  • lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    You're welcome @gn582,

    Good luck for your study ! 

    Regards,

    Lionel
Sign In or Register to comment.