Options

word list

hasnoooooorhasnoooooor Member Posts: 2 Newbie
edited December 2019 in Help
i have a model that has been trained and tested with dataset a. now i am inserting a new dataset into the model and try to get the wordlist output. the problem is if i did not connected the wordlist output from dataset a, the missing attribute problem will appear. but, when i connected the old wordlist data along with the new datasets, i got the wordlist for dataset a and not for the new dataset.. how can i do if i want the wordlist for the new example set? 

Answers

  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,510 RM Data Scientist
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Options
    hasnoooooorhasnoooooor Member Posts: 2 Newbie
    Hi,
    yes.. i already read it. currently i am doing this, and i would like to have an output of word list from the new input example set, but the result keep giving me the word list that i have for training.
  • Options
    kaymankayman Member Posts: 662 Unicorn
    The word list is to be seen as a filter, or in other words, new words are ignored as they were not included in the training and are therefore not relevant for the model. 

    Since your new data will be labeled using the words found during training only, it is therfore kind of logical the same words are returned. If you want to include new words it requires retraining of your data. 
  • Options
    Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    You could also convert your wordlist to data from the Toolbox extension and then use the normal Join operators with another dataset that you create that has only the words you want in it.

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
Sign In or Register to comment.