
Dictionary Approach: Avoid counting words multiple times

Felice Member Posts: 3 Newbie
edited November 2019 in Help
Hi, I have a problem with the dictionary approach in text mining. The dictionary contains the terms digit and digital acceleration. My process counts each occurrence of digital acceleration twice: once as digit and once as digital acceleration.
Can you recommend an operator that ensures only the occurrence of digital acceleration is counted, so that in the end I have only one occurrence?

Thanks for helping!


  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
    Which operator did you use to do it? Can you maybe post the XML?

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Felice Member Posts: 3 Newbie
    Hi Martin, thanks for your reply! Attached you can find the XML.
  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
    Hi @Felice,
    could you just switch to binary occurrences? Then it only counts whether a word occurs, not how often.

  • Felice Member Posts: 3 Newbie
    Hi Martin,
    thanks for your reply. But I need the number of occurrences, not just whether a word occurs. I just want to avoid longer word combinations like digit accel also being counted as digit.

    Thanks for helping! 
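
The requirement in the last post (keep real counts, but let a longer dictionary entry such as digit accel "consume" the text so it is not also counted as digit) is usually called longest-match-first counting. The thread does not name a RapidMiner operator for it, so the following is only a minimal Python sketch of the idea, not a RapidMiner solution. It assumes the text has already been lowercased, tokenized and stemmed into a space-separated string; the function name `count_dictionary_terms` and the sample sentence are made up for illustration.

```python
import re
from collections import Counter


def count_dictionary_terms(text, dictionary):
    """Count dictionary entries so that longer entries win over shorter ones.

    Assumption: `text` is already stemmed and space-separated, and the
    dictionary entries use the same stemmed form (e.g. "digit accel").
    """
    # Try longer entries first, so "digit accel" is matched before "digit".
    ordered = sorted(dictionary, key=len, reverse=True)
    # \b on both sides restricts matches to whole tokens.
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(term) for term in ordered) + r")\b"
    )
    # finditer yields non-overlapping matches, so text consumed by a long
    # entry cannot be counted again for a shorter one.
    return Counter(match.group(0) for match in pattern.finditer(text))


# Hypothetical stemmed input, for illustration only.
sample = "digit accel drive growth and every digit count while digit accel continu"
counts = count_dictionary_terms(sample, ["digit", "digit accel"])
print(counts["digit accel"], counts["digit"])  # 2 1
```

Here digit accel is counted twice and digit once, because the standalone digit is the only one not already covered by a longer match. Sorting by length before building the alternation matters: regular-expression alternation takes the first alternative that matches, not the longest one.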