πŸŽ‰ πŸŽ‰ RAPIDMINER 9.10 IS OUT!!! πŸŽ‰πŸŽ‰

Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.

CLICK HERE TO DOWNLOAD

"Filter minimum occurence of token?"

erocoarerocoar Member Posts: 6 Contributor I
edited June 2019 in Help
Hey,

I am still in the process of setting up my text mining / analyzing process :) Since you can filter your words by minimum characters, maximum characters etc. I had the question of whether you can also filter your document; removing all words/tokens that occur >x times?

Thanks in advance!!Β  :D
Tagged:

Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,055  RM Data Scientist
    Hi erocoar,

    sure. In Process documents is a pruning option. This can be set to absolute.

    ~Martin
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • cc4699cc4699 Member Posts: 6 Contributor I

    Prune method on the "Process Documents" is only filtering the number of document occurrences. How can I filter by "Total Occurrences" ?

    Β 

    Thanks

Sign In or Register to comment.