The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

using descretize by frequency

abeetbhat1995abeetbhat1995 Member Posts: 6 Contributor I
edited November 2018 in Help

i want to count the no. of occurences of each word in a text document....i think i can do it using descretize by frequency but i dont know how to do it.....thanx in advance

 

Tagged:

Answers

  • Options
    Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn

    Actually, the easiest way to do that is to Tokenize your document by words and then create the word vector inside the Process Documents parameters based on term occurrences.  The resulting word list will contain a count of all occurrences of your words and your dataset will have a corresponding attribute for each word and its count for each document.

     

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
Sign In or Register to comment.