Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
using descretize by frequency
abeetbhat1995
Member Posts: 6 Contributor I
i want to count the no. of occurences of each word in a text document....i think i can do it using descretize by frequency but i dont know how to do it.....thanx in advance
Tagged:
0
Answers
Actually, the easiest way to do that is to Tokenize your document by words and then create the word vector inside the Process Documents parameters based on term occurrences. The resulting word list will contain a count of all occurrences of your words and your dataset will have a corresponding attribute for each word and its count for each document.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts