It looks like you're new here. Sign in or register to get started.
i want to count the no. of occurences of each word in a text document....i think i can do it using descretize by frequency but i dont know how to do it.....thanx in advance
Actually, the easiest way to do that is to Tokenize your document by words and then create the word vector inside the Process Documents parameters based on term occurrences. The resulting word list will contain a count of all occurrences of your words and your dataset will have a corresponding attribute for each word and its count for each document.