Finding most common words in text attribute
This is my first post on this forum, though I have been using RapidMiner for some time now, so hi to all of you! I hope you can help me out with a problem that I just can't seem to solve.
I want to get a list (like a top 10 or a top 20) of the most common word throughout a text attribute. I have already performed the basics (Nominal to text, Process Documents, tokenize, filter stopwords) and even developed some prediction models, but I am just not finding any operator that will show me the words that occur most commonly throughout the dataset (or better yet, the most common words per label). Can anyone help?
Thank you so much in advance. Regards, Rick