RapidMiner 9.8 Beta is now available
Be one of the first to get your hands on the new features. More details and downloads here:
Parts of Speech (POS) Filtering
I am have tokenised some text and am now trying to remove POS, using the Filter by POS Operator. I have used the following expression: N.*|VB.*|RB.*|JJ.*|MD.*|PP.* in an attempt to keep nouns, adjectives, verbs and adverbs. The problem is that as an example, nouns and verbs were filtered out e.g. the word "need" is no longer present in my text.
What am I doing wrong and do I have the right expression for the POS tokens I want to keep (nouns, adjectives, verbs and adverbs)?