"Increasing text categorization performance through dedicated wordlists"
My text categorization models reach an accuracy of around 62% (SVM, with SVD for dimensionality reduction).
I want to try to improve this by "helping" the learner a little bit. For a category 'Product related' I know all possible products (something RapidMiner - of course - does not know). Another example would be a list of swear words for tagging cases with a category 'Flame'.
Is it possible to help the learner by connecting or relating wordlists to certain categories?
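To make the idea concrete, here is a rough sketch of what I have in mind, written in plain Python rather than as a RapidMiner process. It counts how many tokens of a document appear in each wordlist and turns those counts into extra attributes that could sit next to the SVD components. The wordlist contents, the names `WORDLISTS` and `wordlist_features`, and the example sentence are all made up for illustration; I do not know whether this is the recommended way to do it.

```python
import re

# Hypothetical wordlists, one per category. The real lists would come from
# the product catalogue and a swear-word dictionary.
WORDLISTS = {
    "product_related": {"acmephone", "acmepad"},
    "flame": {"garbage", "useless", "idiots"},
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def wordlist_features(text, wordlists=WORDLISTS):
    """Return one hit count and one hit ratio per wordlist."""
    tokens = tokenize(text)
    total = max(len(tokens), 1)  # avoid division by zero on empty text
    features = {}
    for name, words in sorted(wordlists.items()):
        hits = sum(1 for token in tokens if token in words)
        features[name + "_hits"] = hits
        features[name + "_ratio"] = hits / total
    return features


doc = "My AcmePhone is garbage and support was useless"
print(wordlist_features(doc))
```

The resulting attributes would then be joined to the existing feature set before training, so the learner sees the wordlist evidence directly instead of having to recover it from the reduced term vectors.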
Thanks for your help!