Learner II aidanclifford
Learner II

Text analysis & preprocessing

Hi Guys


Really new to Rapidminer and text mining. I have a CSV with 3000+ tweets from one user. I want to remove all the stop words from the dataset. I have tried following examples but I end up with blank results. Could anyone explain the stages of this process in Rapidminer it would be appreciated.



RM Staff
RM Staff

Re: Text analysis & preprocessing



Here are some generic hints about how to perform text analytics in RapidMiner.


For all types of text analysis, you will need the Text Mining extension for RapidMiner which you can download for free from our Marketplace.  You can find it in the menu “Extensions” – “Marketplace” and type “Text” in the search box (here is also a link directly to our marketplace:https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_text).  There are also many more extensions on our Marketplace so make sure that you check them out…


There is a community member who created a nice set of tutorials for text analysis with RapidMiner: http://vancouverdata.blogspot.com/2010/11/text-analytics-with-rapidminer-loading.html


Finally, there are two more extensions which might be interesting from our partners (Aylien and Rosette).


Hope this helps,


How to load processes in XML from the forum into RapidMiner: Read this!
How can RapidMiner increase participation in our new competitions?
Twitter Feed