RapidMiner

RapidMiner

Applying Machine Learning to Text Mining with Amazon S3 and RapidMiner

by Community Manager on ‎11-28-2016 04:31 AM
Tweet from @kortxltd

DataViZ, Data Science and Mac https://t.co/0kXcsCLGZE #herearesomewhitepapersabouttamr #lavastorm #looker #rapidminer #teradata #thingworx

View on Web
Comments
CraigBostonUSA
Regular Contributor

 Picture1.png

The link is dead, here is the working link for utilizing RapidMiner with Amazon S3:

 

https://aws.amazon.com/blogs/big-data/applying-machine-learning-to-text-mining-with-amazon-s3-and-ra...

 

Also, the following six step course on RapidMiner text, has over 40k views in youtube! 

 

Loading text in to RapidMiner

 

Processing Text in RapidMiner - tokenizing, stripping HTML, stemming, stopwords, n-grams, and word frequency tables.

Association rules with text in RapidMiner - making word vectors, finding frequent item-sets and high-confidence association rules in text documents.

Finding similar documents: how to automatically calculate the similarity between documents. TF-IDF, cosine similarity and K-Means clustering are covered.

Automatic classification: How to classify documents into classes (like positive/negative reviews, or spam/not spam or sports/finance/leisure news), and which words are important.

NEW: Applying A Model To New Documents