Text Data Cleaning, Preprocessing and Text Mining with Rapidminer

abby · August 2023

I have a large number of English texts (online reviews). I need to do a data cleaning, preprocessing on them and then mine which are the effective high-frequency words? For example, "bathroom", "traffic", etc., are likely to appear as valid high-frequency words. Do you have any specific steps to do so?

ceaperez · August 2023

Hi @abby,
Yes, you can use Rapidminer for that and you have specific content in Rapidminer Acafemy for free.

please check this thread
please check this thread

https://community.rapidminer.com/discussion/60355/can-process-documents-calculate-term-occurences-of-all-words-without-having-to-give-it-a-word-list#latest

best,

Cesar

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

Text Data Cleaning, Preprocessing and Text Mining with Rapidminer

Answers