I'm new to this forum and to Rapidminer and text mining as well, so I need your help:
I have a large number of documents (.txt) each one containing a specific question for solving a problem, and the relative answer.
My objective is given a new question to identify the closest ones (all the questions are in italian) in order to suggest the possible solution according to the answers given to the other similar questions.
I have downloaded the Text Mining Extencion and I imagine I have to use the "Process Document from files" operator (Tokenize, Filter Stopwords( Italian), Transform Cases, Stem...) first and than probably use "Document to Similarity" and "Clustering" operators.
Could you please give me some hints?
Thanks a lot!