Find similarities in documents and group them into clusters
I am new to RapidMiner and to data mining in general. I run the support team in my organisation, and we have a lot of data from previously resolved cases that could be useful for finding similar issues and presenting the solution to people who encounter the same issues. What we have is a free-text field where the engineer writes the RCA (a summary of the issue) and, of course, the Product field. My question is: how can I use RapidMiner to achieve this?
The RCA column contains:
7) appliance low space
Once processed through RapidMiner, I would like the output to be:
7) appliance low space group 2
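To make the desired output more concrete, here is a rough sketch of the kind of grouping meant above, written in plain Python rather than as a RapidMiner process. It is only an illustration under assumptions: the function names (`tokens`, `jaccard`, `group_summaries`), the 0.5 similarity threshold, and every sample summary except "appliance low space" are made up for the example, and word overlap (Jaccard similarity) with a greedy pass is just one simple stand-in for a proper clustering method.

```python
import re

def tokens(text):
    """Lowercase a summary and split it into a set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def jaccard(a, b):
    """Fraction of words two token sets share (0.0 to 1.0)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)

def group_summaries(summaries, threshold=0.5):
    """Greedy grouping: assign each summary to the first group whose
    first member is similar enough, otherwise start a new group."""
    representatives = []  # token set of the first member of each group
    labels = []
    for text in summaries:
        toks = tokens(text)
        for index, rep in enumerate(representatives):
            if jaccard(toks, rep) >= threshold:
                labels.append(index + 1)
                break
        else:
            # No existing group matched, so this summary opens a new one.
            representatives.append(toks)
            labels.append(len(representatives))
    return labels

# Hypothetical RCA summaries; only "appliance low space" is from the real data.
rca = [
    "license expired",
    "appliance low space",
    "appliance low disk space",
    "license has expired",
]
labels = group_summaries(rca)
for number, (text, label) in enumerate(zip(rca, labels), start=1):
    print(f"{number}) {text} group {label}")
```

The group numbers here depend on the order of the rows and on the chosen threshold, so they are only labels, not a ranking.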
Also, if anyone has used RapidMiner for support case analysis, examples would be much appreciated.