Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

"Word Clustering/Classification"

dsaraphdsaraph Member Posts: 10 Contributor II
edited June 2019 in Help
Thought this question deserved it's own topic for others looking in the future.

Hi Matthias,

Just wanted to report back that I was able to run the n-grams quite well, but in the end the results were not exactly what I was looking for so I'm going to be tinkering with the data for the next little bit.  Thanks for all your help on this.

On another topic, I wanted to inquire if anyone was familiar with word clustering.  For example, is there a way that I can cluster the text (by a certain topic..an example would be if it was major league baseball data, it would allow me to cluster by teams.. in this case I want it to form the clusters on its own) without considering the order (n-grams are formed based on the order of the words)... I was looking into some of the clustering operators but I'm not sure what would be applicable to what I'm trying to do.  I was hoping there would be an operator that could just replace the n-gram operator in order to carry this out since I still wanted the pre-processing of the data, stemming, and filtering as I currently have.  Any suggestions are greatly appreciated.

Thanks.
Sign In or Register to comment.