Text Classification
Hi,
I am working with a text document that contains around 1000 short paragraphs. My objective is to group the words that frequently occur together in the same paragraph or sentence (they need not be adjacent, as long as they appear in that paragraph). Which operator can I use to classify the document on the basis of these groups of words?
Thanks
Nancy
Answers
Sorry, but what you are asking for does not seem consistent. In the first sentence you explain that you want to group the words, but in your question you want to classify the documents. From this I cannot tell what your real objective is, or how I could help you reach it.
Greetings,
Sebastian
My objective is to classify the documents, but first I want to group the words. After that I will classify the documents on the basis of these words. Can you suggest a way to group the words?
Thanks
Nancy
If you want to select words that will be suitable for later classification, you could use feature selection or simply weighting.
Otherwise, you would have to specify what objective you have with grouping the words. Every combination of words is a group, so just building a grouping isn't very meaningful.
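To make the weighting idea a bit more concrete, here is a rough sketch in plain Python rather than RapidMiner. It assumes each paragraph already has a class label, scores every word with a chi-squared statistic (word present or absent versus target class or other class), and keeps the highest-scoring words. The function names, the sample paragraphs and the labels are all made up for illustration; they are not RapidMiner operators or data from this thread.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase a paragraph and return the set of distinct words in it."""
    return set(re.findall(r"[a-z]+", text.lower()))


def chi2_word_weights(paragraphs, labels, target):
    """Chi-squared weight per word: presence in a paragraph vs. label == target."""
    n = len(paragraphs)
    n_pos = sum(1 for y in labels if y == target)
    n_neg = n - n_pos
    present_all = Counter()  # paragraphs containing the word
    present_pos = Counter()  # target-class paragraphs containing the word
    for text, y in zip(paragraphs, labels):
        for word in tokenize(text):
            present_all[word] += 1
            if y == target:
                present_pos[word] += 1
    weights = {}
    for word, df in present_all.items():
        a = present_pos[word]  # present, target class
        b = df - a             # present, other class
        c = n_pos - a          # absent, target class
        d = n_neg - b          # absent, other class
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        weights[word] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return weights


def select_top_words(weights, k):
    """Return the k highest-weighted words (ties broken alphabetically)."""
    return sorted(weights, key=lambda w: (-weights[w], w))[:k]


# Hypothetical labelled paragraphs, for illustration only.
paragraphs = [
    "the engine and the brakes failed",
    "engine noise after the brakes service",
    "the invoice and the payment are late",
    "payment received for the invoice",
]
labels = ["technical", "technical", "billing", "billing"]

weights = chi2_word_weights(paragraphs, labels, target="technical")
top = select_top_words(weights, 4)
print(top)  # ['brakes', 'engine', 'invoice', 'payment']
```

Words such as "the" that appear in every paragraph get weight 0 and drop out, while words tied to one class score highest. The selected words would then serve as the attributes for the classifier.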
Greetings,
Sebastian