Regarding Text Classification

sudheendra · December 2009

Hi,

I have 1000 text documents. I want to classify them on the basis of certain words in the document, i.e. if a document contains a particular number of words (word1, word2, ..., word10), I need to classify it as a group. I have already tried a clustering algorithm and got around 20 clusters, but I couldn't find any option there for the type of classification described above. Is there any way to classify the documents on the basis of an input word list?

Thanks,
Sudheendra

land · December 2009

Hi,
of course. But you wouldn't learn anything at all that way. You could simply use an attribute construction operator, add the if clauses, and generate a new label attribute.
But this isn't text mining at all...

Greetings,
Sebastian
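
For illustration, here is a minimal sketch of that kind of rule-based labelling in plain Python rather than as a RapidMiner process. The word list, the `min_hits` threshold, the label names and the function `label_document` are all placeholders made up for this example; nothing here is a RapidMiner operator or its API.

```python
import re

def label_document(text, keywords, min_hits, label="group", default="other"):
    """Return `label` if at least `min_hits` of `keywords` occur in `text`."""
    # Lowercase and split into word tokens so matching is case-insensitive
    # and only whole words count.
    tokens = set(re.findall(r"[a-z0-9]+", text.lower()))
    hits = sum(1 for word in keywords if word.lower() in tokens)
    return label if hits >= min_hits else default

# Hypothetical word list and documents, standing in for word1 ... word10.
keywords = ["word1", "word2", "word3"]
docs = [
    "This text mentions word1 and also word3.",
    "Nothing relevant here.",
]
labels = [label_document(d, keywords, min_hits=2) for d in docs]
```

As the reply says, nothing is learned here: the label is a fixed if-clause over the word list, so the result is only as good as the rule itself.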

sudheendra · December 2009

Hi Sebastian,

I have already worked with the attribute construction operator on numerical attributes. If the same operator can be used on text data, how do I assign the label "Type A" if the text contains "payment" and "claimant"?

Thanks,
Sudheendra

land · December 2009

Hi,
wasn't it you to whom I recommended reading a book about text mining? It will become clear to you then. The word vector representation with TF-IDF is just the very basics. Sorry, but without knowledge of that, it doesn't make sense to continue.

Greetings,
Sebastian
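
To make the word vector idea concrete, here is a rough sketch in plain Python, assuming one common TF-IDF variant (relative term frequency times log(N / document frequency)); RapidMiner's own weighting and normalisation may differ. Each document becomes a mapping from word to numeric weight, so every word behaves like a numerical attribute that an if-clause can test. The sample documents, the "other" label and the helper names are invented for illustration.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Lowercase word tokens; punctuation and digits are dropped.
    return re.findall(r"[a-z]+", text.lower())

def tfidf_vectors(docs):
    """Return one sparse {word: weight} dict per document."""
    token_lists = [tokenize(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each word occurs.
    df = Counter()
    for tokens in token_lists:
        df.update(set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            word: (count / total) * math.log(n_docs / df[word])
            for word, count in counts.items()
        })
    return vectors

def label_from_vector(vector, required, label="Type A", default="other"):
    # A word that is absent from the document has no entry in the vector.
    return label if all(word in vector for word in required) else default

docs = [
    "The claimant received the payment on Monday.",
    "Payment is still pending.",
    "The weather was fine.",
]
vectors = tfidf_vectors(docs)
labels = [label_from_vector(v, ["payment", "claimant"]) for v in vectors]
```

The rule tests for the presence of a word rather than for a weight above zero, because with this variant a word that occurs in every document gets a weight of exactly 0.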

Altair RapidMiner Community
