Bad Agglomerative Clustering Results

ema Member Posts: 33 Maven
I tried to apply agglomerative clustering to text and achieved very bad results.

Has anyone else tried it?

Answers

  • land RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi Ema,
    The most important aspects of clustering text are the representation of the text and the distance measure. Which did you use? I would encourage you to use TF-IDF as the representation and cosine similarity as the distance measure.


    Greetings,
      Sebastian
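
To make the suggestion above concrete, here is a minimal sketch of TF-IDF vectors combined with cosine distance in an agglomerative clustering, written in plain Python rather than with RapidMiner operators. Everything in it is an assumption for illustration: the function names (`tfidf_vectors`, `cosine_distance`, `agglomerative`) are hypothetical, tokenization is a simple whitespace split, the IDF uses one common smoothed variant, and average linkage is just one possible merge criterion.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    # Assumption: lowercase + whitespace split stands in for real tokenization.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of documents each term occurs in.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Assumption: smoothed IDF, log((1 + N) / (1 + df)) + 1.
        vectors.append(
            {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine_distance(a, b):
    """Cosine distance (1 - cosine similarity) between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty document as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def agglomerative(vectors, n_clusters):
    """Average-linkage agglomerative clustering; returns one label per vector."""
    n = len(vectors)
    dist = [[cosine_distance(vectors[i], vectors[j]) for j in range(n)] for i in range(n)]
    clusters = [[i] for i in range(n)]  # start with every document on its own
    while len(clusters) > n_clusters:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                # Average pairwise distance between the members of both clusters.
                total = sum(dist[i][j] for i in clusters[x] for j in clusters[y])
                d = total / (len(clusters[x]) * len(clusters[y]))
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        clusters[x].extend(clusters[y])  # merge the closest pair
        del clusters[y]
    labels = [0] * n
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


# Made-up example documents: two about cats, two about the stock market.
docs = [
    "the cat sat on the mat",
    "a cat and a kitten sat together",
    "stock prices fell sharply today",
    "investors sold stock as prices fell",
]
labels = agglomerative(tfidf_vectors(docs), n_clusters=2)
print(labels)
```

With raw term counts and Euclidean distance, long and short documents on the same topic can end up far apart; TF-IDF down-weights terms that occur everywhere, and cosine distance ignores document length, which is usually why this combination behaves better on text.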