RapidMiner 9.8 Beta is now available

Be one of the first to get your hands on the new features. More details and downloads here:

GET RAPIDMINER 9.8 BETA

Bad Agg

emaema Member Posts: 33  Guru
I tried to apply agglomerative Clustering
on text
and acheived very bad results


did any one try it ?

Answers

  • landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531   Unicorn
    Hi Ema,
    the most important aspects on clustering text is the representation of the text and the distance measure. Which did you use? I would encourage you to select TFIDF as representation and Cosine Similarity for distance calculation.


    Greetings,
    Β  Sebastian
Sign In or Register to comment.