"Finding the top relevant document in a k-means cluster"

amir_askary_sha Member Posts: 11 Contributor I
edited June 2019 in Help

Hi,

 

After running k-means clustering, how can I find out which document is the most relevant (the top document) in a cluster?

 

Right now the documents in a cluster are sorted in ascending order by their ID. I want them sorted by a weight score that shows how relevant each document is to its cluster, or at least to see the most relevant document in the cluster.

Answers

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,503 RM Data Scientist

    Hi,

     

    How do you define relevancy?

     

    Best,

    Martin

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • amir_askary_sha Member Posts: 11 Contributor I

    I don't know exactly; any reasonable measure of relevance would do. For example, say every cluster has some top words (the centroid that k-means finds). Then the document with the shortest cosine or Euclidean distance to those top words of the cluster would be the most relevant document in that cluster.
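
The idea above can be sketched outside RapidMiner in plain Python. This is only an illustration under assumptions, not a RapidMiner process: the function names, the `vectors` and `labels` dictionaries, and the toy data are all hypothetical, and the centroid is recomputed here as the mean of each cluster's document vectors rather than read from a k-means model. Documents are then ranked by cosine distance to that centroid, closest first.

```python
import math
from collections import defaultdict


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an all-zero vector as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def rank_documents_by_centroid(vectors, labels):
    """Map each cluster label to its (doc_id, distance) pairs, closest first.

    vectors: dict of doc_id -> list of term weights (e.g. TF-IDF)
    labels:  dict of doc_id -> cluster label assigned by k-means
    """
    members = defaultdict(list)
    for doc_id, label in labels.items():
        members[label].append(doc_id)

    ranking = {}
    for label, doc_ids in members.items():
        dim = len(vectors[doc_ids[0]])
        # Centroid = component-wise mean of the cluster's document vectors.
        centroid = [
            sum(vectors[d][i] for d in doc_ids) / len(doc_ids)
            for i in range(dim)
        ]
        scored = [(d, cosine_distance(vectors[d], centroid)) for d in doc_ids]
        ranking[label] = sorted(scored, key=lambda pair: pair[1])
    return ranking


# Toy data (hypothetical): three terms, four documents, two clusters.
vectors = {
    "doc1": [1.0, 0.0, 0.0],
    "doc2": [0.8, 0.2, 0.0],
    "doc3": [0.0, 0.0, 1.0],
    "doc4": [0.5, 0.5, 0.0],
}
labels = {"doc1": 0, "doc2": 0, "doc4": 0, "doc3": 1}

ranking = rank_documents_by_centroid(vectors, labels)
# The first entry of each ranking is the "top document" of that cluster.
top_documents = {label: docs[0][0] for label, docs in ranking.items()}
```

Swapping `cosine_distance` for a Euclidean distance function gives the other variant mentioned above. For text vectors of very different lengths, cosine distance is usually the less length-sensitive choice.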
