
What do these results mean from Cross Distances using Euclidean distance?

ekotas Member Posts: 8 Contributor I
edited December 2018 in Help

Hi,

My apologies if this is a really stupid question...

I've run Cross Distances using Euclidean distance on 2 Excel files: one containing 1 row of relevant keywords etc., and one containing 504 rows. The aim was to compare the similarity of each row to the relevant keywords, if that makes sense.

OK, so it all worked, which is great! Except I don't know what the numbers mean. Can anyone shed some light on this, please? Perhaps it isn't actually doing what I think it is doing?

thank you so much :) 

 

Attachments: screenshot.jpg, screenshot (2).jpg

 

Best Answer

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,509 RM Data Scientist
    Solution Accepted

    Hi,

     

    Let's say you have two attributes, age and salary. For each pair made of one example from the reference set and one example from the request set, Cross Distances computes:

     

    distance(ref_i, req_j)

     

    If you use Euclidean distance, you get:

     

    d = sqrt( (age_i - age_j)² + (salary_i - salary_j)² )
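
To make the formula concrete, here is a minimal Python sketch of the same calculation. It is only an illustration under assumptions, not RapidMiner's actual implementation: the function names `euclidean` and `cross_distances` and the age/salary values are made up for this example.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length numeric rows."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cross_distances(request_rows, reference_rows):
    """Return (request_index, reference_index, distance) for every pair."""
    return [
        (j, i, euclidean(req, ref))
        for j, req in enumerate(request_rows)
        for i, ref in enumerate(reference_rows)
    ]

# Hypothetical data: each row is (age, salary in thousands).
reference = [(30, 50)]                       # the single reference row
requests = [(30, 50), (33, 54), (60, 90)]    # rows to compare against it

for j, i, d in cross_distances(requests, reference):
    print(f"request {j} vs reference {i}: distance = {d:.2f}")
# request 0 vs reference 0: distance = 0.00
# request 1 vs reference 0: distance = 5.00
# request 2 vs reference 0: distance = 50.00
```

In this made-up data the first request row matches the reference row exactly, so its distance is 0, while the third row differs most on both attributes and gets the largest value.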

     

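    So the smaller the number, the more similar the two rows are; a distance of 0 means their attribute values are identical. I hope this helps.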

     

    BR,

    Martin

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany

Answers

  • ekotas Member Posts: 8 Contributor I

    Sorry for the late reply, but that's great. Thank you for your help.
