What do these results mean from Cross Distances using Euclidean distance?

ekotas Member Posts: 8 Contributor I
edited December 2018 in Help


My apologies if this is a really stupid question...

I've run Cross Distances using Euclidean distance on 2 Excel files: one containing 1 row of relevant keywords etc., and one containing 504 rows. The aim was to compare the similarity of each row to the relevant keywords, if that makes sense.

OK, so it all worked, which is great! Except I don't know what the numbers mean. Can anyone shed some light on this, please? Perhaps it isn't actually doing what I think it is doing?

Thank you so much :)




screenshot (2).jpg


Best Answer

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,517 RM Data Scientist
    Solution Accepted



    Let's say you have two attributes, age and salary. What Cross Distances does for each pair of items from the reference set and the request set is:


    distance(ref_i, req_j)


    If you use Euclidean distance, you get:


    d = sqrt (   (age_i - age_j)² + (salary_i - salary_j)² )
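To make the formula concrete, here is a minimal sketch in plain Python. It is not RapidMiner's implementation; the function name `cross_distances` and the age/salary values are made up for illustration. It only shows the pairwise calculation described above, with one distance per (reference, request) pair.

```python
import math

def cross_distances(reference, request):
    """Return (ref_index, req_index, distance) for every pair of rows.

    Hypothetical helper for illustration only: each row is a tuple of
    numeric attribute values, e.g. (age, salary).
    """
    results = []
    for i, ref_row in enumerate(reference):
        for j, req_row in enumerate(request):
            # Sum of squared differences over all attributes, then square root
            squared = sum((a - b) ** 2 for a, b in zip(ref_row, req_row))
            results.append((i, j, math.sqrt(squared)))
    return results

# Made-up example data: (age, salary)
reference = [(30.0, 50000.0)]
request = [(30.0, 50000.0), (33.0, 50004.0), (40.0, 50000.0)]

for ref_idx, req_idx, d in cross_distances(reference, request):
    print(ref_idx, req_idx, d)
```

Read this way, each number is how far apart two rows are: 0 means the rows are identical on every attribute, and smaller values mean more similar rows. Note that an attribute on a large scale (salary here) can dominate the result unless the attributes are normalized first.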


    I hope this helps.




    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany


  • ekotas Member Posts: 8 Contributor I

    Sorry for the late reply, but that's great. Thank you for your help.
