# How is Jaccard / Dice similarity defined for numerical variables?

hi,

as stated here: http://www.stata.com/manuals13/mvmeasure_option.pdf

Jaccard is TP/(TP+FP+FN)... for as it seems binary variables...

but how is it defined for numerical values?? as it can be chosen e.g as numerical distance measure in k-NN operator..

and similar how is it defined for Dice similarity?

edit: I found the implementation here: https://github.com/rapidminer/rapidminer-studio/tree/master/src/main/java/com/rapidminer/tools/math/similarity/numerical

edit2: ok it seems its simply 2 * x*y / x+y

where X and Y are two vectors with attributes x_i and y_i,

2 * wxy / (wx + wy);

where wxy is the product of the corresponding attributes of the two vectors summed up,

and wx , wy is just the sum of the attributes values of x or y respectively...

looks like some weird distance measure to me, don't know if that makes a lot of sense...

## Comments

1Learner I