RapidMiner 9.7 is Now Available

Lots of amazing new improvements including true version control! Learn more about what's new here.


Jaccard distance with Rapid Miner

giggs102giggs102 Member Posts: 2 Contributor I
edited November 2018 in Help

First, I want to know if there is jaccard's distance implemented in Rapid miner ? I want to know how to compute the distance between objects/items using Data miner widgets . Also , I don't know what is the data format. Say for example I need to find the distance between the items in the following matrix and rank them how to do it using widgets?

val1 val2 val3 val4
item1 1 0 1 1
item2 0 1 0 1
item3 1 1 1 0
item4 0 1 1 1


  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869   Unicorn
    To my knowledge the jaccard distance is not implemented in RapidMiner.
    To calculate distances in general, e.g. the distance of each example to each other, use the Cross Distances operator. Use the same dataset as input for both req and ref. You can select from a number of measures the one that you'd like to calculate.

    Best regards,
  • awchisholmawchisholm RapidMiner Certified Expert, Member Posts: 458   Unicorn

    There is a chapter in this upcoming book http://www.crcpress.com/product/isbn/9781482205497 that describes how to do this.Β  If you can't wait that long, I can tell you that it uses an R package called profdpm. The function pci() within that calculates the Jaccard index.


Sign In or Register to comment.