Jaccard distance with Rapid Miner

giggs102 Member Posts: 2 Contributor I
edited November 2018 in Help
Hello,

First, I want to know whether the Jaccard distance is implemented in RapidMiner. I would also like to know how to compute the distance between objects/items using RapidMiner operators, and what data format is expected. For example, how would I compute the distances between the items in the following matrix and rank them?

       val1  val2  val3  val4
item1  1     0     1     1
item2  0     1     0     1
item3  1     1     1     0
item4  0     1     1     1

Answers

  • MariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    To my knowledge, the Jaccard distance is not implemented in RapidMiner.
    To calculate distances in general, e.g. the distance from each example to every other example, use the Cross Distances operator. Connect the same dataset to both the req and ref inputs. You can then choose the measure you'd like to calculate from a number of available ones.

    Best regards,
    Marius
  • awchisholm RapidMiner Certified Expert, Member Posts: 458 Unicorn
    Hello

    There is a chapter in this upcoming book http://www.crcpress.com/product/isbn/9781482205497 that describes how to do this. If you can't wait that long, I can tell you that it uses an R package called profdpm. The function pci() in that package calculates the Jaccard index (the Jaccard distance is 1 minus that index).

    regards

    Andrew
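
For readers who just want to check the numbers for the matrix in the question, here is a minimal sketch in plain Python, outside RapidMiner. It is not the approach from either answer: it treats each row as the set of columns holding a 1 and computes the Jaccard distance as 1 - |A ∩ B| / |A ∪ B|. The names `data` and `jaccard_distance` are made up for this illustration, and returning 0.0 for two all-zero rows is an assumed convention.

```python
from itertools import combinations

# Binary matrix from the question: one row per item, columns val1..val4.
data = {
    "item1": [1, 0, 1, 1],
    "item2": [0, 1, 0, 1],
    "item3": [1, 1, 1, 0],
    "item4": [0, 1, 1, 1],
}


def jaccard_distance(a, b):
    """Jaccard distance between two equal-length 0/1 vectors."""
    both = sum(1 for x, y in zip(a, b) if x and y)    # size of the intersection
    either = sum(1 for x, y in zip(a, b) if x or y)   # size of the union
    if either == 0:
        return 0.0  # assumed convention: two empty sets are identical
    return 1.0 - both / either


# Distance for every unordered pair, ranked from closest to farthest.
pairs = sorted(
    (jaccard_distance(data[p], data[q]), p, q)
    for p, q in combinations(data, 2)
)

for dist, p, q in pairs:
    print(f"{p} - {q}: {dist:.3f}")
```

With this data the closest pair is item2 and item4 (they share val2 and val4 out of three columns set in either row), and the farthest pairs are item1/item2 and item2/item3, which share only one of four columns.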