
[SOLVED] Similarity to data: sorting and selecting rows

UgoUgo Member Posts: 20 Contributor II
edited November 2018 in Help
Hello,

I have used a data set to generate a matrix with the similarity values (using the Similarity to Data operator).
I would like to go through each example and:
1. sort the matrix row
2. select the top k elements

So far I have managed to use a "Loop Examples" operator to go over each line of the data set, but
I cannot work out how to extract the whole line (all attributes, for sorting). How can I do this?
The "Extract Macro" operator seems to require that I specify a single attribute  :( .

Assuming I can get a data row, I would also like to ask:
how do we sort a data row and select its top k elements?

I have also looked at the "Sort" operator, but that seems to work on data set columns only.

TIA,
Hugo F.
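
For readers who only need the logic of the two steps asked about above (sort a matrix row, keep the top k), here is a minimal sketch in plain Python, outside RapidMiner. The function name `top_k_per_row`, the ids and the similarity values are made up for illustration; it assumes a square matrix whose rows and columns follow the same id order, and it skips each example's similarity to itself.

```python
import heapq

def top_k_per_row(ids, matrix, k):
    """Map each row id to its k most similar other ids, best first."""
    result = {}
    for row_id, row in zip(ids, matrix):
        # Pair every similarity with its column id, skipping the self-similarity.
        candidates = [(other, sim) for other, sim in zip(ids, row) if other != row_id]
        # nlargest returns the k best pairs sorted from highest to lowest similarity.
        result[row_id] = heapq.nlargest(k, candidates, key=lambda pair: pair[1])
    return result

# Illustrative data only (not taken from the thread).
ids = ["a", "b", "c", "d"]
matrix = [
    [1.0, 0.9, 0.2, 0.5],
    [0.9, 1.0, 0.4, 0.1],
    [0.2, 0.4, 1.0, 0.7],
    [0.5, 0.1, 0.7, 1.0],
]
top2 = top_k_per_row(ids, matrix, k=2)
for row_id, neighbours in top2.items():
    print(row_id, neighbours)
```

Using `heapq.nlargest` avoids sorting the full row when k is small; for a distance measure instead of a similarity, `heapq.nsmallest` would be the counterpart.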


Answers

  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi Hugo,

    maybe the Transpose operator can help you!

    Best regards,
    Marius
  • UgoUgo Member Posts: 20 Contributor II
    Hi Marius,

    Thanks for the input. Unfortunately, the "Transpose" operator won't work here. I have just realized something.
    The matrix output of the "Similarity to Data" operator is _not_ in fact a matrix  ??? .
    The meta data I have is:

    Similarity to Data (2).exampleSet (exampleSet)
    Meta data: Data Table
    Number of examples = 0
    At most 1 attribute:
    Generated by:

    ROLE  NAME  TYPE         RANGE                                                  MISSING
    id    ID    polynominal  ⊇[efl123009c, efls001012, efls001020, efls001040....]  = 0

    So even if I could extract the whole line, it only has one attribute.
    I expected a matrix in which the ID is as shown above, with all of the range values above as attributes.
    So my question now is how can I actually get the data row from the

    Can anyone point out an example that uses the results of "Similarity to Data"?

    Thank you,
    Hugo F.

  • UgoUgo Member Posts: 20 Contributor II
    In the end I took another approach and did not use the matrix.
    For more information see:

    http://rapid-i.com/rapidforum/index.php/topic,6212.msg21746.html#msg21746

    Hugo F.
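
For anyone landing here with the same question, a top-k selection does not strictly need a wide matrix. The sketch below, in plain Python, assumes the similarities are available in long form as (first id, second id, similarity) triples; the thread does not confirm the exact layout of the "Similarity to Data" output, so treat that as an assumption. The function name `top_k_from_pairs` is hypothetical, and the similarity values are invented (only the ids are borrowed from the meta data quoted above).

```python
from collections import defaultdict

def top_k_from_pairs(pairs, k):
    """Group (first_id, second_id, similarity) triples by first_id and
    keep the k most similar partners of each id, best first."""
    by_first = defaultdict(list)
    for first_id, second_id, sim in pairs:
        by_first[first_id].append((second_id, sim))
    return {
        first_id: sorted(partners, key=lambda p: p[1], reverse=True)[:k]
        for first_id, partners in by_first.items()
    }

# Assumed long-form layout; the similarity values are made up.
pairs = [
    ("efl123009c", "efls001012", 0.8),
    ("efl123009c", "efls001020", 0.3),
    ("efl123009c", "efls001040", 0.6),
    ("efls001012", "efl123009c", 0.8),
    ("efls001012", "efls001020", 0.5),
]
top2_pairs = top_k_from_pairs(pairs, k=2)
for first_id, partners in top2_pairs.items():
    print(first_id, partners)
```

Grouping by the first id plays the role of "one matrix row per example", so no pivot or transpose step is needed before picking the top k.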