[solved] Can not sort on a Similarity column

krish_rapid Member Posts: 1 Learner III
edited November 2018 in Help
Hello

I am using the Data to Similarity operator to find similarities between the texts in my data.
The text data comes from an Excel file containing 920 rows and just the text column.
In the operator's parameters, the measure type is set to numerical and the numerical measure to cosine similarity.
After running the process I want to sort on the similarity column, but sorting does not work.  >:(
Does anyone know a possible reason or a solution for this?

Answers

  • MariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi,

    To process textual data you should have a look at the Text Processing extension. Please find documentation on this topic in the post linked in my signature.

    If you can't sort by a column after running a process, the table probably contains more rows than the limit specified by gui.max_sortable_rows in the properties. You can either increase that value, or sort the table directly in the process with the Sort operator. The latter option only works if you are dealing with Example Sets, so in your case you will also need one of the Data to Similarity Data and Similarity to Data operators to get the similarities as an Example Set first.

    Happy Mining!
    Marius
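
To illustrate why the row limit is likely the culprit: a pairwise similarity table over 920 texts has 920 × 919 / 2 = 422,740 unordered pairs, far more rows than the input. The sketch below is plain Python, not RapidMiner code, and only mirrors the idea of turning similarities into an ordinary table and sorting it. The function names (`cosine_similarity`, `similarity_table`) and the whitespace tokenization are assumptions made for the example.

```python
import math
from collections import Counter
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def similarity_table(texts):
    """Return (first_id, second_id, similarity) rows, most similar first."""
    # Hypothetical, very simple tokenization: lowercase and split on whitespace.
    vectors = [Counter(t.lower().split()) for t in texts]
    rows = [
        (i, j, cosine_similarity(vectors[i], vectors[j]))
        for i, j in combinations(range(len(texts)), 2)
    ]
    # Sort on the similarity column, descending.
    rows.sort(key=lambda row: row[2], reverse=True)
    return rows


texts = ["the cat sat", "the cat sat down", "dogs bark loudly"]
table = similarity_table(texts)
for first_id, second_id, sim in table:
    print(first_id, second_id, round(sim, 3))
```

Sorting inside the process, as in `rows.sort(...)` here, avoids depending on the results view being able to sort a large table.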