Further Term Frequency measures

limegreenman900limegreenman900 Member Posts: 26 Contributor II
edited November 2018 in Help

Hi everyone,

 

I am comparing a document to another template document. For this I am doing some preprocessing (tokenize, transform cases etc.) and afterwards I am comparing both document with a cross distance operator using cosine similarity. So when my "process documents from files" (using term frequency) operator is finished I get a table with my two examples and XY attributes. Is there now any chance to get some other frequencies on my attributes like logarithm or augmented within my "process documents from files" operator or do I have to work with a seperate loop operator afterwards?

 

Any help appreciated!

Best Answer

  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Solution Accepted

    Hi,

     

    You will need to loop over the attributes afterwards and use Generate Attribute to apply the desired transformations.

     

    Cheers,

    Ingo

Sign In or Register to comment.