Process Documents multiple times to get TF-IDF and TO in one output file

websiteguy Member Posts: 24 Maven
edited November 2018 in Help
Hi, this is my first post, so hello all.

OK, I sorted that out using Multiply, but I need the term frequency per document, not the total occurrences.
So if the word "cheap" appears in both documents, I need the number of occurrences in document A and the number of occurrences in document B, and NOT the combined total of occurrences across both documents.

Can anyone help me out with this? Cheers,

Best Answer

  • websiteguy Member Posts: 24 Maven
    Solution Accepted
    Thanks, that's a good idea; I had not thought of that.


  • JEdward RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 578 Unicorn
    How is your process set up? If you use the Term Occurrences option, it will tell you how many times a word appears in each document.
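
    To make "per document, not combined" concrete, here is a rough sketch in plain Python rather than RapidMiner. The document names, the sample texts, and the `term_occurrences` helper are made up for illustration, and the whitespace tokenization is a simplification of what a real Tokenize step would do.

```python
from collections import Counter


def term_occurrences(documents):
    """Return one Counter per document with that document's own raw term counts.

    `documents` maps a document name to its text. Tokenization here is a
    simple lowercase + whitespace split (an assumption for this sketch).
    """
    return {name: Counter(text.lower().split()) for name, text in documents.items()}


# Hypothetical sample documents.
docs = {
    "A": "cheap flights and cheap hotels",
    "B": "cheap car hire",
}

counts = term_occurrences(docs)
print(counts["A"]["cheap"])  # count in document A only
print(counts["B"]["cheap"])  # count in document B only
```

    Each document keeps its own count for "cheap", so nothing is summed across documents.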

    (This can be handy with large datasets that keep growing when you have limited memory, because you can store the term occurrences for each document and then calculate the TF-IDF in batches as needed.)
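
    As a sketch of that idea outside RapidMiner: if the per-document counts are stored, TF-IDF can be derived from them later without re-reading the texts. The code below uses the textbook form (term count divided by document length, times the log of total documents over documents containing the term); RapidMiner's own TF-IDF may normalize differently, so treat the numbers as illustrative. The function name and the sample counts are assumptions.

```python
import math
from collections import Counter


def tf_idf_from_counts(stored_counts):
    """Compute TF-IDF per document from stored per-document term counts.

    tf  = count of term in document / total terms in document
    idf = log(number of documents / number of documents containing the term)
    """
    n_docs = len(stored_counts)

    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter()
    for counts in stored_counts.values():
        doc_freq.update(counts.keys())

    scores = {}
    for name, counts in stored_counts.items():
        total = sum(counts.values())
        scores[name] = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return scores


# Hypothetical stored term occurrences for two documents.
stored = {
    "A": Counter({"cheap": 2, "flights": 1}),
    "B": Counter({"cheap": 1, "car": 1}),
}

scores = tf_idf_from_counts(stored)
print(scores["A"])
print(scores["B"])
```

    With this formula, a term that appears in every document (here "cheap") gets an IDF of zero, which is why the raw per-document counts are worth keeping alongside the TF-IDF values.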