The Altair Community and the RapidMiner community is on read-only mode until further notice. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options
Process Documents multiple times to get TF-IDF and TO in one output file
websiteguy
Member Posts: 24 Maven
Hi , this is my first post, so hello all.
Ok sorted that using multiply but need the term frequency but not total occurances but by document.
So if the word cheap appears in both documents I need to get the amount of occurances in document A and the amount of occurances in document B and NOT the combined total off occurances across both documents.
anyone healp me out with this?? cheers,
Ok sorted that using multiply but need the term frequency but not total occurances but by document.
So if the word cheap appears in both documents I need to get the amount of occurances in document A and the amount of occurances in document B and NOT the combined total off occurances across both documents.
anyone healp me out with this?? cheers,
Tagged:
0
Answers
(This can be handy in large datasets that are often growing and you have limited memory because you can then batch create TF-IDF by storing the term occurances for each document and calculating the TF-IDF as needed).