entropy
Best Answer
Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635
In that case, yes, it will affect entropy, because TF-IDF is not simply a linear transformation of term frequency. It is impossible to say in advance which will give you better results. As I mentioned before, I would probably start with term occurrences, since they are more representative of the data in its raw form. RapidMiner makes it easy to run it both ways and compare the results!
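To make the point about entropy concrete, here is a minimal sketch in plain Python, outside RapidMiner. It rests on a few assumptions that are not from the thread: entropy is taken as the Shannon entropy of each document's normalized term weights, IDF is the textbook log(N / df) variant (RapidMiner's exact TF-IDF formula and normalization may differ), and the small count matrix is made up purely for illustration.

```python
import math

def shannon_entropy(weights):
    """Shannon entropy (in bits) of a list of non-negative weights."""
    total = sum(weights)
    if total == 0:
        return 0.0  # no weight left at all, treat as zero entropy
    probs = [w / total for w in weights if w > 0]
    return -sum(p * math.log2(p) for p in probs)

def tf_idf_rows(counts):
    """Weight raw counts by a textbook IDF, log(N / df).

    Assumes every term occurs in at least one document (df > 0).
    This is an illustrative variant, not RapidMiner's exact formula.
    """
    n_docs = len(counts)
    n_terms = len(counts[0])
    df = [sum(1 for row in counts if row[j] > 0) for j in range(n_terms)]
    idf = [math.log(n_docs / d) for d in df]
    return [[row[j] * idf[j] for j in range(n_terms)] for row in counts]

# Hypothetical term-occurrence matrix: rows are documents, columns are terms.
counts = [
    [2, 2, 0],
    [1, 0, 3],
    [1, 1, 1],
]

tfidf = tf_idf_rows(counts)
raw_entropy = [shannon_entropy(row) for row in counts]
tfidf_entropy = [shannon_entropy(row) for row in tfidf]

for i, (r, t) in enumerate(zip(raw_entropy, tfidf_entropy)):
    print(f"doc {i}: occurrences={r:.3f} bits, tf-idf={t:.3f} bits")
```

Because each term gets a different IDF weight (here the term that appears in every document is weighted down to zero), the per-document distributions change shape rather than just scale, so the entropy values differ between the two representations. Which one works better downstream still has to be tested on your own data.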
Answers
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts