The RapidMiner community is on read-only mode until further notice. Technical support via cases will continue to work as is. For any urgent licensing related requests from Students/Faculty members, please use the Altair academic forum here.
Sentiment Analysis Vader Normalization of results
Hi RapidMiner Community,
I have a question about the results of my sentiment analysis I conducted for online reviews from Airbnb that I want to compare with ratings given from the reviewers. As a result of the Extract Sentiment operator using Vader I get unstandardized values that I can hardly compare with other values like ratings (am I correct).
In another post in the forum by @mschmitz I found two formulas to normalize the sentiment scores and get results between -1 and +1:
- Sentiment Score/Total tokens
- Sentiment Score/(Total tokens - uncovered toked)
- Sentiment Score/(Total tokens - uncovered toked)
I used both formulas for my data and obviously got two different results. Can someone help me with the interpretation of the results for both formulas? What is the 'right' formula? What is the best way to standardize my data to be able to compare it with the ratings?
Thanks a lot for your help! Really appreciate it!
Tagged:
0
Answers
Dortmund, Germany