RapidMiner 9.7 is Now Available
Lots of amazing new improvements including true version control! Learn more about what's new here.
"Getting started with sentiment analysis"
I am seeking advice on how to get started with a sentiment-analysis in the least painfull way. I am currently writing my bachelor thesis about social behaviour on online forums. For this, i have been crawling topics on a danish forum for the last 2 months, and it finally looks like i have the data i need.
I am doing the most basic statistical analysis in SPSS, where i will compare user-rank, the amount of posts the user has made, to the amount of answers to his or her topics. However, i also have the topic text, which i would love to classify using the logic of sentiment analysis.
As you might have guessed, i am totally new to rapidminer. I have been trying to copy-paste the workflow of the accelerator sentiment analysis. But it seems i keep getting errors about my data format. However, I have only two colums: post & category. In the category column, i have mapped some of the rows with "Positive" and others with "Negative". The text in the rows is in danish, and some topics contain links, quotation marks etc.
You can have a look at my csv-file here:
And here's the error i get:
The most important two classifications i need to create/predict are:
- Subject (based on a list of subjects, with each of their keywords)
So here are the questions:
1) What am i doing wrong in the sentiment analysis?
2) Is it possible to make a prediction model, that classifies topics and labels them with subject names, based on their use of keywords(apple, win etc.)?
I have one month left to get to learn this stuff. Does that seem realistic?
Thanks in advance,