I have about 50 PDF files and need a sentiment score for each one. Can this be done in RapidMiner with any of the extensions? It would also be great if the output could be an Excel sheet with the PDF file name in one column and the corresponding sentiment score in another.
Yes. Use the Text Processing extension to read in your PDFs and do the tokenization, stemming, etc. Then you can either use the WordNet extension and its Extract Sentiment operator, or one of the third-party (fee-based) extensions such as Aylien or Rosette, to compute the sentiment.
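If it helps to see the logic of that pipeline outside RapidMiner, here is a rough Python sketch of the same steps: tokenize each document, score it against a sentiment lexicon, and write one row per file. This is not what the RapidMiner operators do internally. The tiny `LEXICON`, the function names, and the file names are all made up for illustration, and the sketch assumes the text has already been extracted from each PDF.

```python
import csv
import io
import re

# Hypothetical toy lexicon for illustration only; a real run would use a
# full resource such as SentiWordNet or a commercial sentiment service.
LEXICON = {
    "good": 1.0, "great": 1.0, "excellent": 1.0,
    "bad": -1.0, "poor": -1.0, "terrible": -1.0,
}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def sentiment_score(text):
    """Average lexicon value over all tokens; 0.0 for empty text."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    return sum(LEXICON.get(t, 0.0) for t in tokens) / len(tokens)


def write_scores(texts, out):
    """Write one CSV row per document: file name, sentiment score.

    `texts` maps a file name to its already-extracted text.
    """
    writer = csv.writer(out)
    writer.writerow(["file_name", "sentiment_score"])
    for name in sorted(texts):
        writer.writerow([name, f"{sentiment_score(texts[name]):.3f}"])


if __name__ == "__main__":
    # Made-up documents standing in for text extracted from PDFs.
    docs = {
        "report_a.pdf": "The results were great and the team did excellent work.",
        "report_b.pdf": "Poor planning led to a terrible outcome.",
    }
    buffer = io.StringIO()
    write_scores(docs, buffer)
    print(buffer.getvalue())
```

The output is a two-column CSV, which Excel opens directly, so it matches the file-name-plus-score layout asked for in the question.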