Label is positive but the sentiment score is negative

sukh Member Posts: 43 Contributor I
edited November 2018 in Help
Hi all,
I am working on sentiment analysis using the IMDb data set. It has two directories, one for positive and one for negative reviews, and each directory contains text files. These text files are used for training, but the problem shows up in the results:
some text files from the positive directory get a negative sentiment score, and some from the negative directory get a positive one.

For example:

label       Sentiscore    document id
positive    -0.076        1
negative    0.34          2

How can we tell whether a document is positive or negative?

Thanks and regards,


  • awchisholm RapidMiner Certified Expert, Member Posts: 458   Unicorn

    Do you know how the positive and negative sentiments were assigned to the documents in the first place? Do you have any reason to think that a sentiment calculated by you would match the original sentiment?


  • sukh Member Posts: 43 Contributor I
    Sir, I used a standard dataset downloaded from:


    polarity dataset v2.0 ( 3.0Mb) (includes README v2.0): 1000 positive and 1000 negative processed reviews. Introduced in Pang/Lee ACL 2004. Released June 2004.

    This is the dataset I used.
    Thanks and regards,
  • awchisholm RapidMiner Certified Expert, Member Posts: 458   Unicorn
    The dataset is marked as positive or negative based on analysis of stars given by people. I would be amazed if sentiment analysis based on words would give the same result.
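
To illustrate the point, here is a minimal Python sketch of how a word-based score can disagree with a label that came from the reviewer's rating. Everything in it is an assumption made for illustration: the tiny `TOY_LEXICON`, the helper names, the zero threshold, and the four example reviews are invented, and this is not the dictionary or scoring method that any RapidMiner operator uses.

```python
# Toy word lexicon: purely illustrative weights, not a real sentiment dictionary.
TOY_LEXICON = {"good": 1.0, "great": 1.5, "bad": -1.0, "boring": -1.0, "awful": -1.5}


def senti_score(text):
    """Average lexicon weight over all tokens (0.0 for empty text)."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    return sum(TOY_LEXICON.get(token, 0.0) for token in tokens) / len(tokens)


def predicted_label(score, threshold=0.0):
    """Turn a score into a label by its sign (threshold is an assumption)."""
    return "positive" if score > threshold else "negative"


def agreement(docs):
    """Fraction of (label, text) pairs whose predicted label matches the given label."""
    hits = sum(predicted_label(senti_score(text)) == label for label, text in docs)
    return hits / len(docs)


# Invented reviews: the label stands in for the rating-based label in the data set.
docs = [
    ("positive", "the villain is awful and bad but the film is great"),
    ("negative", "good cast great sets and yet i left early"),
    ("positive", "a great film"),
    ("negative", "boring and bad"),
]

for label, text in docs:
    score = senti_score(text)
    print(f"{label:<9} {score:+.3f}  {predicted_label(score)}")

print("agreement with labels:", agreement(docs))
```

The first two reviews get a score whose sign is opposite to their label, because the scorer only counts words and knows nothing about the overall verdict. Measuring the agreement over the whole data set, rather than looking at single documents, shows how far a word-based score can be trusted as a stand-in for the label.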
  • sukh Member Posts: 43 Contributor I
    But if the label is negative, why does the sentiment score come out positive, and vice versa? I could not figure this out.