Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

10 most important words

Ev_LazarouEv_Lazarou Member Posts: 3 Learner I
edited June 2020 in Help
I face a problem that i have not solved so far:
I am trying to find the most important words from a dataset. How could I do this?

Best Answer

Answers

  • sara20sara20 Member Posts: 110 Unicorn
    @Ev_Lazarou


    Hello

    Could you please explain your question more?


    Thank you
    Sara
  • Ev_LazarouEv_Lazarou Member Posts: 3 Learner I
    Hello Sara!
    I uploaded 2 csv files, I preprocessed them (according to an exercise of my university exams), and i cross validate them with 3 algorithms. The last part of the exercise ask us to prepare a graph with which are the 10 most important (not most common) words in fake news (1 csv file) and the 10 most important words in real news (other csv file)

    .  
    I am uploading photos of the processes run so far in order to understand a little bit more about the concept. 

  • Ev_LazarouEv_Lazarou Member Posts: 3 Learner I
    edited June 2020
    Dear BalazsBarany

    I have already found the most important words in entire text using weight by information gain operator and on the other hand I used wordlist to data and I found the document occurancy and total occurancy how can I merge it and see the results?
    Where I have to use aggregate and sum? 
    Thank you!


Sign In or Register to comment.