Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

About text mining

morphismmorphism Member Posts: 18 Learner I
edited December 2018 in Help

Hello, how are you?


I have interest in text mining using RapidMiner


Is there any way I can do

"Nonnegative Matrix Factorization" or "Probabilistic Latent Sementic Analysis"

or "Nonlinear Transformation" to Document-Term-Matrix??


I want to do Classification, Clustering, Summarizing, Information Retrieval etc for text data


Thank you in advance and have a nice day.

Answers

  • Telcontar120Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    These types of text mining functions do not have native RapidMiner operators to support them. However, you could potentially accomplish them using the relevant R or Python packages through the scripting operators.  Having said that, these techniques are also somewhat more advanced or even esoteric approaches to text mining.  Have you tried the more straightforward bag-of-words approach using standard word vector creation (TF-IDF or similar) yet?  You might want to start with those and see what kind of results you get before moving onto the more complex approaches.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
  • morphismmorphism Member Posts: 18 Learner I

    Hello, Telcontar120.

    Thank you for your explanation.

    I am a beginner in Text mining using RapidMiner.

    I found in books such that,

    SVD or Nonnegative Matrix Factorization techniques can  be used before doing clustering so on,

    and I guessed there are no such operators for that.

    I wanted to know the full possible functions RapidMiner can do for text mining,

    and wanted to use "Nonnegative Matrix technique"


    Then, I found RapidMiner has "Singular Value Decomposition(SVD)"


    So could you please explain to me about How "SVD"  can be applied to text mining projects

    such as clustering, classification??



Sign In or Register to comment.