[SOLVED] Getting TF-IDF from unpivoted data

louism Member Posts: 8 Contributor II
edited November 2018 in Help
Hi, I am trying to do text mining.  I don't have the original documents, but my words are already in a database.  For example:

Doc A:  How are you?
Doc B: I am fine

What I have is a MySQL table like this:

A How
A are
A you
B I
B am
B fine

Since I am a total newbie and rely heavily on text mining tutorials, it would perhaps be easier for me to go back to the document form. That way I can "plug it" into what I see in most text mining tutorials and generate my TF-IDF word vectors after my data cleanup.

Answers

  • louism Member Posts: 8 Contributor II
    Solved this by using the GROUP_CONCAT aggregate function in MySQL to rebuild a table with one row per document, where a text field holds all of that document's words appended one after the other.  :)
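
    For anyone who wants to try the idea outside MySQL, here is a minimal sketch in Python. It is not the original poster's query: the table name `words` and the columns `doc_id` and `word` are hypothetical, SQLite's `group_concat` stands in for MySQL's GROUP_CONCAT, and the TF-IDF formula (term count divided by document length, times log(N/df)) is just one common variant that may not match what RapidMiner computes.

```python
import math
import sqlite3
from collections import Counter

# Hypothetical schema: the thread does not give the real table or column names.
rows = [("A", "How"), ("A", "are"), ("A", "you"),
        ("B", "I"), ("B", "am"), ("B", "fine")]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (doc_id TEXT, word TEXT)")
conn.executemany("INSERT INTO words VALUES (?, ?)", rows)

# One row per document; SQLite's group_concat stands in for MySQL's GROUP_CONCAT.
# Word order within a document is not guaranteed, which is fine for bag-of-words.
docs = dict(conn.execute(
    "SELECT doc_id, group_concat(word, ' ') FROM words GROUP BY doc_id"
))


def tf_idf(docs):
    """Return {doc_id: {term: weight}} with tf = count / doc length, idf = log(N / df)."""
    tokens = {doc_id: text.lower().split() for doc_id, text in docs.items()}
    n_docs = len(tokens)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for toks in tokens.values() for term in set(toks))
    weights = {}
    for doc_id, toks in tokens.items():
        counts = Counter(toks)
        weights[doc_id] = {
            term: (count / len(toks)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        }
    return weights


weights = tf_idf(docs)
```

    In MySQL itself the rebuilt text is cut off at `group_concat_max_len` (1024 by default), so long documents may need that setting raised before running the query.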