Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
"Read from Database, Process Documents From Data, kMeans Clustering"
natenash203
Member Posts: 2 Contributor I
Greetings - My question concerns what I imagine is something very simple that as a newbie, I am merely overlooking. However, after reading the manual and similar posts (like this one: http://rapid-i.com/rapidforum/index.php/topic,5518.0.html ), I am still at a loss.
I am reading data from a DB with the following columns:
Many thanks in advance for helping me through what I imagine is a total noob oversight.
I am reading data from a DB with the following columns:
- entity_id
- raw_text
- Add Set Role operator for the attribute entity_id to id, after Process Documents From Data operator
- Add Set Role operator for the attribute entity_id to id, before Process Documents From Data operator
Many thanks in advance for helping me through what I imagine is a total noob oversight.
Tagged:
0
Answers
Best regards,
Marius
My data looks like this in the database I am querying. In real life, that values within the raw_text column are significantly longer. Also, I rename my database's id column to entity_id and only return ids over 1000, as well as limit it to 100 rows.