Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Cutting reviews in phrases, while still knowing to what video gamthey belong to.
Hi all.
I have multiple reviews from video games in a dataset, in which i want to cut into phrases. However, I still need to know to which video game they belong to. So lets say we have Game A and Game B. If Game A has 4 phrases, I want to chop up the document to those 4 phrases, while in the next column i can see to which game the sentence belong to.
I have tried some methods, but unfortunately my experience with rapidminer is too limited to get this done.
I have multiple reviews from video games in a dataset, in which i want to cut into phrases. However, I still need to know to which video game they belong to. So lets say we have Game A and Game B. If Game A has 4 phrases, I want to chop up the document to those 4 phrases, while in the next column i can see to which game the sentence belong to.
I have tried some methods, but unfortunately my experience with rapidminer is too limited to get this done.
0
Answers
have a look at this process: The first part is only about creating fake data for demo purposes. The real meat begins with "Split". You split on either . or ! followed by a whitespace. De-Pivot then grabs the resulting columns and converts them to rows.
Note that I have added a breakpoint before "Split" so you can inspect the input data which is probably somewhat similar to what you have. Once you run the process, you will activate the breakpoint, pausing the process. After looking at the data, you can press the (now green) run button again to finish the process.
Cheers,
Marco