The RapidMiner community is on read-only mode until further notice. Technical support via cases will continue to work as is. For any urgent licensing related requests from Students/Faculty members, please use the Altair academic forum here.
TWITTER: Splitting Text from Tweets
Martin_Novak_95
Member Posts: 1 Newbie
Hello, I'm Martin and I'm doing a homework regarding the Twitter analysis.
After getting the data, one cell containing text is providing me with many pieces of information which I would like to be separated or deleted.
For example:
After getting the data, one cell containing text is providing me with many pieces of information which I would like to be separated or deleted.
For example:
RT @DefendingB: @Disinfo1982 @GeorgeFoulkes @CorinneJaneBrya
I'll happily explain it to you. This edition was broadcast on 11th April
2014.… or
|
Tagged:
0
Answers
I don't know exactly what are you trying to accomplish, but for starters, I would check this:
Inside the super operator Process Documents from Data, I have this:
That is NLP 101. Then, you can Filter Stopwords (English), or play with POS Tagging, as pointers to do what you want. I'm sorry I can't be more helpful, but most of my NLP processing is done with Python (which, BTW, you can integrate here by using the Python Scripting Extension, if you want to be even more creative).
All the best,
Rodrigo.