Counting Emojis
Hi guys, I created a community account just for this problem:
I have an Excel file full of comments extracted from a Facebook group, and I need to mine all the emojis out of it and count them. Could you please tell me how to do it? I've seen one post where this was described, but it uses the Encode/Decode operators, which I don't have, and I don't really understand how to do these kinds of things (I'm also a newbie and downloaded the trial version just for this one-time use). Any help is greatly appreciated.
Answers
In your text, the emojis might already be represented in their Unicode format; if not, the decode step may be useful.
The challenge will then be to find the valid Unicode ranges and to transform the matches into meaningful names for grouping purposes.
You can find the whole Unicode emoji list here: https://unicode.org/emoji/charts/full-emoji-list.html
So a possible workflow could be as follows:
-> use the text operators to tokenize all your content, e.g. by splitting on spaces
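-> keep only the tokens within the emoji Unicode ranges (emoji are spread over several blocks, e.g. 1F300 to 1F5FF and 1F600 to 1F64F; a single range of 1F600 to E007F would miss 1F4A9 and include many non-emoji code points)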
-> count these and, if needed, map them to something meaningful (e.g. 1F4A9 = pile of poo). You could also use the link above to generate this mapping table.
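If it helps to see the idea outside RapidMiner, here is a minimal Python sketch of the same filter-and-count workflow. It is only an illustration under some assumptions: the list of ranges in `EMOJI_RANGES` is a hand-picked, incomplete set of Unicode blocks (not an official emoji definition), the helper names `is_emoji`, `count_emojis` and `describe` are made up for this example, and the comments are assumed to be already loaded as a list of strings (reading the Excel file is left out). It checks each character rather than each space-separated token, since emojis are often typed directly next to words or each other.

```python
from collections import Counter
import unicodedata

# Assumption: a hand-picked, incomplete list of Unicode blocks that hold
# most common emoji. Extend it from the unicode.org chart as needed.
EMOJI_RANGES = [
    (0x2600, 0x26FF),    # Miscellaneous Symbols
    (0x2700, 0x27BF),    # Dingbats
    (0x1F300, 0x1F5FF),  # Miscellaneous Symbols and Pictographs
    (0x1F600, 0x1F64F),  # Emoticons
    (0x1F680, 0x1F6FF),  # Transport and Map Symbols
    (0x1F900, 0x1F9FF),  # Supplemental Symbols and Pictographs
]


def is_emoji(ch):
    """Return True if the character's code point lies in one of the ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in EMOJI_RANGES)


def count_emojis(comments):
    """Count every emoji character across an iterable of comment strings."""
    counts = Counter()
    for text in comments:
        counts.update(ch for ch in text if is_emoji(ch))
    return counts


def describe(ch):
    """Map a character to 'U+XXXX NAME' using Python's Unicode database."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch, 'UNKNOWN')}"


if __name__ == "__main__":
    # Hypothetical sample comments; escapes are used instead of literal emoji.
    comments = ["Great post \U0001F600\U0001F600", "ugh \U0001F4A9", "no emoji here"]
    for ch, n in count_emojis(comments).most_common():
        print(describe(ch), n)
```

Note that this counts single code points, so emoji built from several code points (skin-tone variants, flags, joined sequences) would be counted piece by piece rather than as one symbol.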
https://community.rapidminer.com/discussion/44237/counting-emojis-in-text-mining
Scott
So for example, in "Encode URL", what do I put into the url attribute field? (Obviously not a cell range, lol, hence it doesn't get me anywhere.) And is the selected encoding (UTF-8) right?
Next, for Replace (Dictionary), I have no idea whatsoever what to do with it. Which attribute filter do I need? What do I need to enter for "from attribute" and "to attribute"?
The same goes for "Decode URL": what am I supposed to put in url attribute and encoding?
I would provide screenshots, but I haven't been a member long enough to post them.
I mean, don't get me wrong, this program looks amazing; I just can't seem to learn these things in one day (I was up till 4 AM last night trying to figure things out).
Thanks
https://academy.rapidminer.com/
Scott