Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
How to Extract Numbers from Text Mining
Hi,
i have tokenize and filtered out some words which left only numbers and english words,
then my problem now is i want to extract out both numbers and english words seperately and putting them in different results,
how can i achieve that?
Btw, i'm using text mining tool here, the file is in .txt format and is semi-structured.
Thanks for helping.
i have tokenize and filtered out some words which left only numbers and english words,
then my problem now is i want to extract out both numbers and english words seperately and putting them in different results,
how can i achieve that?
Btw, i'm using text mining tool here, the file is in .txt format and is semi-structured.
Thanks for helping.
0
Answers
sorry, I did not get your point. Can you give us an example, best of the data before the desired transformation and what you would like to achieve?
Cheers,
Ingo
i had solved the problem actually.
okay i will rephrase my problem here:
i had a text file, for example : "Bobbie goes to school today in the morning at 8 oclock with his 30 packs of noodles."
i would like to filter out english words (bobbie, goes, to ... etc) and as well numberings (8, 30)
but i found that the filter only allow to do one thing only, either english word or numberings,
but does not allow for filtering both.
i could not find other way,
but lastly i load the file 2 times, and do filtering seperately and i got it solved.
thanks.
I am a beginner of Rapidminer
i am facing a similar issue that i want to extract the numerics from the text, eg:
" the task finished at the year 2018"
I just need the numeric information " 2018". how to filter the words when tokenizing?
Thanks
much appreciate
Ingo
Same outcome of course but only one time dataload.
Please see below screenshots. You have a big icon "Ask Question" on the top right of this community window. If you click that you can read some quick tips on posting question. You need to provide the title of the question and give a detailed version of your process and issue.
Once you click this, you get the below screen. Read the three steps provided in the below screen and provide your detailed explanation of the issue.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing