Extract drug names
I am a medical doctor doing research.
I have an Excel sheet with free-text fields that contain drug names.
I want to extract these drug names and count how many drugs are mentioned in each field (Excel cell).
Any suggestions?
Answers
Hi @t_klok,
this is one of those problems where I started with "hey, that's easy" and it turned out to be a 15-operator process. Maybe there is another way to do this? @sgenzer might find one. Anyway, my solution is attached.
You might want to link up with @SvenVanPoucke. He is a physician and our medical expert in the community.
Best,
Martin
Dortmund, Germany
Hi Martin,
Rapid(Miner) answers...
Thanks, I think I understand.
But I would like to find the drug names using a list that contains the drug names.
I do not want to enter all the reference drug names by hand.
Hi,
Sure, you can just read in the Excel file instead of generating the names by hand. That part of the process was only there to generate some dummy data.
Best,
Martin
Dortmund, Germany
Hi @t_klok - I'd want to see the data before really weighing in, but just from what you describe I would use the Text Processing extension, tokenize, and then Filter Tokens (Dictionary) with the drug names. It's very similar to what @mschmitz built with his XML.
Scott
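For readers who want to see the tokenize-then-filter idea outside RapidMiner, here is a minimal Python sketch of the same approach. It is not the RapidMiner process discussed above, only an illustration of the logic: the drug names, the sample cells, and the helper names `tokenize` and `drugs_in_cell` are all made up for the example, and it only handles single-word drug names.

```python
import re

# Hypothetical reference dictionary; in practice it would come from a file.
DRUG_NAMES = {"aspirin", "metformin", "ibuprofen"}

def tokenize(text):
    """Split free text into lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def drugs_in_cell(text, dictionary=DRUG_NAMES):
    """Return the tokens of one cell that appear in the dictionary, in order."""
    return [tok for tok in tokenize(text) if tok in dictionary]

# Made-up free-text cells standing in for the Excel column.
cells = [
    "Patient takes Aspirin and metformin daily.",
    "No medication noted.",
    "Ibuprofen as needed; aspirin stopped.",
]

for cell in cells:
    found = drugs_in_cell(cell)
    # Number of dictionary hits in this cell, followed by the hits themselves.
    print(len(found), found)
```

Lowercasing both the text and the dictionary makes the match case-insensitive; misspelled drug names would still be missed and would need fuzzy matching on top.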
Hi, each country provides a list of official drug names. Additionally, SNOMED can help you find drug names in a text.
Hi,
Please take a look at the technology Microsoft is testing: https://www.youtube.com/watch?v=c6exHAzNwy4#action=share
Cheers Sven
Thank you all.
I have a (large) list of drug names and I want to see whether the free-text fields in an Excel file contain any of these names.
So I query an Excel file with free-text cells, and the reference is a file with all the drug names.
I do not want to enter all these drug names one by one in RapidMiner.
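To make the requirement concrete, here is a rough Python sketch of matching free-text cells against a reference list that is read from a file rather than typed in. It is only an illustration under assumptions, not a RapidMiner process: the reference file is assumed to hold one drug name per row in its first column, an in-memory `io.StringIO` stands in for the real file, the function names are hypothetical, and "count" is taken to mean the number of distinct drug names per cell.

```python
import csv
import io
import re

def load_drug_names(handle):
    """Read one drug name per row (first column) from an open CSV file."""
    return [row[0].strip() for row in csv.reader(handle) if row and row[0].strip()]

def build_matcher(names):
    """Compile one case-insensitive pattern that matches any listed name."""
    # Longest names first, so multi-word names win over shorter overlaps.
    ordered = sorted(set(names), key=len, reverse=True)
    alternatives = "|".join(re.escape(name) for name in ordered)
    return re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)

def count_drugs(text, matcher):
    """Count the distinct drug names mentioned in one free-text cell."""
    return len({m.group(0).lower() for m in matcher.finditer(text)})

# Stand-in for the reference file; the names are made up for the example.
reference_file = io.StringIO("Aspirin\nmetformin\nacetylsalicylic acid\n")
matcher = build_matcher(load_drug_names(reference_file))

# Stand-in for the free-text column of the Excel sheet.
free_text_cells = [
    "Started acetylsalicylic acid, later Aspirin and aspirin again.",
    "Metformin 500 mg",
    "no drugs",
]
counts = [count_drugs(cell, matcher) for cell in free_text_cells]
print(counts)
```

With a real file, `open("drug_names.csv", newline="")` (a hypothetical file name) would replace the `io.StringIO` object. Matching whole phrases instead of single tokens lets multi-word drug names be found as well.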