Has anyone created any pipelines that can accurately return ICD 9 /10 clinical codes from free text, any insights around this topic are much appreciated !!
I'm not specialist of medecine, so I went to Internet to understand what is ICD 9 classification.
If I good understand ICD 9 is an expression like abc.d (for example 123.5) or abc.de (for example 425.23)where a,b,c,d,e are numbers, to define a disease, right ?
Hi , Thank you looking into this , i have attached the data file below that has the ICD 10 codes, the use case i have is as follows, IF a user types Typhoid Fever, it will in turn return the ICD 10 Code , A04. I was planning on using this file to train my model and then use the input as Free Text to return the ICD 10 with a confidence interval.
Perhaps you could try to specify more the use case you have.
Negation and similar, are important to consider. The example universally used is the radiologist note mentioning: "Probably a possible tumor. "
Next, what is the granularity you need to use while coding? In other words how perfect should your ICD coding be? Is G30 sufficient or do you need G30.1 (in the context of Alzheimer)?
What is your source free text? notes, another coding? How unstructured is the text. Are spelling mistakes and abreviations part of your text?
I have seen acceptable results using Elastic Search in this setting.
What is the language you are working in? More tools are available in English for instance. Happy to help you in the next steps. Sven
Negation and similar, are important to consider. The example universally used is the radiologist note mentioning: "Probably a possible tumor. "
This would be too high level to get a proper Estimation
Next, what is the granularity you need to use while coding? In other words how perfect should your ICD coding be? Is G30 sufficient or do you need G30.1 (in the context of Alzheimer)?
G30 would be ideal
What is your source free text? notes, another coding? How unstructured is the text. Are spelling mistakes and abreviations part of your text?
It is free Text, where users enter the diagnosis in a CRM application
I have seen acceptable results using Elastic Search in this setting.
What is the language you are working in? More tools are available in English for instance. All English.
Answers
Hi @rtrivedi,
I'm not specialist of medecine, so I went to Internet to understand what is ICD 9 classification.
If I good understand ICD 9 is an expression like abc.d (for example 123.5) or abc.de (for example 425.23) where a,b,c,d,e are numbers, to define a disease, right ?
In this case I propose the following process :
To execute this process, you have to :
- Set the path where your text file is stored in the parameters of Process Documents from Files operator.
- Eventually set the file pattern in the parameters of Process Documents from Files operator.
Does this process answer to your need ?
If not, can you be more explicit about what you want to do ?
Regards,
Lionel
Hi , Thank you looking into this , i have attached the data file below that has the ICD 10 codes, the use case i have is as follows, IF a user types Typhoid Fever, it will in turn return the ICD 10 Code , A04. I was planning on using this file to train my model and then use the input as Free Text to return the ICD 10 with a confidence interval.
Here is the link to the ICD-10 training file
https://drive.google.com/open?id=19Y8gn3qRNmIsJdYB1FupSTiXTIGm1pdS
Hi,
We have been working on this topic too.
Perhaps you could try to specify more the use case you have.
Happy to help you in the next steps.
Sven
Thank you for looking into this Sven,