"Analytics with RapidMiner Rosette [getting started]"
I'm just getting started with RM for text analytics. Everything has gone well with structured data, but I'm struggling with analysing text documents. Could anyone provide a process showing how to extract entities from a PDF or Word document?
I've searched these forums and Google, and the only solution that seems to work is converting the file to a .txt file first, which isn't ideal.
Any help would be super appreciated.