Hi! Question on data extraction steps for Word & PDFs
Hi folks,
I'm new to this software and trying to figure out some basics.
I have Word and PDF files (they're reports from various companies), and I want to search them for keywords (there are about 20 I'm interested in) to find out how often each one occurs. Ideally, I'd like to search the documents and pull the data into a spreadsheet. It's very basic, but I can't figure out how to do it... ;(
I've put the docs into a folder and tried to extract the data, but then I get lost as I'm not sure what to do next. If there's a quick step-by-step guide, that would be great. Apologies if this has been asked before, but I couldn't find it.
Many thanks!
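For anyone who wants to see the logic of the task outside RapidMiner, here is a minimal Python sketch of the counting step described above. It is not a RapidMiner process and not necessarily how any answer in this thread does it. It assumes the text has already been extracted from the Word and PDF files (that step needs a separate library and isn't shown), and the keywords, file names, and sample text are made up for illustration.

```python
import csv
import re

# Hypothetical keywords and already-extracted document text (illustration only).
KEYWORDS = ["risk", "growth", "supply chain"]
DOCUMENTS = {
    "company_a.txt": "Growth was strong. Supply chain risk remains; risk controls improved.",
    "company_b.txt": "No growth this year. Risks were low.",
}


def count_keywords(text, keywords):
    """Count case-insensitive, whole-word (or whole-phrase) matches per keyword."""
    counts = {}
    for kw in keywords:
        # \b anchors the match at word boundaries, so "risk" does not match "risks".
        pattern = r"\b" + re.escape(kw) + r"\b"
        counts[kw] = len(re.findall(pattern, text, flags=re.IGNORECASE))
    return counts


def build_table(documents, keywords):
    """Return a header row followed by one row of counts per document."""
    rows = [["document"] + list(keywords)]
    for name, text in documents.items():
        counts = count_keywords(text, keywords)
        rows.append([name] + [counts[kw] for kw in keywords])
    return rows


def write_csv(rows, path):
    """Write the rows to a CSV file that a spreadsheet program can open."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(rows)


if __name__ == "__main__":
    write_csv(build_table(DOCUMENTS, KEYWORDS), "keyword_counts.csv")
```

The result is one row per document and one column per keyword. Matching is whole-word only, so plurals such as "Risks" are not counted under "risk"; if you want those included, you would need to list them as separate keywords or stem the text first.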
Best Answer
pimlico35 Member Posts: 4 Learner I
Thanks Martin - I will try that now. I'm just trying to find my way around the operators and work out the steps needed to get it to work!
Answers