🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉
Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.
Entity Extraction with Process Documents
Using the Text mining extension and coupled process documents operators, we can build a process for entity extraction.
- Text Processing Extension
- Text file of entities to be extracted
- Text file to extract
For the entity file, a simple CSV where each line is an individual entity
Read CSV - Call our entity CSV file
Process documents from data - send the read CSV into the process documents
Inside the process documents from data we will need a filter tokens and a transform cases (lowercase)
Using the word list output from step one, we connect it to a Process Documents operator to extract the word list from the text.