Image Mining

shamini Member Posts: 2 Contributor I
edited November 2018 in Help
I'm doing image mining with RapidMiner.
To do that, I converted my images to grayscale, extracted the text from them using OCR, and am now performing text mining on the extracted text.

When I run text mining with the Tokenize operator, a stray character is always introduced, which breaks the tokenization: instead of splitting the text into words, it splits it into letters.

How can I remove the stray character?
Please help.
This has been bugging me for the past 3 weeks, so any help or suggestions are welcome.
Thanks in advance.
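
One way to approach the stray-character question is to clean the OCR output before it reaches the tokenizer. The sketch below is plain Python, not a RapidMiner process, and it rests on assumptions the post does not confirm: that the stray character is a non-alphanumeric one (for example a form feed or a pipe symbol left by the OCR step), and that splitting on whitespace is acceptable afterwards. The function names `clean_ocr_text` and `tokenize` are hypothetical.

```python
import re
import unicodedata
from typing import List


def clean_ocr_text(text: str) -> str:
    """Replace anything that is not a letter, digit, or whitespace with a space."""
    # Normalize compatibility forms (e.g. ligatures) to their plain equivalents.
    text = unicodedata.normalize("NFKC", text)
    # Turn punctuation, symbols, control characters, and underscores into spaces.
    text = re.sub(r"[^\w\s]|_", " ", text)
    # Collapse the whitespace runs left behind and trim the ends.
    return re.sub(r"\s+", " ", text).strip()


def tokenize(text: str, min_length: int = 2) -> List[str]:
    """Split cleaned text on whitespace, lowercase it, and drop very short tokens."""
    cleaned = clean_ocr_text(text)
    return [tok.lower() for tok in cleaned.split() if len(tok) >= min_length]


if __name__ == "__main__":
    sample = "Invoice\x0c total | amount ~ due"
    print(tokenize(sample))
```

Note that `min_length=2` also discards legitimate one-letter words such as "a" and "I", so it may need to be set to 1 depending on the text.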


    alsaqer002 Member Posts: 5 Contributor II
    Hello Shamini,

    Try using the "Generate n-Grams (Terms)" operator to get words instead of letters.

    Could you please tell me how you extracted the text from the images using OCR?
