"Add spelling filter"
There is filter in text processing to remove dictionary words (stop words). Is there a filter to remove none-dictionary words?
One of the use is to filter words NOT in user-file. If the user-file is "linux.words", English dictionary, then this will remove none-English words. This is useful when we want to remove bad words from poorly scanned collection of OCR text files.
One of the use is to filter words NOT in user-file. If the user-file is "linux.words", English dictionary, then this will remove none-English words. This is useful when we want to remove bad words from poorly scanned collection of OCR text files.
Tagged:
0
Answers
Best,
Marius