Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Replacing whole words with dictionnary
Hi Rapid miner community,
I don't find the solution to replace whole words after a "read excel" operator. If I use a "Replace (dictionary)" operator linked with an excel file, words are partially substituted - as they are not tokenized - and sometimes part of the word is substituted and aggregated with the rest of the word. for instance, if in my dictionary I have many entries for the misspelling form of the word « application » (e.g app, apple, etc.) the result can be « applicationlicationncation » ... The reason is that, in my data set, I have many terms misspelled therefore I'd like to use such process to substitute the common misspelling forms.
Inside the « text processing » operator, after tokenization I could do it, but there’s no operator to handle this (as far I’ve seen). the « replace token could do the job, but I have to enter one by one all the entries that I presently have in my misspelling dictionary..
thanks for your help !
thanks for your help !
0
Best Answer
-
kayman Member Posts: 662 UnicornDid you tick the regex box on the replace operator? You also do not need the \b in your with, only in your substitute part.
Extending the match range is a question of modifying what you want to see in your boundaries, as in attached simple example.
Input : my andoid aplication isn't an androit app.
output : my android application isn't an android application.
5
Answers
best regards