Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Use AI Models for approximate matches

paolochippaolochip Member Posts: 4 Learner I
Hi,
many many times I'm struggling with joining/lookup values from different data sources that have sometimes slighlty differences hard to be coded or predicted. One of the classical example is two customer lists where the same customer can have capital vs not, commas, spaces and other differences in its name in a very random ways (e.g. CustomerA ltd vs customerA, ltd. or Customer B vs BCustomer, etc..) . I was wondering if there are existing AI models that can be used in RM that can solve this issue.

Thanks,

Paolo

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,529 RM Data Scientist
    Hey,
    last time I did this with a customer we ended up using Levenshein Distance. One may use word embeddings for it if you want to use some 'AI'.
    Cheers,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • paolochippaolochip Member Posts: 4 Learner I
    Hi Martin, thanks for your quick response, we will try and see

    Paolo
Sign In or Register to comment.