Finding exact or approximate matches in two data-sets

Kumar123Kumar123 Member Posts: 1 Newbie
I have two data-sets and both have 5 location columns (Name, Address, City, State, ZIP) and a dummy identifier column which has random numbers like "Data1_12345, Data1_ABD789" in first data-set and "Data2_456, Data2_7891" in second data-set.
I am trying to find exact or approximate matches between two data-sets i.e. which record in first data-set might be a good match with which record in second data-set. 
Please help!

Answers

  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi @Kumar123,

    a very similar problem is being discussed here:
    <b></b><a rel="nofollow" href="https://community.rapidminer.com/discussion/comment/63852" title="Link: https://community.rapidminer.com/discussion/comment/63852">https://community.rapidminer.com/discussion/comment/63852</a><br>
    Regards,
    Balázs
Sign In or Register to comment.