mystery science data mining problem 3000
I have an idea I would like to try out ... but I have no idea what operators could accomplish this:
What I would like to do is scan a database ( flat file currently) and find any records with matching fields.
Then assign a "relationship ID" to each field to help find relationships in the data.
( later I would like to include fuzzy matching as well above a certain match threshold, like Jaccard similarity or something similar).
Best regards, J.