RapidMiner

RapidMiner

Generic join script

Generic join script

This script implements a generic join operation that can compare on arbitrary function results, for example inequality. 

 

Input1: Example set 1

Input2: Example set 2

 

Inside the script, the join attribute names from example set 1 and 2 need to be specified, and the join operation. This can be as simple as e1 == e2 or something like abs(e1 - e2) <= 0.1 for a "fuzzy match" or e1.matches(e2) for a regular expression match. 

 

The output is a joined example set. 

 

The inputs are not allowed to contain identical attribute names, those need to be renamed before using this script.

--
Balázs Bárány
Data Scientist, Vienna
https://datascientist.at

Attachments