
Generic join script

This script implements a generic join operation that matches rows on the result of an arbitrary comparison function, for example an inequality, rather than only on equality.


Input1: Example set 1

Input2: Example set 2


Inside the script, specify the names of the join attributes from example sets 1 and 2, along with the join operation. The operation can be as simple as e1 == e2, or something like abs(e1 - e2) <= 0.1 for a "fuzzy match", or e1.matches(e2) for a regular expression match.


The output is a joined example set. 


The inputs must not contain identical attribute names; rename any such attributes before using this script.
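
For readers who want to see the idea outside RapidMiner, here is a minimal sketch of such a join in plain Python. It is not the script itself: the function name generic_join, the representation of an example set as a list of dicts, and the sample attribute names (id1, x, id2, y) are all assumptions made for illustration. The predicate receives e1 and e2, the join attribute values from example sets 1 and 2.

```python
import re
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]  # assumption: one example (row) is a dict of attribute -> value


def generic_join(
    set1: List[Row],
    set2: List[Row],
    attr1: str,
    attr2: str,
    predicate: Callable[[Any, Any], bool],
) -> List[Row]:
    """Hypothetical helper: join two row lists wherever predicate(e1, e2) is true."""
    # Mirror the restriction from the post: the inputs must not share attribute names.
    names1 = set().union(*(row.keys() for row in set1))
    names2 = set().union(*(row.keys() for row in set2))
    clash = names1 & names2
    if clash:
        raise ValueError(f"rename duplicate attributes first: {sorted(clash)}")

    joined = []
    # Nested loop: every pair of rows is tested, so any predicate can be used.
    for row1 in set1:
        for row2 in set2:
            if predicate(row1[attr1], row2[attr2]):
                joined.append({**row1, **row2})
    return joined


# Illustrative data, not taken from the post.
set1 = [{"id1": 1, "x": 1.00}, {"id1": 2, "x": 2.50}]
set2 = [{"id2": "a", "y": 1.05}, {"id2": "b", "y": 2.70}]

# "Fuzzy match": values within 0.1 of each other.
fuzzy = generic_join(set1, set2, "x", "y", lambda e1, e2: abs(e1 - e2) <= 0.1)

# Inequality join: keep pairs where e1 is smaller than e2.
smaller = generic_join(set1, set2, "x", "y", lambda e1, e2: e1 < e2)

# Regular expression match: e1 must fully match the pattern held in e2.
names = [{"name": "abc123"}, {"name": "xyz"}]
patterns = [{"pattern": r"[a-z]+\d+"}]
regex = generic_join(
    names, patterns, "name", "pattern",
    lambda e1, e2: re.fullmatch(e2, e1) is not None,
)
```

Because every pair of rows is compared, the cost grows with the product of the two input sizes; that is the price of allowing arbitrary comparisons instead of plain equality.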

Balázs Bárány
Data Scientist, Vienna