Mathematical comparison between same attributes names of two different data-set

achint_kumarachint_kumar Member Posts: 16 Contributor II
edited March 2020 in Help
Mathematical comparison between same attributes names of two different data-set based on placement of the attributes. I have 24 attributes each in two data sets, i need to do calculation from first dataset 1st attribute against 1st attribute of second data set. I don't want to pick the attribute name hardcoded, but i need it dynamic as the month name will differ from time to time.

Consider the attached dataset as both datasets(same attribute but different values)

I need to do calculation in 1st attribute (multiply, divide, etc) and compare the value of the same attribute in second dataset. Hope the requirement is clear. Im trying to make this as dynamic as possible. The placement of the attributes in exactly the same in both dataset(along with exact same name)

Looking forward to hearing from you all. Thank you.

Best Answer


  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi @achint_kumar,

    what's the meaning of the first column? You have duplicates in there.
    Would you join the second example set to the first one, based on some criterion?

    If you join two example sets based with identical column names, the attributes from the second one get a postfix attached ("_from_ES2"). 
    If the data sets are joined this way, you can do a Loop Attributes with the appropriate selection regular expression ([a-zA-Z]+-[0-9]+). This will select the attributes from the original example set. Then you do something like Generate Macro inside the loop to append the _from_ES2 postfix, and use Generate Attributes with the loop attribute macro and the generated attribute name macro, and your formula.

Sign In or Register to comment.