RapidMiner

"Append with Union" building block

Elite II

"Append with Union" building block

Another user posted a very helpful building block for "appending" collections:

Handling collections: behavior of union vs append

 

If you get a "collection" result with each element having the same attributes, you can use the Append operator to "flatten" it to one example set. 

 

However, sometimes the attributes are not completely identical. This building block can work with this kind of data, as it uses the Union operator which creates a superset of the attributes.

 

A very useful building block, thanks to user @aruberutou!

--
Balázs Bárány
Data Scientist, Vienna
https://datascientist.at
5 REPLIES
Highlighted
Elite III

Re: "Append with Union" building block

Thanks for pointing this out @BalazsBarany and thanks for creating it @aruberutou!

 

I've added an updated version of to the original post which scales nicely on larger datasets which you both might find useful. 

Let me know what you think. 

 

-- Training, Consulting, Sales in China, Hong Kong & Taiwan --
www.RapidMinerChina.com
Contributor II

Re: "Append with Union" building block

Hello 

I'm a very beginner in Rapidminer, and try to manipulate some of my datas... trials and error type of stuff Smiley Happy

 

After a loop attribute, I've got a collection of example set and try your "building block" on simple datas, the result is not what I'm looking for.

 

here is the capture.

it adds attribute(id) which is ok, but in this case it adds same datas of the others attribute at the end; I'd like only 14 lines

 

thanks

 

 

 

 

Attachments

Elite II

Re: "Append with Union" building block

If you only want the 14 original lines, do an Aggregate operation with the original data as the grouping attributes and the max() or avg() of the new columns.

 

Why are you calculating the (standard) deviation of the id variable? It's numeric but the numbers don't have actual meaning.

--
Balázs Bárány
Data Scientist, Vienna
https://datascientist.at
Contributor II

Re: "Append with Union" building block

I just took an example for training so no real signification of the new attributes, just a try:

I took tutorial process of the loop attribute

then I store the collection and try merge this collection into a single table using your block. the problem is it gives a table with new colums for each attribute coming from each example set, that's OK but it replicates twice the others parameters (so in that case 2 times the same 14 examples with "?" and I just the 14 examples and get rid of the "?" like in picture attached in my previous post

 

 

Elite II

Re: "Append with Union" building block

Did you try the Aggregate route? 

 

We should take this discussion into the Studio area anyway; the output you're getting is exactly what you're supposed to get from "Append with Union". 

--
Balázs Bárány
Data Scientist, Vienna
https://datascientist.at