The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.

Aggregate (like count) on non-sql data (like created example set)

ZiggizagZiggizag Member Posts: 6 Learner I
edited April 2023 in Help

Is there a way to aggregate without SQL?

Let's say I have a huge MySQL data set, and I map some values to arbitrary "categories" through a join with a "created example set". For example, I assign a value "high", to records where "offer_id" was 1,3,5 or 7, and I assign a value "low" to records where "offer_id" was 2, 4 or 6. Presume I do not want to load the mapping "offer_id" --> "category" into a database at the moment, but I like having it in a handy "created example set".

I have noticed, rather unfortunately, that "category" argument (which is set on "example data set" by the "set meta data" operator) is not visible to "aggregate" operator, so I see no easy way to count "high" and "low" records after join.

The question is: how to aggregate by attributes coming from such an "example data set"?

Here you are my process:


  • Options
    yyhuangyyhuang Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 364 RM Data Scientist
    There are at least two methods. You can join the mapping table to the big table. Or replace the offer id with dictionary. Then use aggregate operator.
    I replace the id in golf data, here is the sample process that you can import into your own RM studio.

Sign In or Register to comment.