"Failing to aggregate data"

jgarciajgarcia Member Posts: 4 Contributor I
edited June 2019 in Help

Hi all,

I'm new to RapidMiner and I'm having trouble getting the aggregate operator to do what I want. I wonder if anyone can help me.

I have a large exampleset (~65000 lines) with about 5 attributes (1 id e 4 nominal). Since I would like to identify the distribution of values in, say, Attr_1, I use aggregate with "Attr_1/count" in Aggregation Attribute, and Attr_1 selected in "group_by_attributes". "Count all combinations", "count only distinct" and "ignore missings" are unchecked.

The result of the count however does not represent the distribution of values in Attr_1 in the original example set.

What am I doing wrong?

I really appreciate any kind of support.

J. Garcia


  • Options
    landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    it's really a pitty. I haven't been aware that there's such a restriction, I always thought it would be Regular Expression parameters. But you are right, seems to me I never faced the problem before. I have discussed with the responsible developer and we agreed to put it on our todo list.

Sign In or Register to comment.