Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Generate Aggregation and Group by

aruberutouaruberutou Member Posts: 23 Contributor II
edited June 2020 in Help
Hello,

I would like to count each unique item in a column, and create a new column with the corresponding count information for each item.

In excel, I would simply use =countif(ColumnA, ColumnA(i)) and copy down.

I expected that to be equally simple in RapidMiner; perhaps using GenerateAttribute. However, this is no "group by" function, and so my count column always returns "1" for each value.

Am I overlooking something?
Tagged:

Answers

  • aruberutouaruberutou Member Posts: 23 Contributor II
    Okay,

    I feel silly. Its seems a simple solution was to multiply the data, aggregate one thread, then rejoin using the aggregated attribute as the key.

    Its still messier than I would have hoped, but perfectly usable, I guess.

    Nevertheless, please let me know if I am missing something.
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,531 RM Data Scientist
    Your solution would be my way as well.
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.