The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

Generate Aggregation and Group by

aruberutouaruberutou Member Posts: 23 Contributor II
edited June 2020 in Help
Hello,

I would like to count each unique item in a column, and create a new column with the corresponding count information for each item.

In excel, I would simply use =countif(ColumnA, ColumnA(i)) and copy down.

I expected that to be equally simple in RapidMiner; perhaps using GenerateAttribute. However, this is no "group by" function, and so my count column always returns "1" for each value.

Am I overlooking something?
Tagged:

Answers

  • Options
    aruberutouaruberutou Member Posts: 23 Contributor II
    Okay,

    I feel silly. Its seems a simple solution was to multiply the data, aggregate one thread, then rejoin using the aggregated attribute as the key.

    Its still messier than I would have hoped, but perfectly usable, I guess.

    Nevertheless, please let me know if I am missing something.
  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
    Your solution would be my way as well.
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.