Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Generate Aggregation and Group by
aruberutou
Member Posts: 23 Contributor II
Hello,
I would like to count each unique item in a column, and create a new column with the corresponding count information for each item.
In excel, I would simply use =countif(ColumnA, ColumnA(i)) and copy down.
I expected that to be equally simple in RapidMiner; perhaps using GenerateAttribute. However, this is no "group by" function, and so my count column always returns "1" for each value.
Am I overlooking something?
I would like to count each unique item in a column, and create a new column with the corresponding count information for each item.
In excel, I would simply use =countif(ColumnA, ColumnA(i)) and copy down.
I expected that to be equally simple in RapidMiner; perhaps using GenerateAttribute. However, this is no "group by" function, and so my count column always returns "1" for each value.
Am I overlooking something?
Tagged:
0
Answers
I feel silly. Its seems a simple solution was to multiply the data, aggregate one thread, then rejoin using the aggregated attribute as the key.
Its still messier than I would have hoped, but perfectly usable, I guess.
Nevertheless, please let me know if I am missing something.
Dortmund, Germany