[Solved] Calculation of column sum
Hi All,
I would like to calculate the sum of all values in each row of a dataset where the number of columns can vary. This is a text mining case, so it is not possible to predict the exact number of columns generated by vector creation. I would like to generate a new attribute containing the sum. The dataset could look like this:
term;text1;text2;text3
free;2;3;1
mind;4;2;8
suggest;3;1;1
telephone;1;0;6
The result should be:
term;text1;text2;text3;sum of occ
free;2;3;1;6
mind;4;2;8;14
suggest;3;1;1;5
telephone;1;0;6;7
As I said, the number of texts can vary, so as far as I understand, the Generate Attributes operator cannot do the job.
Thanks in advance!
Chris
Answers
The Generate Aggregation operator should work for this.
Set the attribute filter type parameter to "value_type", the value type parameter to "numeric", and the aggregation function to "sum" so that it only operates on numeric attributes.
regards
Andrew
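For readers who want to see the same logic outside RapidMiner, here is a minimal Python sketch of the idea behind the answer: select columns by value type rather than by name, then sum across each row. It is not how the operator is implemented. The helper names `is_number` and `add_row_sums` are made up for illustration, and the rule "a column is numeric if every value parses as a finite number" is an assumption standing in for the value type filter.

```python
import csv
import io
import math

# Sample data from the question; the number of text columns may vary.
RAW = """term;text1;text2;text3
free;2;3;1
mind;4;2;8
suggest;3;1;1
telephone;1;0;6
"""


def is_number(value):
    """Return True if the string parses as a finite number."""
    try:
        return math.isfinite(float(value))
    except ValueError:
        return False


def add_row_sums(raw, sum_name="sum of occ"):
    """Append a column holding the sum of every numeric column in each row."""
    rows = list(csv.reader(io.StringIO(raw), delimiter=";"))
    header, body = rows[0], rows[1:]
    # Assumption: a column counts as numeric only if all of its values parse
    # as numbers, so columns are chosen by type and not by name or position.
    numeric_cols = [
        i for i in range(len(header))
        if body and all(is_number(row[i]) for row in body)
    ]
    result = [header + [sum_name]]
    for row in body:
        total = sum(float(row[i]) for i in numeric_cols)
        # Show whole numbers without a trailing ".0".
        shown = str(int(total)) if total.is_integer() else str(total)
        result.append(row + [shown])
    return result


for line in add_row_sums(RAW):
    print(";".join(line))
```

Because the numeric columns are detected from the data, the same call works unchanged when vector creation produces more or fewer text columns.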