Generate Aggregation - Problem with non-existent attributes
I'm trying to build some custom scoring functions, mainly with the "Generate Attributes" operator. Among other things, I need to do calculations on word occurrences. At the moment I have an example set containing all the attributes from a previously created word vector. I want to sum up the occurrences of some similar words and store the result in a new attribute.

As far as I know, this can easily be done with the "Generate Aggregation" operator. Setting the attribute filter type to "regular_expression" and using an expression for those similar words seemed to work well at first. After adding some more expressions, I got this error: "AttributeFactory: cannot create attribute with value type 'attribute_value' (0)!". The message appears whenever a regular expression used for the attribute search does not match any attribute.

Is this behavior intended? Since I don't know whether words matching my regex patterns will be present in the documents to be tested, I would prefer a default value (0 for my sum aggregation). How can I avoid this problem? Do I need to check beforehand whether the desired attributes exist (perhaps with "Select Attributes", somehow counting the size of the resulting attribute set)?

I hope my problem is understandable; it's more of a general question than a process-specific one. I would appreciate any hints and help.
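To make the desired behavior concrete, here is a minimal Python sketch of what I would like the aggregation to do. This is not RapidMiner code: the function `sum_matching`, the sample row, and the patterns are made up for illustration, and I am assuming the regex has to match the whole attribute name.

```python
import re

def sum_matching(row, pattern, default=0):
    """Sum the values of all attributes whose full name matches the pattern.

    Returns `default` instead of failing when no attribute matches.
    """
    regex = re.compile(pattern)
    matched = [value for name, value in row.items() if regex.fullmatch(name)]
    return sum(matched) if matched else default

# Hypothetical word-vector row: attribute name -> occurrence count
row = {"good": 2, "goodness": 1, "bad": 3}

positive = sum_matching(row, r"good.*")      # sums "good" and "goodness"
missing = sum_matching(row, r"excellent.*")  # no match, falls back to the default
```

The point is the fallback in the last line of the function: a pattern without matches yields the default value rather than an error.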
Thanks in advance!