Data Manipulation

ad2045 Member Posts: 5 Newbie
I have the following data:

User ID   Category   Subcategory
02        A          Sub1
02        B          Sub2
03        C          Syb7

I would like to group the data by User ID as follows:

CategoryA (contains the number of subcategories)
CategoryB (contains the number of subcategories)
CategoryC (contains the number of subcategories)

Example output:

User ID   CategoryA   CategoryB

Could someone help by providing the process and parameters needed?

Best Answer


    Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Use the Aggregate operator. It looks like you want to Group By User ID and Category first, with count as the aggregation function (which attribute you count, for example User ID, doesn't really matter). That gives you the counts you want in long format, with one row per User ID and Category combination. To get the layout you describe, Pivot that output: Group By User ID (again) and Index on Category.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
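
For readers who want to see the logic of the Aggregate-then-Pivot approach suggested above, here is a minimal sketch in plain Python. It is not a RapidMiner process, only an analogue of the two steps: the function names `aggregate` and `pivot` are made up for illustration, the rows are the sample data from the question, and filling absent User ID / Category combinations with 0 is an assumption of this sketch (the RapidMiner operators may report those cells differently). Note that it counts rows per group, which equals the number of subcategories only if each subcategory appears once per User ID and Category.

```python
from collections import Counter

# Sample rows from the question: (User ID, Category, Subcategory)
rows = [
    ("02", "A", "Sub1"),
    ("02", "B", "Sub2"),
    ("03", "C", "Syb7"),
]


def aggregate(rows):
    """Step 1: group by (User ID, Category) and count the rows in each group."""
    return Counter((user_id, category) for user_id, category, _ in rows)


def pivot(counts):
    """Step 2: group by User ID and index on Category (one column per category)."""
    categories = sorted({category for _, category in counts})
    users = sorted({user_id for user_id, _ in counts})
    table = []
    for user_id in users:
        row = {"User ID": user_id}
        for category in categories:
            # Assumption of this sketch: combinations that never occur become 0
            row["Category" + category] = counts.get((user_id, category), 0)
        table.append(row)
    return table


result = pivot(aggregate(rows))
for row in result:
    print(row)
```

Each printed dictionary corresponds to one row of the pivoted table, with one key per category column.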