Feature Request: Visualization to drag/drop simple summary table

btibert Member, University Professor Posts: 146 Guru
edited September 2019 in Help
When thinking about EDA, and more specifically about teaching the concept of really diving into the data, sometimes it's really nice to simply drag and drop a few attributes onto rows/columns and either create counts or summarize a single variable.

The count/aggregation could be as simple as one per table, but in the ideal state each cell could show the row count plus another statistic (row/column/total %, or an aggregation such as the average).

Margin totals (row/column/total) would help reinforce the question of "what are we looking at" and help students think about how data can take various shapes, especially when thinking about aggregates and summaries.

I am aware we can code something up using a variety of operators, but admittedly, it's nice to just drag and drop some columns while exploring and hypothesizing about a dataset that you have never seen before (e.g., Tableau-style "table" explorations).
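
For readers who want to see what such a summary table computes, here is a minimal sketch in Python with pandas. It is not a RapidMiner feature, and the toy data and column names are made up purely for illustration. It builds a count table with margin totals, plus row percentages:

```python
import pandas as pd

# Hypothetical toy data; the column names are assumptions for illustration only.
df = pd.DataFrame({
    "Gender": ["female", "male", "female", "male", "female", "male"],
    "Future Customer": ["yes", "no", "no", "yes", "yes", "yes"],
    "Age": [25, 40, 31, 52, 47, 36],
})

# Row counts per cell, with row, column, and grand totals in the margins.
counts = pd.crosstab(
    df["Future Customer"],
    df["Gender"],
    margins=True,
    margins_name="Total",
)

# Row percentages: each cell as a share of its row, so every row sums to 1.
row_pct = pd.crosstab(df["Future Customer"], df["Gender"], normalize="index")

print(counts)
print(row_pct)
```

Switching `normalize` to `"columns"` or `"all"` gives column or total percentages instead.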

Answers

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,529 RM Data Scientist
    Hi,
    Did you see 'Extract Statistics'? It's not drag and drop, but it gives you an overview.
    Best,
    Martin

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • btibert Member, University Professor Posts: 146 Guru
    Thanks, I hadn't used that extension/operator yet. That said, if I were using the built-in Deals dataset, I might want to put Gender in the columns and Future Customer in the rows, and have the intersection (crosstab) be the average age and/or the count of rows. The Extract Statistics result is output by column, which is nice, but it is also something we can mostly get from the typical results output > stats. Thanks for the idea though!
  • sgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
    Yes @mschmitz, my thoughts exactly. Try Turbo Prep.

    Scott
  • btibert Member, University Professor Posts: 146 Guru
    Yep, that'll do it!  Thanks
  • IngoRM Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Just if others want to check out Turbo Prep's Pivot quickly, here is a link to a video showing it: https://academy.rapidminer.com/courses/turbo-prep-data-pivoting
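
For anyone following the thread outside of Turbo Prep, the crosstab asked for earlier (Gender across the columns, Future Customer down the rows, average age and row count in each cell) can be sketched in Python with pandas. This is only an illustration: the rows below are made-up stand-ins rather than the actual Deals sample data, and the attribute names are assumed to match those mentioned in the thread.

```python
import pandas as pd

# Hypothetical stand-in rows; NOT the real Deals sample data.
deals = pd.DataFrame({
    "Gender": ["female", "male", "female", "male", "female", "male"],
    "Future Customer": ["yes", "no", "no", "yes", "yes", "yes"],
    "Age": [25, 40, 31, 52, 47, 36],
})

# One pivot with two statistics per cell: the mean of Age and the row count.
# The result has two-level columns: (statistic, Gender).
summary = deals.pivot_table(
    index="Future Customer",
    columns="Gender",
    values="Age",
    aggfunc=["mean", "count"],
)

print(summary)
```

Selecting `summary["mean"]` or `summary["count"]` returns a plain Future Customer by Gender table for that one statistic.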