Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Measuring Citizen's Engagement in community clubs

jmoses14jmoses14 Member Posts: 6 Learner III
edited December 2018 in Help

I'm new to RapidMiner but willing to learn. I need help with step by step analysis of how to measure citizens engagement in various community clubs in rapidminer. I have the data but don't know how to analyze it. Data attached  for simple step by step. Thanks

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @jmoses14 based on what I see from this data, you might want to use a Linear Regression model. Have you tried creating a process using the Read CSV operator to load your data in? That would be helpful to share.

  • jmoses14jmoses14 Member Posts: 6 Learner III

    Thanks Thomas. I've loaded the data but I try using corelation matrix. So I should change it to Linear Regression. Once I selected LR, what are my first 3 to 4 steps towards analyzing the data. I'm totally novice please. Thanks for your help

    Jon

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @jmoses14 as a rule, I don't open DOCX files. Can you post the XML of your process using the </> button AND following this KB article? https://community.rapidminer.com/t5/RapidMiner-Studio-Knowledge-Base/How-can-I-share-processes-without-RapidMiner-Server/ta-p/37047


    Thanks,

  • jmoses14jmoses14 Member Posts: 6 Learner III
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager

    hello @jmoses14 welcome to the community! Some quick recommendations for you:
    • Post your XML process here in this thread (see https://youtu.be/KkgB5QXWXJ8 and "Read Before Posting" on right when you reply)
    • Attach your dataset if possible (use a fictionalized version if there are privacy concerns)
    • Make sure you have all necessary extensions installed (see https://youtu.be/pjBqG3xtXx4)

    Scott

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @jmoses14 You could do something like what's attached. To fully use the Correlation Matrix you'd have to convert the nominal values to numericals. 

     

    Just for fun, I chose the elapsed time as your label and used a GLM model.

     

    <?xml version="1.0" encoding="UTF-8"?><process version="8.1.001">
    <context>
    <input/>
    <output/>
    <macros/>
    </context>
    <operator activated="true" class="process" compatibility="8.1.001" expanded="true" name="Process">
    <process expanded="true">
    <operator activated="true" class="read_csv" compatibility="8.1.001" expanded="true" height="68" name="Read CSV" width="90" x="112" y="34">
    <parameter key="csv_file" value="C:\Users\TomOtt\Downloads\citizensengagement.csv"/>
    <parameter key="column_separators" value=","/>
    <parameter key="first_row_as_names" value="false"/>
    <list key="annotations">
    <parameter key="0" value="Name"/>
    </list>
    <parameter key="encoding" value="windows-1252"/>
    <list key="data_set_meta_data_information"/>
    </operator>
    <operator activated="true" class="nominal_to_numerical" compatibility="8.1.001" expanded="true" height="103" name="Nominal to Numerical" width="90" x="246" y="34">
    <parameter key="attribute_filter_type" value="value_type"/>
    <list key="comparison_groups"/>
    </operator>
    <operator activated="true" class="multiply" compatibility="8.1.001" expanded="true" height="103" name="Multiply" width="90" x="380" y="34"/>
    <operator activated="true" class="set_role" compatibility="8.1.001" expanded="true" height="82" name="Set Role" width="90" x="447" y="340">
    <parameter key="attribute_name" value="Elapsed_Time"/>
    <parameter key="target_role" value="label"/>
    <list key="set_additional_roles"/>
    </operator>
    <operator activated="true" class="concurrency:cross_validation" compatibility="8.1.001" expanded="true" height="145" name="Validation" width="90" x="648" y="289">
    <process expanded="true">
    <operator activated="true" class="h2o:generalized_linear_model" compatibility="7.6.001" expanded="true" height="124" name="Generalized Linear Model" width="90" x="179" y="34">
    <parameter key="remove_collinear_columns" value="true"/>
    <list key="beta_constraints"/>
    <list key="expert_parameters"/>
    </operator>
    <connect from_port="training set" to_op="Generalized Linear Model" to_port="training set"/>
    <connect from_op="Generalized Linear Model" from_port="model" to_port="model"/>
    <portSpacing port="source_training set" spacing="0"/>
    <portSpacing port="sink_model" spacing="0"/>
    <portSpacing port="sink_through 1" spacing="0"/>
    </process>
    <process expanded="true">
    <operator activated="true" class="apply_model" compatibility="8.1.001" expanded="true" height="82" name="Apply Model" width="90" x="45" y="34">
    <list key="application_parameters"/>
    </operator>
    <operator activated="true" class="performance" compatibility="8.1.001" expanded="true" height="82" name="Performance" width="90" x="179" y="34"/>
    <connect from_port="model" to_op="Apply Model" to_port="model"/>
    <connect from_port="test set" to_op="Apply Model" to_port="unlabelled data"/>
    <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
    <connect from_op="Performance" from_port="performance" to_port="performance 1"/>
    <connect from_op="Performance" from_port="example set" to_port="test set results"/>
    <portSpacing port="source_model" spacing="0"/>
    <portSpacing port="source_test set" spacing="0"/>
    <portSpacing port="source_through 1" spacing="0"/>
    <portSpacing port="sink_test set results" spacing="0"/>
    <portSpacing port="sink_performance 1" spacing="0"/>
    <portSpacing port="sink_performance 2" spacing="0"/>
    <description align="left" color="blue" colored="true" height="103" resized="true" width="315" x="38" y="137">The model created in the Training step is applied to the current test set (10 %).&lt;br/&gt;The performance is evaluated and sent to the operator results.</description>
    </process>
    </operator>
    <operator activated="true" class="concurrency:correlation_matrix" compatibility="8.1.001" expanded="true" height="103" name="Correlation Matrix" width="90" x="648" y="34"/>
    <connect from_op="Read CSV" from_port="output" to_op="Nominal to Numerical" to_port="example set input"/>
    <connect from_op="Nominal to Numerical" from_port="example set output" to_op="Multiply" to_port="input"/>
    <connect from_op="Multiply" from_port="output 1" to_op="Correlation Matrix" to_port="example set"/>
    <connect from_op="Multiply" from_port="output 2" to_op="Set Role" to_port="example set input"/>
    <connect from_op="Set Role" from_port="example set output" to_op="Validation" to_port="example set"/>
    <connect from_op="Validation" from_port="model" to_port="result 4"/>
    <connect from_op="Validation" from_port="performance 1" to_port="result 5"/>
    <connect from_op="Correlation Matrix" from_port="example set" to_port="result 1"/>
    <connect from_op="Correlation Matrix" from_port="matrix" to_port="result 2"/>
    <connect from_op="Correlation Matrix" from_port="weights" to_port="result 3"/>
    <portSpacing port="source_input 1" spacing="0"/>
    <portSpacing port="sink_result 1" spacing="0"/>
    <portSpacing port="sink_result 2" spacing="0"/>
    <portSpacing port="sink_result 3" spacing="0"/>
    <portSpacing port="sink_result 4" spacing="0"/>
    <portSpacing port="sink_result 5" spacing="0"/>
    <portSpacing port="sink_result 6" spacing="0"/>
    </process>
    </operator>
    </process>
  • jmoses14jmoses14 Member Posts: 6 Learner III

    Hello Thomas. It felt as if I was looking at magic. How I wish I could just transformed what you did to arrived at my desired results. What can I do to take advantage of what you sent me and how do I begin? Are you availab;le or know someone as a tutor locally in Atlanta? Thanks

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @jmoses14 I do provide consulting services, but I'm in NJ. You can PM directly if you like. 

     

    With respect to using the XML process, you have copy and paste the XML code into the XML panel in Studio. Just go to View > Show Panel > XML, the paste it in there and hit the green check mark.

  • jmoses14jmoses14 Member Posts: 6 Learner III

    Thanks Thomas. Good news. Yes, I'll like to follow up with you. Please message me on how to contact you


    @Thomas_Ott wrote:

    @jmoses14 I do provide consulting services, but I'm in NJ. You can PM directly if you like. 

     

    With respect to using the XML process, you have copy and paste the XML code into the XML panel in Studio. Just go to View > Show Panel > XML, the paste it in there and hit the green check mark.



    . Thanks

    Jon

     

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @jmoses14 just shot you a private message via the Community. Hope to hear from you soon!

Sign In or Register to comment.