How can i select ,use exploratory data analysis for maximum, minimum values, standard deviation ,

u1125362 · April 2020

I am confused about exploratory key characteristics of each variable in housing.csv set such as maximum, minimum values, average, standard deviation, most frequent values (mode), missing values and invalid values etc. ,Discuss key results of exploratory data analysis presented in Table and provide a rationale for selecting top 5 variables for predicting median house value (medv), in particular focusing on the relationships of independent variables with each other and with dependent variable median house value (medv) drawing on results of EDA analysis and relevant literature on determinates of house prices

hbajpai · April 2020

Hey @u1125362 ,

You can use RapidMiner Correlation Matrix operator to visualize the relationship of attributes and label. It look like below.

Image: https://us.v-cdn.net/6030995/uploads/editor/ww/p452tcvtcrqp.png

As far as selecting the top 5 variables is concerned you can use couple various models with explain predictions operator to see model specific dependencies on attributes. Another way would be to utilize weight by correlation operator which looks like below figure. There are other weight based operators you can experiment with in Studio.

Image: https://us.v-cdn.net/6030995/uploads/editor/2s/mdor27dl9zap.png

As far as summary statistics goes you can check them out with the Statistics tab on your raw data import.

Image: https://us.v-cdn.net/6030995/uploads/editor/n1/avn06a51rtaa.png

I hope this helps.

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

How can i select ,use exploratory data analysis for maximum, minimum values, standard deviation ,

Best Answer