The RapidMiner community is on readonly mode until further notice. Technical support via cases will continue to work as is. For any urgent licensing related requests from Students/Faculty members, please use the Altair academic forum here.
How to identify critical parameters in a dataset?
Hi,
Say that you have a dataset of 10 different parameters (listed in 10 columns) to describe a phenomenon (listed in the 11 column). What would be a procedure to identify which of the 10 parameters are critical to predicting the phenomenon on hand (i.e. maybe 45 parameters are need to properly predict the phenomenon, how to go around figuring out what are these parameters?).
Thanks,
Say that you have a dataset of 10 different parameters (listed in 10 columns) to describe a phenomenon (listed in the 11 column). What would be a procedure to identify which of the 10 parameters are critical to predicting the phenomenon on hand (i.e. maybe 45 parameters are need to properly predict the phenomenon, how to go around figuring out what are these parameters?).
Thanks,
Tagged:
0
Best Answers

varunm1 Member Posts: 1,207 UnicornHi @mzn
Auto model will provide you with attributes (Parameters) that support the prediction. If you don't have access to automodel, you can also see the correlation of these attributes to output using the correlation matrix operator. You can look at some feature selection technique operators like optimize selection, forward selection
If you are trying to predict using different algorithms like a decision tree, SVM, etc., you can use explain predictions operator to see which attributes are helpful in predictions for a particular algorithm
ThanksRegards,
Varun
https://www.varunmandalapu.com/Be Safe. Follow precautions and Maintain Social Distancing
8 
IngoRM Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM FounderHi,can I have access to/figure out the actual relationship/mathematical model used to predict future values?
The respective models are used for this. While for some of those models like linear regression (a formula) or a decision tree (nested ifthen statements) a mathematical representations can be derived, this is not general the case. Especially for the more complex models (and therefore often also more accurate models) like GBT or Neural Nets there is a close to zero chance to transform them into a humanreadable format. That is the reason why we have tools like the Simulator or Explain Predictions in the first place  so that you can build trust in what the models are doing even without such an explicit formula.
Best,
Ingo1
Answers
@IngoRM can you provide any suggestions on this.
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing