The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

# How to calculate predictive weights?

Member Posts: 4 Newbie
in Help
Hello!
I am using multiple variables (rainfall, temperature, wind speed, slope, distance to settlements, etc) for predicting fire events using different machine-learning algorithms. In the below-given example, I have used Random Forest to do so. Using 'Explain Predictions" this result was generated. In Support Prediction, we have Slope = 6.054 (0.378); Agriculture = 0 (0.074) where 6.054 and 0 are respective values and 0.378 & 0.074 are predictive weights.
Is there any method by which we can calculate the cumulative values of these predictive weights that can be converted to an equation?
I have to input these weights into another software (ArcGIS) to analyze raster data. Is there any other way to use this model on raster images?

• Options
Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
Solution Accepted
Hi,

Longer Answer: If we talk about explainable ai we often talk about local influence factors. They basically answer the question "if this value would not be what it is, how big would be the impact on the prediction?" and then the higher the value, the more impactful the column is.

It is important to know, that these are _local_ ones. Those influence factors differ, as you see, for one example compared to the other. This is a feature of the method. You can check the wei port of Random Forest to get global influence factors. But then again only because you have a general pattern it does not mean that this is also present locally.

If you want to learn more about these methods i can recommend this ebook: https://christophm.github.io/interpretable-ml-book/ . It's the source for what I wrote in the Interpretations extension.

Let me turn the question around a bit: Why do you need an equation? You have the random forest model, which is a function.

Best,
Martin

- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany

• Options
Member Posts: 4 Newbie

Thank you, Martin.

I am currently working in the field of Geospatial Technology, focusing on satellite imagery analysis. For GIS-based analysis and prediction, we utilize tools such as ArcGIS and R. While R allows us to import raster data and generate raster outputs, it struggles with larger geographical areas. On the other hand, ArcGIS provides similar functionalities but is less flexible when it comes to optimization.

Therefore, it's more convenient to work with vector data in the form of points. However, we still require outputs in the raster format. To apply algorithms across all pixels of a satellite image or raster data we need to convert it into some form.

For example, in the case of logistic regression, where coefficient values are calculated for different variables to form an equation.

As you mentioned about functions, is there any way to determine the function for this scenario?

• Options
Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
Hey,
that won't work with Random Forest. While you can export them (as json), you will not be able to execute them natively in R. The usual way to do it would be to use a Real Time Scoring agent and put the scoring process into this. R can then query the Random Forest super quickly using a Werbservice call.

Is this a commercial request or academia?

Best,
Martin
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany
• Options
Member Posts: 4 Newbie