Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
"Kalman Filtering in RapidMiner versus the Alternatives"
Hi
I'm relatively new to Rapidminer and am trying to do the following, which I think needs some sort of Kalman filtering, which I believe is not available directly in Rapidminer, but may be through an R addin.
THE PROBLEM: I'm looking at AIS ship data and trying to identify what cargo is in any oil tanker. I can use the draft of the ship and calculate directly the density of what's in it as I know the volume (Archimedes principle ). However there is noise in this draft observation (hopefully random noise). I can compare observed (noisy) density with a list of say 5 possible cargo densities and calculate probability of each cargo (assuming I know the variance of the error). I then calculate these probabilities historically for lots of ships or different sizes and other parameters making particular A-B journeys. I can then construct a covariance matrix of how these other parameters affect the likelihood of a ship carrying a particular cargo, which perhaps say depends on ship size, journey beginning and endpoints, etc. I probably do some sort of k-cluster classification here also.
I then construct a model based on this covariance matrix , and the observed clustering, of what things, excluding draft, affect the chance of a ship carrying a cargo.
Now I would like to "adjust" the original draft data , assuming my model is correct. i.e. I see a draft of 13.4 metres which means most likely cargo is gasoline, but there is still a reasonable change that cargo is not gasoline but diesel or jet. I then adjust draft observation to 13.6 metres to make Diesel most likely , based on my model which says more ships of this size/journey details, etc carry diesel than Gasoline. Obviously I'm creating a feedback loop here, and I think Kalman filtering is the way to go; however there is no Kalman filtering in Rapidminer.
The question is whether I can adapt any of the existing machine learning algorithms in Rapidminer to achieve the same feedback effect. I somehow doubt it, but wondered if anyone has done something similar.
Thanks in Advance,
Simon
I'm relatively new to Rapidminer and am trying to do the following, which I think needs some sort of Kalman filtering, which I believe is not available directly in Rapidminer, but may be through an R addin.
THE PROBLEM: I'm looking at AIS ship data and trying to identify what cargo is in any oil tanker. I can use the draft of the ship and calculate directly the density of what's in it as I know the volume (Archimedes principle ). However there is noise in this draft observation (hopefully random noise). I can compare observed (noisy) density with a list of say 5 possible cargo densities and calculate probability of each cargo (assuming I know the variance of the error). I then calculate these probabilities historically for lots of ships or different sizes and other parameters making particular A-B journeys. I can then construct a covariance matrix of how these other parameters affect the likelihood of a ship carrying a particular cargo, which perhaps say depends on ship size, journey beginning and endpoints, etc. I probably do some sort of k-cluster classification here also.
I then construct a model based on this covariance matrix , and the observed clustering, of what things, excluding draft, affect the chance of a ship carrying a cargo.
Now I would like to "adjust" the original draft data , assuming my model is correct. i.e. I see a draft of 13.4 metres which means most likely cargo is gasoline, but there is still a reasonable change that cargo is not gasoline but diesel or jet. I then adjust draft observation to 13.6 metres to make Diesel most likely , based on my model which says more ships of this size/journey details, etc carry diesel than Gasoline. Obviously I'm creating a feedback loop here, and I think Kalman filtering is the way to go; however there is no Kalman filtering in Rapidminer.
The question is whether I can adapt any of the existing machine learning algorithms in Rapidminer to achieve the same feedback effect. I somehow doubt it, but wondered if anyone has done something similar.
Thanks in Advance,
Simon
Tagged:
0