Modelling: Ensemble


Modelling: Ensemble

How  do you merge  three models (SVM, Decision tree, ANN)   from  muliple experiments that used  different dataset/database  for each. 


 Dataset A                            DatasetB                     Dataset C

    a                                           x                                    kl

    b                                           y                                    po

    c                                           z                                     t

    d                                          m                                    u

    Decision tree                       SVM                                  ANN

Elite II

Re: Modelling: Ensemble

As long as the models are built on the same underlying superset of data attributes, and they are all trying to predict the same label, the fact that they were built on different datasets doesn't matter. You can simply store the models in the repository (using the "Store" operator) and then apply them all to a new dataset (using "Apply Model") as long as it contains all the individual attributes required.  Note that you'll need to make sure any data preprocessing that was done for the original modeling also is replicated (e.g., any feature engineering or data transformations).  You could then use them as an ensemble model (e.g., using the "Vote" method) for future predictions.


If the models are not built on some unified underlying superset of attributes to predict the same label, then I am not sure exactly what you would mean by "merging" the models.  They would at that point be totally separate models and there really wouldn't be anything to merge.




Brian T., Lindon Ventures - www.lindonventures.com
Analytics Consulting by Certified RapidMiner Analysts