Survival analysis

arnmayer Member Posts: 3 Contributor I

I started using RapidMiner today and I think it's great!

Specifically, I’m looking for a method to predict cancer patient survival from multiple measurements taken on histological specimens. This can be done with the Python package "DeepSurv" and the R package "randomForestSRC". I know a bit of R, so I got the latter to work, but I am struggling with Python and DeepSurv. DeepSurv may be more accurate, so it would be interesting to compare the results obtained with these (and possibly other) packages.

So, my question is: Has anyone ever implemented a (patient) survival prediction model in RapidMiner?

The difference from the "normal" process is that the model is trained on two target variables: a dichotomous variable, like “churn” in the example database, and a time variable (i.e., the survival time). One does not merely want to know IF someone died but also how long that took, because depending on the time, the “IF” variable can mean very different things. For example, someone may have died, but only after a very long time, which would correspond to a good prognosis.
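To make the two-variable setup concrete, here is a minimal sketch in plain Python of how an event indicator and a time variable are combined in a Kaplan-Meier survival estimate. It is not taken from DeepSurv or randomForestSRC; the function name `kaplan_meier` and the toy data are hypothetical and only illustrate the data layout. Patients with event 0 are assumed to be censored, meaning they were still alive at their last follow-up.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time with at least one death.

    times  -- observed follow-up time per patient
    events -- 1 if the patient died at that time, 0 if censored
    """
    # Number of deaths observed at each distinct time point.
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    survival = 1.0
    curve = []
    for t in sorted(deaths):
        # Patients still under observation just before time t.
        at_risk = sum(1 for u in times if u >= t)
        # Multiply by the conditional probability of surviving past t.
        survival *= 1.0 - deaths[t] / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical toy data: survival time in months and the event indicator.
times = [5, 8, 12, 20, 30]
events = [1, 0, 1, 1, 0]

curve = kaplan_meier(times, events)
for t, s in curve:
    print(f"t={t:>2}  S(t)={s:.3f}")
```

The censored patients (event 0) never count as deaths, but they stay in the at-risk group until their follow-up ends. That is how the time variable changes the meaning of the dichotomous one.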

Any ideas would be very welcome.

Thank you


Best Answer


  • sara20 Member Posts: 110 Unicorn


    I found a paper for your question. :)

    I hope this helps
  • arnmayer Member Posts: 3 Contributor I
    Sara, thank you very much. I'll have a look.

    Kind regards,

  • arnmayer Member Posts: 3 Contributor I
    Brian, thank you very much for this very helpful comment. I will try to connect RapidMiner to the DeepSurv package and will post an update.

    Kind regards,
