RapidMiner 9.8 Beta is now available

Be one of the first to get your hands on the new features. More details and downloads here:

GET RAPIDMINER 9.8 BETA

Survival analysis

arnmayerarnmayer Member Posts: 3 Contributor I

I started using RapidMiner today and I think it's great!

What I’m looking for specifically is a method for the prediction of cancer patient survival based on multiple measurements from histological specimens. This can be done in Python’s "DeepSurv" and R’s "randomForestSRC" packages. I know a bit of R, so I got the latter to work but I struggle with Python and DeepSurv. DeepSurv may be more accurate. It would be interesting to compare the results obtained with these (and possibly other) packages. 

So, my question is: Has anyone ever implemented a (patient) survival prediction model in RapidMiner?

The difference to the "normal" process is that one uses two variables to train the model on. One dichotomous variable, like “churn” in the example database, and a time variable (i.e., the survival time). One does not merely want to know IF someone died but also how long that took because depending on the time, the “IF-variable” can mean totally different things. E.g., someone died, but only after a very long time. That would obviously correspond to a good prognosis.

Any ideas would be very welcome.

Thank you

Arnulf

Best Answer

Answers

  • sara20sara20 Member Posts: 110   Unicorn
    @arnmayer

    Hello

    I find a paper for your question. :)
    https://ieeexplore.ieee.org/abstract/document/8080031

    I hope this helps
    Sara
    arnmayer
  • arnmayerarnmayer Member Posts: 3 Contributor I
    Sara, thank you very much. I'll have a look.

    Kind regards,

    Arnulf
    sara20
  • arnmayerarnmayer Member Posts: 3 Contributor I
    Brian, thank you very much for this very helpful comment. I will try to make the connection between RapidMiner and the DeepSurv package. Will give an update.

    Kind regards,

    Arnulf
Sign In or Register to comment.