Suivez

la liste

Survival analysis is a statistical theory targeted at the study of human lifetime. In particular, relative survival analysis deals with the case of datasets that do not contain the reason of death of the patients, this information (supposed binary: studied cause or other causes) has to be estimated [1,2]. This arises quite frequently in large cancer studies due to, e.g., unreliable data. Most of the implementations of relative survival statistical routines target an interface in R, with some underlying C++, which is suboptimal and, most importantly, hard to maintain and expand in the long run. The main objective of the internship is to develop a new library in the -- rising -- programming language Julia to deal with relative survival analysis problem efficiently. This starts by the re-implementation of standard algorithms, while reviewing the literature, along which a real dataset will be used as a test case. Compatibility with the Julian ecosystem will be key (we will more specifically target the StatsModels.jl API), unlocking the application of bleeding-edge statistical tools (including, e.g., neural networks) to relative survival problems in the future. The communication and publication of our results will be an integral part of the work. This is also a very good opportunity to learn a new programming language, that is gaining traction is academic and in the industry.