Beyond Probabilities: How Survival Analysis Enhances Customer Insight and Credit Risk Management
Diego Vallarino, PhD (he/him)
Immigrant ????@???? | Global AI & Data Strategy Leader | Quantitative Finance Analyst | Risk & Fraud ML-AI Specialist | Ex-Executive at Coface, Scotiabank & Equifax | Board Member | PhD, MSc, MBA | EB1A Green Card Holder
Survival analysis, a statistical method that focuses on the time to an event, has proven to be an invaluable tool across various domains, including credit risk analysis, probability of default assessment, and propensity to purchase analysis. Its unique approach provides a deeper layer of insight, complementing traditional probability-based methodologies. Here’s a closer look at why survival analysis is an indispensable complement:
1. Precise Risk Modeling:
In the realm of credit risk analysis and default prediction, the element of time is paramount. Some borrowers may default immediately after acquiring a loan, while others maintain a good payment record for an extended period before encountering financial difficulties. Survival analysis allows for a more nuanced modeling of this temporal aspect, acknowledging that not all credit events unfold simultaneously.
2. Data Censoring:
Real-world data is often imperfect. In some cases, we lack complete information for all customers until a particular event transpires, such as a loan default. The occurrence of such right-censored data challenges traditional analysis methods. Survival analysis, however, excels at accommodating this form of data censoring, ensuring that even partial information can be leveraged effectively for modeling and prediction.
3. Identification of Risk Factors:
Survival analysis empowers risk analysts to identify factors that influence the timing of the event in question. By examining variables like income, credit history, age, and more, we can gain a deeper understanding of how they affect the likelihood of default or propensity to make a purchase, and how these relationships evolve over time.
4. Customer Portfolio Monitoring:
In the context of propensity to purchase analysis, survival analysis proves beneficial in monitoring customer behavior post-acquisition. It helps businesses assess how long customers maintain their relationship with the company before potentially switching to a competitor or canceling a service.
5. Modeling Recurrent Events:
Some events, such as loan defaults, can be recurrent. A customer who defaults once may be more likely to do so again in the future. Survival analysis is well-equipped to model these repeat occurrences, offering a dynamic approach to understanding and predicting these events over time.
So, survival analysis serves as an essential complement to traditional probability-based risk assessment and customer behavior analysis. By considering the temporal dimension, it provides a more comprehensive understanding of risks and customer behaviors, empowering businesses to make informed decisions and develop more effective strategies for risk management and customer retention.
An Example, for 1000 people, considering 10 behavioral variables.
领英推荐
The recommended model depends on the evaluation metric and the specific goals of your project. In the provided summary, various evaluation metrics (MAE, RMSE, and Rsquared) are presented for different models: glm (Generalized Linear Model), tree (decision tree), Random Forest, SVA, and XGBoost. The choice of the model can be based on the metric that is most relevant to your use case. Here are some general considerations:
Model selection also depends on other factors such as interpretability, model complexity, and available computational resources. Here are some general recommendations based on the provided metrics:
Ultimately, it’s important to consider the needs of your project and the metrics that are most relevant to your application before choosing the right model.
So far, this is an analysis model for statistical inference and traditional prediction modeling (I didn′t show it in the example). Whether it's for a credit scoring model, propensity model, churn model, purchase prediction model, recommendation model, breakage model, asset value prediction model, among others. However, it is not yet finished.
Now, let's complete the analysis for better decision making.
2. Calculate the Moment of Default or Propensity.
Based on the output of the survival regression analysis (MTLR in this case), we can draw some general conclusions:
Therefore, now you not only know the Probability of Default or Prepension (and the variables that have the most weight for that default), but we also have the time in days for the Default or Purchase to occur. In addition to knowing which variables affect positively and negatively.
This is explained in my new book.