Understanding the Factors Influencing the Timing of Product Purchases: Analysis of Key Variables
Diego Vallarino, PhD (he/him)
Immigrant ????@???? | Global AI & Data Strategy Leader | Quantitative Finance Analyst | Risk & Fraud ML-AI Specialist | Ex-Executive at Coface, Scotiabank & Equifax | Board Member | PhD, MSc, MBA | EB1A Green Card Holder
1. Introduction
One of the most important questions that retailers like 家乐福 沃尔玛 but also Scotiabank Banco Sabadell Banco Bci Santander Banco Ripley Chile among many others, ask themselves is not only whether or not a person will buy a certain basket of products (yes/no), or what will be the product they will buy after buying certain products (distance), but to know WHEN you will buy the product. This will mean that the definition of supply, positioning, stock, price, and promotion strategies, among other implications, can be defined with more information.
In this case we have worked with a particular dataset. This dataset has been provided by a real company and contains information necessary for conducting an analysis on the time until a purchase of a certain basket of products.
The dataset consists of 10 variables, each of which contributes to understanding the factors influencing the timing of a purchase. In this analysis, we will explore the components of each variable and examine their potential impact on the time to purchase.
For that, we will use advanced machine learning models and microeconomics principles to understand when a certain basket of products is purchased.
2. Variable Components?:
By analyzing these variables and their components, we aim to gain insights into the factors that contribute to the timing of a purchase. Understanding these factors can assist businesses in developing targeted marketing strategies, optimizing promotional activities, and improving customer satisfaction to enhance the overall sales performance and profitability.
ggsurvplot(survfit(Surv(time,status) ~ OnlineBehavior + Gender, data = df)
ggsurvplot(survfit(Surv(time,status) ~ OnlineBehavior + Location, data = df)))
领英推荐
bst <- xgboost(data = sparse_matrix, label = output_vector, max.depth = 4, eta = 1, nthread = 2, nrounds = 10
importanceRaw <- xgb.importance(feature_names = sparse_matrix@Dimnames[[2]], model = bst, data = sparse_matrix, label = output_vector)
xgb.plot.importance(importance_matrix = importanceRaw, main = "Variables Importance - SHAP", xlab = "Importance"))
3. Economic and Statistical Analysis
In the figure above, the variables that have the most weight to define the moment of a purchase are:
These three variables present more significant weight coefficients in the model, which implies that they are important factors in determining the moment of a purchase. However, it is important to note that the relative importance of these variables may vary in different contexts and for different data sets.
4. What about PromotionDiscount??
The variable “ PromotionsDiscounts “ has a negative coefficient of -0.02363. This indicates that promotions and discounts have some influence on the time of purchase, but in this case, the negative coefficient suggests that the presence of promotions and discounts is associated with a later purchase time.
This could be interpreted in different ways. For example, promotions and discounts could be strategically designed to encourage purchases at specific times, such as periods of low demand, or to drive sales of slower-moving products. Therefore, customers may wait for these offers to appear before making their purchases.
It is important to note that the relative weight of the “ PromotionsDiscounts “ variable in the model may depend on other factors and the specific retail context?. In some cases, promotions and discounts may have a more significant impact at the time of purchase, while in other cases, other factors may have a stronger influence. It is necessary to evaluate the model in conjunction with other analyzes to obtain a more complete and accurate picture of the variables that affect the moment of purchase.
About these models and how to apply them to the design of data-based products, will be my participation next September 11 in NYC at the "Customer Analytics Summit". I leave the link here