Y. Afisha Project: Revenue and Cohort Analysis Insights
The Y. Afisha Project delves into understanding revenue trends and customer behavior through comprehensive cohort analysis. This project aims to uncover key insights that can guide business strategy, particularly in enhancing customer retention and boosting revenue growth. By analyzing patterns over time and segmenting customers into cohorts based on their purchase behavior, the project sheds light on which strategies have the most significant impact on sustaining long-term business performance.
With a data-driven approach, the Y. Afisha Project helps stakeholders identify strengths and areas for improvement in customer engagement and revenue management. This analysis not only reveals how revenue evolves but also highlights the importance of retention strategies for maintaining a stable customer base.
Data Preparation
A solid data preparation phase was essential for the success of the Y. Afisha Project. This process involved meticulous cleaning and transformation to ensure the accuracy and reliability of the analysis. Key steps taken included:
Data Preparation with Three Main DataFrames
A.- Visits DataFrame (`visitsdf`):
Contains server log data detailing user visits to Y. Afisha's platform from January 2017 to December 2018.
Key columns include user ID (`uid`), device type, start and end timestamps of the visit, and source IDs.
Actions taken:
B.- Orders DataFrame (`ordersdf`):
Captures user orders, with columns such as user ID (`uid`), purchase timestamp (`buy_ts`), and revenue.
Actions taken:
C.- Costs DataFrame (`costsdf`):
Contains marketing costs associated with different dates, representing the expenses incurred by the platform for promotional activities.
Key steps:
Cohort Assignment and Merging
These dataframes were merged and aligned to enable a cohesive analysis:
These steps in data preparation set a strong foundation for analyzing revenue, user behavior, and cohort retention.
Key Findings and Analysis
After preparing and merging the three main dataframes—`visitsdf`, ordersdf, and costsdf—we uncovered significant insights into revenue trends, cohort behavior, and the impact of marketing expenses. Below are the main findings:
Revenue Trends Over Time
The analysis of revenue from January 2017 to December 2018 provided the following insights:
Revenue Distribution by Source and Device
The analysis of revenue by source and device provided valuable insights into user behavior and the contribution of different traffic sources. The visualization highlights the following:
Context and Data Interpretation:
Strategic Insights:
This analysis demonstrates how understanding revenue distribution across different sources and devices can inform data-driven marketing and platform strategies.
Cost Distribution by Source ID
The Cost Distribution by Source ID visualization reveals how marketing resources are allocated:
This data highlights the need to assess the ROI for high-cost sources and explore optimization strategies for lower-cost channels.
Revenue Distribution by Source ID
The Revenue Distribution by Source ID visualization provides insights into how different sources contribute to overall revenue. Here’s a concise analysis:
This distribution showcases the reliance on specific sources for revenue generation and highlights areas where efforts could be concentrated for maximizing returns.
领英推荐
Cumulative Revenue by Source ID
The Cumulative Revenue by Source ID visualization illustrates how revenue accumulates over time from different sources. Here’s a concise analysis:
This data indicates that focusing on the top-performing sources, particularly Source IDs 1 and 2, can optimize revenue strategies, while further investigation into the lower-performing sources could reveal opportunities for improvement.
Number of Users by Source ID
The Number of Users by Source ID visualization provides valuable insights into user acquisition across different marketing channels. Here’s the analysis:
The data highlights the importance of focusing on high-performing sources for user acquisition, which can lead to increased revenue potential.
Customer Retention and Churn Analysis
In this section, we focus on understanding customer retention rates and identifying factors contributing to churn across different cohorts. By analyzing how users interact with the platform over time, we can derive actionable insights to enhance customer engagement and loyalty.
Retention Rates by Cohort
The cohort analysis provided key insights into retention trends:
The Customer Retention Matrix reveals that October 2017 had the highest retention with 4,335 users, indicating strong engagement from that cohort. However, retention declines for newer cohorts, suggesting decreased engagement. The darker shades in the matrix represent higher user counts, quickly highlighting periods of strong retention. This analysis underscores the need for targeted strategies to maintain engagement with newer customer cohorts.
The Monthly Average Revenue by First Order Month matrix provides insights into average revenue generated from customers based on their first order month. Here are the key insights:
The highest average revenue appears in October 2017, at 26.8, indicating that customers who made their first purchase in that month tended to spend significantly more compared to other months.
In contrast, Source ID 10 shows the lowest average revenue, reflecting limited engagement or spending by users from that cohort.
The matrix also indicates a general trend where customers from earlier months tend to have higher average revenue, suggesting that initial engagement strategies were more effective during that period.
The Average Customer Purchase Size matrix reveals insights into the average spending behavior of customers based on their first order month and their cohort lifetime. Here are the key insights:
The highest average purchase size occurs for customers who made their first purchase in October 2017, reaching 26.8. This suggests that users acquired during this month tend to spend more than other cohorts, possibly due to effective marketing strategies or promotions.
As we move down the matrix, there is a general decline in average purchase sizes for newer cohorts, indicating that retention and purchase behavior may not be as strong for customers acquired later.
In particular, customers with a cohort lifetime of 0 months (i.e., their first purchase month) show varying average sizes but tend to be higher in the early months, suggesting initial engagement is crucial for spending behavior.
Churn Analysis
Understanding the factors that contribute to customer churn is critical for formulating retention strategies:
The Cohorts: User Retention matrix provides crucial insights into how user retention varies across different cohorts based on their first activity month. Here are the key insights:
This matrix shows retention rates for various cohorts over their lifetime. For instance, the cohort that first engaged in June 2017 retains 7.9% of users by the end of the first month, dropping to 4.5% by the end of the 11th month.
The October 2017 cohort displays strong retention, maintaining 7.9% after one month but shows a gradual decline, reflecting the need for consistent engagement strategies.
The overall trend indicates that retention rates decline significantly as the months progress for all cohorts, highlighting a common challenge in maintaining user engagement over time.
The Cohorts: Cancel Rates matrix provides insights into the cancellation behavior of customers based on their first activity month. Here are the key insights:
The highest cancellation rate appears in January 2018, reaching 55.4%. This indicates that customers who started engaging with the platform during this period were more likely to cancel their accounts.
As we examine earlier cohorts, we see a gradual decline in cancellation rates, particularly for those who first engaged in June 2017, which has a low cancellation rate of 0.0%.
The matrix reveals a pattern where newer cohorts tend to experience higher cancellation rates over time, suggesting a challenge in retaining users acquired during those months.
Strategic Recommendations
To address the insights gained from the retention and churn analysis, the following strategies are recommended:
These recommendations underscore the necessity of proactive strategies to enhance user loyalty and support sustained revenue growth.
Conclusion
The analysis of user behavior, retention, churn, and revenue trends for Y. Afisha has provided valuable insights into the dynamics of customer engagement and profitability. Key findings highlighted the importance of understanding cohort performance, the relationship between marketing spend and revenue, and the varying behaviors of users across different sources and devices.
The recommendations outlined—such as enhancing user engagement through personalized marketing, establishing robust feedback mechanisms, and targeting retention campaigns—are crucial for addressing the challenges of declining engagement and churn rates. By refining marketing strategies and focusing on high-performing sources, Y. Afisha can leverage its data to foster user loyalty and drive sustained revenue growth.
Ultimately, a proactive and data-driven approach will empower Y. Afisha to adapt to changing user behaviors and market conditions, ensuring long-term success and profitability.
Click here to explore the complete analysis and technical details of the Y. Afisha Project by visiting the dedicated GitHub repository. Here, you’ll find the full Jupyter Notebook with code, visualizations, and comprehensive explanations of the methodologies and insights derived throughout the project.
https://github.com/ricardosillercardenas/ricardo_siller_da_projects/blob/main/Y.Afisha_Project.ipynb