This project analyzes subscription-based business performance using SQL and Python. It reconstructs customer lifecycle from snapshot data and derives key SaaS metrics.
Telco customer dataset (~7K customers)
- SQL (PostgreSQL)
- Python (Pandas, Matplotlib)
- Monthly Recurring Revenue (MRR)
- Customer Churn Rate
- Cohort Retention Analysis
- Average Revenue Per User (ARPU)
- Customer Lifetime Value (LTV)
- Revenue Concentration (Pareto Analysis)
- MRR shows consistent growth driven by customer acquisition
- ARPU declines over time, indicating revenue growth is volume-driven
- Cohort retention appears flat due to snapshot-based reconstruction using tenure
- LTV distribution is right-skewed, with a small group of high-value customers
- Top 20% of customers contribute ~54% of total revenue
/sql→ all SQL transformations/data→ processed datasets/notebooks→ analysis and visualizations/images→ charts for quick preview
- Execute SQL scripts in order (01 → 08)
- Export results as CSV into
/data - Run notebook:
notebooks/analysis.ipynb
This dataset is a snapshot. Customer lifecycle was reconstructed using tenure, which results in flat retention curves until churn events.





