Cyclistic Bike Share Analysis - Using SQL and Tableau
Samarjeet Singh Arora
Microsoft Certified Data Analyst [SQL, Power BI, Excel, Tableau, Python] | Google Data Analytics | Turning Data into Actionable Insights
I am excited to present to you my first case study as a Data Analyst, where I worked with a realistic dataset and gained some valuable insights about the business and its customers.
Related Links
Table of Contents
Introduction
The Cyclistic Bike Share case study is part of a Google Data Analytics Certification Hosted on Coursera. I completed this Case study as a Capstone Project for the Certification.
The data used in this project is publicly available here under this license from Divvy Bikes. I used the latest available datasets of the past 12 months, which were from December 2022 to November 2023. The data consists of more than 5 Million rows and 13 Columns. I have the Ride ID, Type of bike, the start/end station names, their latitude, longitude and IDs and the Member Type (Casual or Annual)
Objective
Cyclistic is a Bike Share service in Chicago. As a Junior Data Analyst, working in a Bike Sharing Company, my objective was to find how the Casual members and the Annual members use the Bike Share service differently and provide 3 recommendations on how we can covert Casual members to Annual members as Annual members are more profitable for the company.
Tools used:
Preparation
The first thing I did was to download the datasets of the past 12 months, which were available in CSV format. I imported all the files into Dbeaver.
After analyzing all the columns and selecting appropriate data types for them, I created a table using PostgreSQL and merged all the downloaded datasets to the new table that I called "Cyclistic_2023_Bikeshare_Data".
Process
ALTER TABLE cyclistic_2023_bikeshare_data
DROP COLUMN start_station_id,
DROP COLUMN end_station_id;
delete from cyclistic_2023_bikeshare_data cbd where start_Station_name ='' or end_station_name =''
领英推荐
delete from cyclistic_2023_bikeshare_data cbd where started_at > ended_at
ALTER TABLE cyclistic_2023_bikeshare_data
ADD ride_length interval;
UPDATE cyclistic_2023_bikeshare_data
SET ride_length = ended_at-started_at
delete from cyclistic_2023_bikeshare_data WHERE EXTRACT(EPOCH FROM ride_length) < 60;
Analysis
Here are the findings I derived from the processed data:
Recommendations
Below are the 3 Recommendations that can help to convert Casual Riders into Annual members:
Conclusion
Cyclistic Bike Share was my first Data Analytics Case Study and I feel very confident after completing it as well as the Google Data Analytics Certification.
During the case study, I watched a lot of YouTube tutorials and did many Google searches to gather additional knowledge and to make sure I had a comprehensive understanding of the tools and techniques needed.
Starting from raw data of over 5 Million Rows and processing it to finally gain key insights about consumer behaviour was a valuable experience in solidifying my understanding of data analytics.