Case Study: How Does a Bike-Share Navigate Speedy Success?
Jazzrielle Swann
Certified Data Analyst| QBO Certified ProAdvisor| Studio Recording Artist
Quick Links
Scenario
I am a junior data analyst working on the marketing analyst team at Cyclistic, a bike-share company in Chicago. The director of marketing believes the company’s future success depends on maximizing the number of annual memberships. Therefore, my job is to understand how casual riders and annual members use Cyclistic bikes differently. I will present my findings to stakeholders to determine effective marketing strategies to convert casual riders into paid members.
About Cyclistic
In 2016, Cyclistic launched a successful bike-share offering. Since then, the program has grown to a fleet of 5,824 bicycles that are geotracked and locked into a network of 692 stations across Chicago. The bikes can be unlocked from one station and returned to any other station in the system at any time.
Until now, Cyclistic’s marketing strategy relied on building general awareness and appealing to broad consumer segments. One approach that helped make these things possible was the flexibility of its pricing plans: single-ride passes, full-day passes, and annual memberships. Customers who purchase single-ride or full-day passes are referred to as casual riders. Customers who purchase annual memberships are Cyclistic members.
Cyclistic’s finance analysts have concluded that annual members are much more profitable than casual riders. Although the pricing flexibility helps Cyclistic attract more customers, Moreno believes that maximizing the number of annual members will be key to future growth. Rather than creating a marketing campaign that targets all new customers, Moreno believes there is a solid opportunity to convert casual riders into members. She notes that casual riders are already aware of the Cyclistic program and have chosen Cyclistic for their mobility needs.
Moreno has set a clear goal: Design marketing strategies aimed at converting casual riders into annual members. In order to do that, however, the team needs to better understand how annual members and casual riders differ, why casual riders would buy a membership, and how digital media could affect their marketing tactics. Moreno and her team are interested in analyzing the Cyclistic historical bike trip data to identify trends.
Key Stakeholders
Ask
There are three questions we need to answer for this analysis:
For now, I will answer the first question of how annual members and casual riders use Cyclistic bikes differently.
Prepare
The first thing I did was download 12 monthly files from January 2023-December 2023 from the divvy-tripdata index to analyze historical data and identify trends. The data is available due to this license. There is a limitation due to data privacy issues that prohibits certain information, such as personal identification information (PII) of riders or members, such as credit card numbers. The files were downloaded to a special folder for the raw data and eventual clean data.
Dataset observations:
The datasets are stored as downloadable .csv files. Opening the dataset in Excel, I observed that the dataset contains 13 columns, which include the ride ID, the type of bike used, start and end dates and times, station names and IDs, as well as the member type.
The files had missing blank cells in the station_name and station_id columns, as well as sometimes in the longitude and latitude columns.
Does the data follow ROCCC?
The data does follow ROCCC:
Reliable: The data is reliable because it was obtained firsthand.
Original: The data is original because it is provided by the company as downloadable files.
Comprehensive: The data is comprehensive because of the large amount of data they contain.
Current: The data is current as it is from the most current year.
Cited: The data is cited on the download page.
领英推荐
The data is comprehensive in that the datasets have a lot of information to pull from and analyze. The data is also current, being from 2023.
Process
Part 1 (Microsoft Excel)
I opened each file month by month in Microsoft Excel before starting my data cleaning process, which is as follows:
Part 2 (Google BigQuery)
Once I had finished my initial cleaning in Microsoft Excel, I exported all the cleaned data as- .csv files to a folder I created ahead of time in my capstone project folder. I then uploaded the folder with the cleaned data to a Google Storage Bucket before uploading each table into Google BigQuery.
Once I combined the files into one table, I further cleaned the data by checking and deleting any null values I missed in my original sweep through in Excel.
Analyze
Once I had cleaned the data to a point where I was satisfied in Google Big Query, I then analyzed parts of the data. To start my analysis, I totaled the number of members and casual riders.
Then, I analyzed how casual riders and members use the bikes by days of the week. I analyzed which day of the week was the busiest for casual riders vs. which days were busiest for members.
Next, I analyzed which months casual riders and members used the most bikes.
After that, I analyzed the average ride length for casual riders vs. annual members.
Share (Tableau)
I decided to use Tableau Public to create my dashboard.
From my visualization, I was able to analyze the habits of members and casual riders. From my analysis, I observed these behaviors in casual members:
The behaviors of members pointed to:
(note: this forum post helped me with converting time duration.)
Act
Based on my analysis, I think Cyclistic should advertise closer to July and appeal to weekend casual riders to turn them into members. Cyclistic should offer casual riders more classic bike options than electric bikes.