Cyclistic Capstone Concluded!
Caroline J.
Data Analyst | Business Intelligence | I help companies drive data informed decision making | Remote
Cyclistic Capstone Concluded!
Although I've never had the chance to use the service, I clearly remember the first time I saw a bike-for-hire service in my city. My first thoughts was "What is that bike doing there? I hope nobody steals it!" Then, my Mom (who reads the paper because she's a responsible adult ??) explained to me that it was a new business model. And I was fascinated! I'm always impressed by entrepreneurs, the idea to turn a simple concept into a money making opportunity is really admirable.
So when I had the chance to explore a dataset for a bike rental company through my capstone project with the Google Data Analytics certificate, I jumped at the opportunity to identify how annual members and casual riders utilize Cyclistic bikes differently to find ways to convert casual users to members.
Using RStudio, I imported the 12 csv data files representing September 2021 through August 2022?and discovered:
First, the Data!
This dataset comes from the Google Data Analytics Certificate Capstone Project.
The link to the dataset is?Divvi Trip Data.
I utilized the data files for September 2021 through August 2022 which used the following naming convention: YYYYMM-divvy-tripdata.
Of the thirteen columns, some of the most important are the: customer id, bike type, started at date and time, ended at date and time, start station name, end station name, and member type (member of casual). Since the size of the files varies from 103,771 rows to 823,489 rows, I decided to utilize RStudio to perform the data cleaning and analysis.
To begin, I loaded the following packages: tidyverse, lubridate, janitor, scales, readr and forecast. Next, I uploaded the 12 csv files, combined them into a single data frame, used the janitor package to clean the rows and columns and eliminated all columns except the necessary ones: (rideable_type, started_at, ended_at, start_station_name, member_casual). Additionally, I eliminated all rows with an empty start station because my analysis was aimed at determining the most popular start station.?
Next, I started working with the start and end dates to get them in the correct format to?utilize for calculating ride lengths.
Once this was accomplished, I was able to calculate trip duration. At this point, some data cleaning was needed to eliminate rides that were less than a minute (574 rides, in total).?
All of the details of the code I wrote for this project, including the graphs, can be found in my RMarkdown file on GitHub at.
The Analysis
Since the main task is to understand how casual and member riders differ, in hopes of converting more casual members to members, I thought that understanding the rider base to be a solid first step.
Next, I wondered if the riding habits of casual and members differed. As it turns out, casual riders rent the Cyclistic bikes more often.
领英推荐
Discovering this made me curious about the distance each type of rider typically goes. Again, it was the casual riders who consistently went farther than member riders.
I then turned my attention to looking for trends at the daily, weekly and yearly level. And I discovered that 3pm-6pm were peak ride times.
While weekdays were more popular for casual riders, members rode more during the week.
And seasonally, the warmer months of June through October are the busiest.
Finally, I wanted to analyze which start stations were the most popular for marketing & promotional purposes and discovered the top 10 start stations.
Recap & Recommend
The initial task was to investigate opportunities to convert casual riders to members, using data analysis techniques in R programming and it was discovered that:
Based on this information, these are my recommendations to increase membership riders:
Here is where you can view this information presented in a PowerPoint format.
Action!
I thank you for reading and welcome your feedback! Please consider following me or connecting on LinkedIn at?Carly Jocson and explore my portfolio at Analytics Alcove. Also, please keep me in mind for any remote positions as a data analyst!
Research Analyst
1 年This is fantastic and I have an R buddy in the DAA. Way to go!!!
English to Malay Translator ? Education & Pedagogy ? Government ? Social Science ? Data Science ?? 10+ years in Education ??MEd in TESL
1 年Love your article ??. Your dedication to this project is admirable. I can tell that you put a lot of time and effort into this. Well done! Caroline Jocson, MAT
Data Analyst | Excel | SQL | Tableau | Top Secret Security Clearance | Army Veteran
1 年Great work Carly! I really enjoyed reading the article. It’s very easy for the reader to grasp the main points, but I especially liked how you included your thought process as you went along. It made it really easy to understand your analysis, insights and recommendations. Well done!
Passionate Executive following life's calling to do "work with purpose"
1 年Great analysis! What’s next????
M.Sc in Data Analytics | Data Analyst | Business Analyst | Finance Analyst | with 14 years in IT and Data Analytics
1 年Great way to go Caroline, as the Same goes Practice makes perfect, I believe next year you will go Further and even Finding what you wished for.?