top of page

CitiBike Data Insights:
Analyzing User Behavior and Trip Patterns

Overview​:

This project aims to analyze and visualize NYC CitiBike trip data for non-technical end users. This demo highlights the use of Tableau for data visualization and Python for data analysis.

 

Purpose:

The primary goal of this project is to provide an interactive and dynamic dashboard that allows users to explore the CitiBike dataset to uncover patterns and trends that could help improve bike-sharing services and user experience.

​

​

 

 

 

 

 

 

 

 

 

 

 

​

​

​

 

Insights:

The full analysis can be found on Tableau Public here: https://shorturl.at/7RUBk

​

High Level Highlights:

  • Subscriber users account for 58% of of total rides

  • Nearly 50% of riders with known gender are male.

  • Generation Alpha accounts for 42% of users.

  • The average ride time is approximately 23 minutes.

  • Manhattan is the most popular area, with W 20 St & 11 Ave as the most frequent station.

  • Sunday is the most popular day overall, with Wednesday being the peak day for male users and Saturday for female users.

 

​

​

 

 

 

 

 

 

 

 

 

 

 

 

​

​

​​

​

​

​

​

​

​

​

​

​

​

​

​

​​

​

​

​

​​Actions

  • Offer targeted promotions or discounts to specific user groups (e.g., millennials, first-time users) to increase usage.

  • Use trip start and end data to identify high-demand areas for optimal station placement, improving convenience and reducing bike shortages.

  • Adjust fleet availability and pricing strategies during peak usage times to better serve users.

  • Continuously monitor and update user segments to refine service offerings and adjust for seasonal trends or behavioral changes.

 

Summary:

This project provides useful insights into how CitiBike users behave and how bike trips are made. Looking ahead, we could improve the analysis by including weather data (like temperature or rain) and the time of day. These factors could give us a better understanding of how weather affects when and how long people ride bikes. For example, we could see if more people use bikes when it's warmer or if rainy days lead to fewer trips. Similarly, looking at what times of day people use bikes the most could help manage bike availability, making sure more bikes are ready during busy periods like rush hours or evenings.

 

Features:

  • Trip Duration Bands: Categorizes trip durations into predefined time ranges for easy analysis.

  • Dynamic User Grouping: Allows for dynamic segmentation of users based on Gender, Generational Cohorts, or User Type.

  • Interactive Dashboards: Built with Tableau to provide interactive visualizations of the data, with the ability to toggle between different segments for deeper analysis.

  • Calculated Fields: Used to categorize data (e.g., Trip Duration, User Groups) into meaningful bands for easier comparison. Tools & Technologies

  • Tableau: Used for creating interactive dashboards and visualizations.

  • Python: Used for data processing, cleaning, and analysis. Calculations and data transformations were performed using Python libraries such as Pandas.

  • CitiBike Dataset: Obtained from Kaggle (https://citibikenyc.com/system-data), the dataset contains trip-level data for CitiBike users, including information on trip duration, start and end stations, user type, and more.

386389530-99ef2ef8-6b07-4f31-bf67-9fa6ce849785.png
386389621-5279dea6-e408-44ea-8cdb-76deb675372a.png
386389771-3b264168-9a46-4db5-82bf-07dd714fc1cf.png

Data Tools/Skills Used:

bottom of page