Ford GoBike Trip dataset
The Ford GoBike Trip dataset contains information on individual rides from a bike‑sharing system, covering the San Francisco Bay Area and surrounding regions. Each trip is anonymized and includes trip duration (seconds), start time and date, end time and date, start station ID, start station name, start station latitude, start station longitude, end station ID, end station name, end station latitude, end station longitude, bike ID, user type (subscriber or customer), member birth year, and member gender.
Description
Dataset Overview
Dataset Name
Ford GoBike Trip Data
Dataset Content
- Trip Duration (seconds)
- Start Time and Date
- End Time and Date
- Start Station ID
- Start Station Name
- Start Station Latitude
- Start Station Longitude
- End Station ID
- End Station Name
- End Station Latitude
- End Station Longitude
- Bike ID
- User Type (Subscriber or Customer)
- Member Year of Birth
- Member Gender
Dataset Analysis Goals
Perform exploratory data analysis using Python data‑science and visualization libraries to explore dataset variables, understand structure, anomalies, patterns, and relationships.
Dataset Analysis Observations
- Consistent column names (snake_case)
- Generate minutes from duration_sec
- Remove time from start_time and end_time columns for easier processing
- Compute trip distances using geographic data
- Filter reasonable member age range from member_year_of_birth
- Create age bins for member age groups
Dataset Analysis Questions
- How fast is Ford GoBike growing?
- How do riding trends vary by age, gender, weekday, and time of day?
- What differences exist between subscribers and customers?
- Which docks are used most frequently?
- When and where do all rides occur?
Dataset Analysis Findings
- Users aged 20‑30 account for about 40% of rides.
- Males represent 76% of rides, females 22%.
- Most rides occur on weekdays; weekend usage is half.
- Peak usage aligns with commuting hours, 8 am and 5 pm.
- 88.92% of rides are by subscribers.
- Average trip length: subscribers 10.769067 min, customers 23.846594 min.
- Age group 20‑30 dominates both user types.
- The most popular start and end station is Caltrain Station 2 (Townsend St at 4th St) in San Francisco.
- 5 pm is the peak hour for all bike‑share rides.
- Most rides start at 5th St. at Virginia St.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 1/12/2019
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.