Ford GoBike Trip dataset

The Ford GoBike Trip dataset contains information on individual rides from a bike‑sharing system, covering the San Francisco Bay Area and surrounding regions. Each trip is anonymized and includes trip duration (seconds), start time and date, end time and date, start station ID, start station name, start station latitude, start station longitude, end station ID, end station name, end station latitude, end station longitude, bike ID, user type (subscriber or customer), member birth year, and member gender.

Updated 12/16/2020

Description

Dataset Overview

Dataset Name

Ford GoBike Trip Data

Dataset Content

  • Trip Duration (seconds)
  • Start Time and Date
  • End Time and Date
  • Start Station ID
  • Start Station Name
  • Start Station Latitude
  • Start Station Longitude
  • End Station ID
  • End Station Name
  • End Station Latitude
  • End Station Longitude
  • Bike ID
  • User Type (Subscriber or Customer)
  • Member Year of Birth
  • Member Gender

Dataset Analysis Goals

Perform exploratory data analysis with Python data‑science and visualization libraries to examine the dataset's variables and understand its structure, anomalies, patterns, and relationships.

Dataset Analysis Observations

  • Make column names consistent (snake_case)
  • Derive trip duration in minutes from duration_sec
  • Remove time from start_time and end_time columns for easier processing
  • Compute trip distances from the start and end station coordinates
  • Filter member ages, derived from member_year_of_birth, to a reasonable range
  • Create age bins for member age groups
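
The preparation steps above can be sketched with the Python standard library alone. This is a minimal illustration and not the original analysis code: the function names, the reference year 2019, the 18 to 80 age filter, and the ten-year bin width are all assumptions chosen for the example, and the haversine formula is one common way to approximate trip distance from start and end coordinates.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def duration_minutes(duration_sec):
    """Convert a trip duration from seconds to minutes."""
    return duration_sec / 60.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def member_age(birth_year, reference_year=2019):
    """Age in years; reference_year is an assumed value, not taken from the dataset."""
    return reference_year - birth_year


def age_bin(age, min_age=18, max_age=80, width=10):
    """Return a label such as '20-30', or None when the age falls outside the kept range.

    The range limits and bin width are illustrative assumptions.
    """
    if age is None or not (min_age <= age <= max_age):
        return None
    low = (int(age) // width) * width
    return f"{low}-{low + width}"
```

Returning None for out-of-range ages keeps the filter and the binning in one place, so implausible birth years can be dropped in a single pass.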

Dataset Analysis Questions

  • How fast is Ford GoBike growing?
  • How do riding trends vary by age, gender, weekday, and time of day?
  • What differences exist between subscribers and customers?
  • Which docks are used most frequently?
  • When and where do all rides occur?
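
Questions about weekday and time-of-day trends come down to counting rides per hour and per weekday. The sketch below shows one way to do that with the standard library. The timestamp layout in TIME_FORMAT is an assumption about how start_time is stored, the helper name is hypothetical, and the three sample values are invented for illustration.

```python
from collections import Counter
from datetime import datetime

TIME_FORMAT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of start_time values
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def rides_by_hour_and_weekday(start_times):
    """Count rides per start hour (0-23) and per weekday name."""
    by_hour = Counter()
    by_weekday = Counter()
    for raw in start_times:
        ts = datetime.strptime(raw, TIME_FORMAT)
        by_hour[ts.hour] += 1
        by_weekday[WEEKDAYS[ts.weekday()]] += 1  # weekday(): Monday is 0
    return by_hour, by_weekday


# Invented sample timestamps, not real dataset rows.
sample = [
    "2019-02-28 17:32:10.1450",
    "2019-02-28 08:05:00.0000",
    "2019-03-02 17:45:30.5000",
]
by_hour, by_weekday = rides_by_hour_and_weekday(sample)
```

Calling `by_hour.most_common(1)` on the result gives the busiest start hour in whatever data is passed in.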

Dataset Analysis Findings

  • Users aged 20‑30 account for about 40% of rides.
  • Males represent 76% of rides, females 22%.
  • Most rides occur on weekdays; weekend usage is about half of weekday usage.
  • Peak usage aligns with commuting hours, 8 am and 5 pm.
  • 88.92% of rides are by subscribers.
  • Average trip duration: subscribers 10.77 min, customers 23.85 min.
  • Age group 20‑30 dominates both user types.
  • The most popular start and end station is Caltrain Station 2 (Townsend St at 4th St) in San Francisco.
  • 5 pm is the peak hour for all bike‑share rides.
  • Most rides start at 5th St. at Virginia St.
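
Figures such as the subscriber share and the average trip duration per user type can be derived with a simple grouped summary. The following is a minimal sketch under stated assumptions: the helper name and the (user_type, duration_sec) tuple layout are hypothetical, and the sample trips are invented, so the output does not reproduce the numbers listed above.

```python
from collections import defaultdict


def summarize_by_user_type(trips):
    """Summarize an iterable of (user_type, duration_sec) pairs.

    Returns, per user type, its share of all rides in percent and its
    mean trip duration in minutes.
    """
    counts = defaultdict(int)
    total_sec = defaultdict(float)
    for user_type, duration_sec in trips:
        counts[user_type] += 1
        total_sec[user_type] += duration_sec
    n_rides = sum(counts.values())
    return {
        user_type: {
            "share_pct": 100.0 * counts[user_type] / n_rides,
            "mean_min": total_sec[user_type] / counts[user_type] / 60.0,
        }
        for user_type in counts
    }


# Invented sample trips, not real dataset rows.
sample = [
    ("Subscriber", 600),
    ("Subscriber", 720),
    ("Subscriber", 480),
    ("Customer", 1500),
]
summary = summarize_by_user_type(sample)
```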

Topics

Bike Sharing
Data Analysis

Source

Organization: github

Created: 1/12/2019
