New York City Yellow Taxi Trip data
The NYC Yellow Taxi trip records capture pickup and drop‑off dates and times, pickup and drop‑off locations, trip distance, itemized fare details, rate type, payment type, and the number of passengers reported by the driver.
Description
NYC Yellow Taxi Tripdata Analytics | Microsoft Azure Data Engineering Project
Dataset Overview
Dataset Description
NYC Yellow trip records contain the following fields:
- Pickup and drop‑off dates and times
- Pickup and drop‑off locations
- Trip distance
- Fare details
- Rate type
- Payment type
- Driver‑reported passenger count
Dataset Source
- Original source: https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
- Data dictionary: https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf
Dataset Usage
This dataset underpins a comprehensive Azure data engineering project aimed at ingesting, transforming, analyzing, and visualizing New York City taxi trip data.
Project Architecture

Azure Services Used
- Azure Data Factory (ADF)
- Azure Data Lake Storage Gen2 (ADLS Gen2)
- Azure Databricks
- Azure Synapse Analytics
- Key Vault
- Azure Active Directory
- Power BI
Languages Used
- Programming languages: Python, PySpark
- Scripting language: SQL
Data Model

Power BI Dashboard

AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 11/14/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.