
online_retail.csv

This dataset contains the original online retail data downloaded from Kaggle and is intended for building an end-to-end data pipeline. It holds retail transaction records that can be modeled into fact and dimension tables and validated with data quality checks.
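As a rough illustration of the fact and dimension modeling mentioned above, the following Python sketch splits transaction rows into two dimension tables and one fact table. The column names (InvoiceNo, StockCode, Description, Quantity, InvoiceDate, UnitPrice, CustomerID, Country) are an assumption about the file layout, the sample rows are made up, and build_star_schema is a hypothetical helper, not part of the original pipeline.

```python
import csv
import io

# Made-up sample rows; the column names are an assumption about online_retail.csv.
SAMPLE_CSV = """InvoiceNo,StockCode,Description,Quantity,InvoiceDate,UnitPrice,CustomerID,Country
INV001,P001,Sample mug,6,2024-01-01 08:00,2.55,C001,France
INV001,P002,Sample lamp,2,2024-01-01 08:00,10.00,C001,France
INV002,P001,Sample mug,24,2024-01-02 09:30,2.55,C002,Germany
"""


def build_star_schema(rows):
    """Split flat transaction rows into two dimensions and one fact table."""
    dim_product = {}   # keyed by StockCode
    dim_customer = {}  # keyed by CustomerID
    fact_invoices = []
    for row in rows:
        # Keep the first description/country seen for each key.
        dim_product.setdefault(row["StockCode"], {"description": row["Description"]})
        dim_customer.setdefault(row["CustomerID"], {"country": row["Country"]})
        quantity = int(row["Quantity"])
        unit_price = float(row["UnitPrice"])
        # The fact row stores keys into the dimensions plus the measures.
        fact_invoices.append({
            "invoice_no": row["InvoiceNo"],
            "stock_code": row["StockCode"],
            "customer_id": row["CustomerID"],
            "quantity": quantity,
            "total": round(quantity * unit_price, 2),
        })
    return dim_product, dim_customer, fact_invoices


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
dim_product, dim_customer, fact_invoices = build_star_schema(rows)
print(len(dim_product), len(dim_customer), len(fact_invoices))
```

To try the same function on the real file, replace the in-memory sample with an open file handle, after confirming that the header matches the assumed column names.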

Updated 7/19/2024
github

Description

Retail Data Pipeline Dataset

Dataset Description

  • Data File Location: The folder dags/include/datasets/, which contains the following files:
    • online_retail.csv: Original dataset downloaded from Kaggle.
    • country.csv: Dataset generated using a BigQuery table.
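The data quality checks this dataset is meant to support could look roughly like the sketch below. The listing names Soda.io as the quality tool, but this example uses plain Python to stay self-contained. The rules (non-empty CustomerID, positive Quantity, non-negative UnitPrice), the column names, and the find_issues helper are all assumptions for illustration, and the sample rows are made up.

```python
import csv
import io

# Made-up sample rows; column names are assumed, not confirmed by the listing.
SAMPLE_CSV = """InvoiceNo,StockCode,Description,Quantity,InvoiceDate,UnitPrice,CustomerID,Country
INV001,P001,Sample mug,6,2024-01-01 08:00,2.50,C001,France
INV002,P002,Sample lamp,2,2024-01-01 09:00,10.00,,France
INV003,P001,Sample mug,-1,2024-01-02 10:00,2.50,C002,Germany
"""


def find_issues(rows):
    """Return (line_number, problem) pairs for rows that break the assumed rules."""
    issues = []
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(rows, start=2):
        if not row["CustomerID"].strip():
            issues.append((line_no, "missing CustomerID"))
        if int(row["Quantity"]) <= 0:
            issues.append((line_no, "non-positive Quantity"))
        if float(row["UnitPrice"]) < 0:
            issues.append((line_no, "negative UnitPrice"))
    return issues


# With the real file, pass open("dags/include/datasets/online_retail.csv", newline="")
# to csv.DictReader instead of the in-memory sample.
rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
issues = find_issues(rows)
for line_no, problem in issues:
    print(f"line {line_no}: {problem}")
```

Returning the problems as a list rather than raising on the first failure makes it easier to report every bad row in a single pass.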

Technology Stack

  • Data Processing Tools:
    • Python
    • Docker and Docker Compose
    • Soda.io
    • Metabase
    • Google Cloud Storage
    • Google BigQuery
    • Airflow (Astronomer edition)
    • dbt
    • GitHub

Topics

Retail Data
Data Modeling

Source

Organization: github

Created: 7/8/2024
