
California Housing

This dataset is a modified version of the California Housing dataset, sourced from Luís Torgo’s page (University of Porto). The original data came from the now‑defunct StatLib repository and can also be obtained from StatLib mirrors. It is constructed from the 1990 U.S. Census, where each row represents a census block group. The dataset includes the attributes longitude, latitude, median housing age, total rooms, total bedrooms, population, households, median income, median house value, and ocean proximity.

Updated 12/10/2023

Description

California Housing Dataset Overview

Data Source

  • This dataset is a modified version of the California Housing dataset. The original is available from Luís Torgo’s page (University of Porto), which obtained it from the StatLib repository.
  • The dataset is built from 1990 U.S. Census data for California, with each row representing a census block group.

Data Adjustments

  • 207 values were randomly removed from the total_bedrooms column to illustrate the handling of missing data.
  • A categorical attribute, ocean_proximity, was added to describe the position of each block group relative to the ocean.
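
The missing total_bedrooms values have to be handled before most analyses. One common option, sketched below, is to fill each gap with the median of the values that are present. This is only a minimal illustration under stated assumptions: the sample rows are made up (not taken from the dataset), and the helper name impute_median is hypothetical.

```python
import csv
import io
from statistics import median

# Hypothetical sample rows (made-up values) using three of the dataset's
# column names; an empty total_bedrooms field stands in for a removed value.
SAMPLE_CSV = """total_rooms,total_bedrooms,households
1000,200,180
2500,,400
1500,300,250
1200,250,210
3000,,520
"""


def impute_median(rows, column):
    """Fill empty values in `column` with the median of the present values.

    Converts every value in the column to float, modifies `rows` in place,
    and returns (fill_value, number_of_values_filled).
    """
    present = [float(r[column]) for r in rows if r[column] != ""]
    fill = median(present)
    filled = 0
    for r in rows:
        if r[column] == "":
            r[column] = fill
            filled += 1
        else:
            r[column] = float(r[column])
    return fill, filled


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
fill_value, n_filled = impute_median(rows, "total_bedrooms")
print(f"filled {n_filled} missing values with median {fill_value}")
```

Dropping the affected rows or dropping the column entirely are alternative strategies; median imputation keeps every row and is less sensitive to extreme values than the mean.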

Data Description

  • Attribute List:

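    • longitude: longitude of the block group
    • latitude: latitude of the block group
    • housing_median_age: median age of the houses in the block group
    • total_rooms: total number of rooms in the block group
    • total_bedrooms: total number of bedrooms in the block group (207 values missing)
    • population: population of the block group
    • households: number of households in the block group
    • median_income: median household income in the block group
    • median_house_value: median house value in the block group
    • ocean_proximity: position of the block group relative to the ocean (categorical)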
  • ocean_proximity Category Statistics:

    • <1H OCEAN: 9,136
    • INLAND: 6,551
    • NEAR OCEAN: 2,658
    • NEAR BAY: 2,290
    • ISLAND: 5
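
The category counts listed above can be checked for consistency and turned into shares with a few lines of Python. The sketch below uses only the figures from the list; the variable names are illustrative.

```python
# ocean_proximity category counts, copied from the list above.
OCEAN_PROXIMITY_COUNTS = {
    "<1H OCEAN": 9136,
    "INLAND": 6551,
    "NEAR OCEAN": 2658,
    "NEAR BAY": 2290,
    "ISLAND": 5,
}

# Total number of rows implied by the category counts.
total = sum(OCEAN_PROXIMITY_COUNTS.values())

# Fraction of rows in each category.
shares = {name: count / total for name, count in OCEAN_PROXIMITY_COUNTS.items()}

print(f"total rows: {total}")
for name, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<12}{share:7.2%}")
```

The counts sum to 20,640 rows. ISLAND accounts for only 5 of them, so a random train/test split can easily leave that category out of one of the two sets.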

Dataset Characteristics

  • The dataset combines geographic attributes (longitude, latitude, ocean_proximity) with housing and demographic attributes, with particular emphasis on the relationship between median_house_value and location.
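
One simple way to look at the relationship between house value and location is to summarize median_house_value per ocean_proximity category. The sketch below shows the grouping logic only: the records are made up for illustration and say nothing about the real distribution, and the function name median_value_by_category is hypothetical.

```python
from collections import defaultdict
from statistics import median

# Hypothetical (ocean_proximity, median_house_value) pairs, invented for
# illustration; they are NOT rows from the dataset.
SAMPLE = [
    ("INLAND", 95000.0),
    ("INLAND", 120000.0),
    ("NEAR BAY", 260000.0),
    ("NEAR BAY", 240000.0),
    ("<1H OCEAN", 210000.0),
    ("<1H OCEAN", 230000.0),
    ("<1H OCEAN", 250000.0),
]


def median_value_by_category(records):
    """Group values by category and return the median value per category."""
    groups = defaultdict(list)
    for category, value in records:
        groups[category].append(value)
    return {category: median(values) for category, values in groups.items()}


summary = median_value_by_category(SAMPLE)
for category, value in sorted(summary.items()):
    print(f"{category:<12}{value:>12,.0f}")
```

With the full dataset, the same function could be fed pairs read from the ocean_proximity and median_house_value columns.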

Topics

Housing Data
Census Analysis

Source

Organization: github

Created: 5/14/2023
