California Housing
This dataset is a modified version of the California Housing dataset, sourced from Luís Torgo’s page (University of Porto). The original data came from the now‑defunct StatLib repository and can also be obtained from StatLib mirrors. It is constructed from the 1990 U.S. Census, where each row represents a census tract. The dataset includes attributes such as longitude, latitude, median housing age, total rooms, total bedrooms, population, households, median income, median house value, and ocean proximity.
Description
California Housing Dataset Overview
Data Source
- This dataset is a modified version of the California Housing dataset, the original dataset sourced from Luís Torgo’s page (University of Porto), initially obtained from the StatLib repository.
- The dataset is built from the 1990 California census, with each row representing a census block group.
Data Adjustments
- Randomly removed 207 values from the
total_bedroomscolumn to illustrate handling of missing data. - Added a categorical attribute
ocean_proximityto describe the relative position of each block group to the ocean.
Data Description
-
Attribute List:
longitude: longitudelatitude: latitudehousing_median_age: median house agetotal_rooms: total number of roomstotal_bedrooms: total number of bedroomspopulation: populationhouseholds: number of householdsmedian_income: median incomemedian_house_value: median house valueocean_proximity: relative position to the ocean
-
ocean_proximityCategory Statistics:<1H OCEAN: 9,136INLAND: 6,551NEAR OCEAN: 2,658NEAR BAY: 2,290ISLAND: 5
Dataset Characteristics
- The dataset contains a variety of geographic and housing‑related attributes, with particular emphasis on the relationship between house value and location.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 5/14/2023
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.