Back to datasets
Dataset assetOpen Source CommunitySurvival AnalysisBreast Cancer

Haberman’s Cancer Survival Dataset

The Haberman survival dataset comprises data from a study conducted at the Billings Hospital of the University of Chicago between 1958 and 1970, involving patients who underwent breast cancer surgery. The dataset attributes include patients' age at operation, year of operation, number of positive axillary nodes detected, and survival status.

Source
github
Created
Mar 22, 2019
Updated
Apr 8, 2024
Signals
375 views
Availability
Linked source ready
Overview

Dataset description and usage context

Haberman’s Cancer Survival Data Set Summary

Data Description

  • Source: University of Chicago’s Billings Hospital
  • Period: 1958 - 1970
  • Objective: To predict patient survival after 5 years post-surgery for breast cancer

Attribute Information

  1. Age of patient at time of operation (numerical)
  2. Patient’s year of operation (year — 1900, numerical)
  3. Number of positive auxillary nodes detected (numerical)
  4. Survival status (class attribute)
    • 1 = the patient survived 5 years or longer
    • 2 = the patient died within 5 years

Analysis Tools

  • Python Libraries: Seaborn, Matplotlib, NumPy, Pandas
  • Visualization Example: Density plot of patient age vs. year of operation
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio