Dataset asset: Open Source Community, Survival Analysis, Breast Cancer
Haberman’s Cancer Survival Dataset
The Haberman survival dataset comprises data from a study conducted at the Billings Hospital of the University of Chicago between 1958 and 1970, involving patients who underwent breast cancer surgery. The dataset attributes include patients' age at operation, year of operation, number of positive axillary nodes detected, and survival status.
Source
GitHub
Created
Mar 22, 2019
Updated
Apr 8, 2024
Overview
Dataset description and usage context
Haberman’s Cancer Survival Data Set Summary
Data Description
- Source: University of Chicago’s Billings Hospital
- Period: 1958 - 1970
- Objective: Predict whether a patient survives at least 5 years after breast cancer surgery
Attribute Information
- Age of patient at time of operation (numerical)
- Patient’s year of operation (year minus 1900, numerical)
- Number of positive axillary nodes detected (numerical)
- Survival status (class attribute)
  - 1 = the patient survived 5 years or longer
  - 2 = the patient died within 5 years
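The attribute list above maps naturally onto a four-column table. The following is a minimal sketch of loading it with Pandas, assuming the file is a headerless, comma-separated text file with the columns in the order listed. The column names, the `load_haberman` helper, and the sample rows are hypothetical and chosen only for illustration.

```python
import io

import pandas as pd

# Column names are our own labels for the four attributes listed above.
COLUMNS = ["age", "operation_year", "positive_nodes", "survival_status"]


def load_haberman(source):
    """Read a headerless, comma-separated copy of the dataset (assumed layout)."""
    df = pd.read_csv(source, header=None, names=COLUMNS)
    # The year is stored as (year minus 1900), so add 1900 for a calendar year.
    df["operation_year_full"] = df["operation_year"] + 1900
    # Status 1 = survived 5 years or longer, 2 = died within 5 years.
    df["survived_5y"] = df["survival_status"] == 1
    return df


# Hypothetical rows in the assumed layout, for illustration only (not real records).
sample = io.StringIO("45,60,0,1\n52,66,4,2\n38,63,1,1\n")
df = load_haberman(sample)
print(df)
```

To use it on the actual file, pass a local path instead of the in-memory sample. Converting the 1/2 class code into a boolean column makes later filtering and grouping easier to read.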
Analysis Tools
- Python Libraries: Seaborn, Matplotlib, NumPy, Pandas
- Visualization Example: Density plot of patient age vs. year of operation