Back to datasets
Dataset assetOpen Source CommunityMedical ImagingImage Analysis

mimic-cxr-dataset

This dataset is primarily used for image analysis, containing three features: image, findings, and impression. The image feature stores image data; findings and impression store textual descriptions. The dataset includes a training set with 30,633 samples, total size 800,678,886 bytes, download size 792,886,513 bytes.

Source
huggingface
Created
Dec 15, 2024
Updated
Dec 15, 2024
Signals
1,026 views
Availability
Linked source ready
Overview

Dataset description and usage context

MIMIC‑CXR Dataset

Dataset Information

Features

  • image: image data (image type).
  • findings: textual description of findings (string type).
  • impression: textual overall impression (string type).

Data Splits

  • train: training set, 30,633 samples, 800,678,886 bytes.

Data Size

  • Download size: 792,886,513 bytes
  • Dataset size: 800,678,886 bytes

Configuration

  • config_name: default
    • data_files:
      • split: train
        • path: data/train-*
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio