JUHE API Marketplace
DATASET
Open Source Community

ChestX-Det10

This dataset contains 3,543 chest X‑ray images, annotated with bounding boxes for ten chest abnormalities, including nodules, masses, and pneumothorax. The project uses the ChestX‑Det10 dataset, which is a subset of NIH ChestX‑14.

Updated 12/5/2024
github

Description

Dataset Overview

Dataset Information

  • Dataset Name: ChestX-Det10
  • Data Source: This dataset is a subset of the NIH ChestX-14 dataset.
  • Data Type: Chest X‑ray images
  • Number of Images:
    • Training images: 3,001
    • Test images: 1,000+
  • Number of Classes: 10
  • Class List:
    • Consolidation
    • Pneumothorax
    • Emphysema
    • Calcification
    • Nodule
    • Mass
    • Fracture
    • Effusion
    • Atelectasis
    • Fibrosis
  • Data Missing: 22.69% of the data is considered background.

Dataset Characteristics

  • Class Distribution: The dataset exhibits class imbalance, with Consolidation and Effusion being the most common, while Mass and Pneumothorax are relatively rare.
  • Bounding Box Distribution: Most bounding boxes are small, with width and height primarily below 200 pixels, and the area distribution skewed toward smaller regions.

Data Processing

  • Data Augmentation: Applied random horizontal flips, random rotations, color jitter, and normalization.
  • Class Weight Adjustment: Used weighted loss functions to address class imbalance.
  • Attention Mechanism: Introduced CBAM (Convolutional Block Attention Module) to enhance feature selection.

Dataset Applications

  • Object Detection: Used to train and evaluate YOLOv8 and Faster R-CNN models for detecting abnormalities in chest X‑ray images.
  • Model Comparison: Analyze the performance of YOLOv8 and Faster R-CNN in terms of speed, accuracy, and precision.
  • Performance Optimization: Optimize model performance using class weighting, data augmentation, and learning rate adjustments.

Dataset Citation

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Medical Imaging Analysis
Anomaly Detection

Source

Organization: github

Created: 11/27/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.