DATASET
Open Source Community
ChestX-Det10
This dataset contains 3,543 chest X‑ray images, annotated with bounding boxes for ten chest abnormalities, including nodules, masses, and pneumothorax. The project uses the ChestX‑Det10 dataset, which is a subset of NIH ChestX‑14.
Updated 12/5/2024
github
Description
Dataset Overview
Dataset Information
- Dataset Name: ChestX-Det10
- Data Source: This dataset is a subset of the NIH ChestX-14 dataset.
- Data Type: Chest X‑ray images
- Number of Images:
- Training images: 3,001
- Test images: 1,000+
- Number of Classes: 10
- Class List:
- Consolidation
- Pneumothorax
- Emphysema
- Calcification
- Nodule
- Mass
- Fracture
- Effusion
- Atelectasis
- Fibrosis
- Data Missing: 22.69% of the data is considered background.
Dataset Characteristics
- Class Distribution: The dataset exhibits class imbalance, with Consolidation and Effusion being the most common, while Mass and Pneumothorax are relatively rare.
- Bounding Box Distribution: Most bounding boxes are small, with width and height primarily below 200 pixels, and the area distribution skewed toward smaller regions.
Data Processing
- Data Augmentation: Applied random horizontal flips, random rotations, color jitter, and normalization.
- Class Weight Adjustment: Used weighted loss functions to address class imbalance.
- Attention Mechanism: Introduced CBAM (Convolutional Block Attention Module) to enhance feature selection.
Dataset Applications
- Object Detection: Used to train and evaluate YOLOv8 and Faster R-CNN models for detecting abnormalities in chest X‑ray images.
- Model Comparison: Analyze the performance of YOLOv8 and Faster R-CNN in terms of speed, accuracy, and precision.
- Performance Optimization: Optimize model performance using class weighting, data augmentation, and learning rate adjustments.
Dataset Citation
- ChestX-ray14 Dataset: arXiv:1705.02315
- CheXNet: arXiv:1711.05225
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Medical Imaging Analysis
Anomaly Detection
Source
Organization: github
Created: 11/27/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.