Back to datasets
Dataset assetOpen Source CommunityData MiningImage Classification
Book Cover Dataset
This dataset contains 207,572 books from the Amazon marketplace, intended for book cover image classification and data mining tasks. The dataset includes cover images, titles, authors, and categories.
Source
github
Created
Oct 25, 2019
Updated
Oct 25, 2019
Signals
297 views
Availability
Linked source ready
Overview
Dataset description and usage context
Dataset Overview
Dataset Name
Book Cover Dataset
Dataset Content
Contains 207,572 books from the Amazon.com, Inc. marketplace.
Dataset Tasks
Task 1: Classification
- Sub‑task A: Book Cover Image to Genre (BookCover30)
- Description: Classify books based on their cover images.
- Data: 57,000 cover images across 30 categories.
- Split: Train and test sets split 90%‑10%.
Task 2: Data Mining
- Sub‑task: Data Mining (Book32)
- Description: Explore the entire book database.
- Data: 207,572 books across 32 categories. Each book includes a cover image, title, author, and category.
Dataset Usage
Image Resources
- Full‑size Images: Not provided due to size constraints; label files contain image URLs.
- (224 × 224 × 3) Images: Resized images for the BookCover30 dataset are available for download.
- Download Link: Google Drive (657 MB)
Citation Information
- Paper: "Judging a Book by its Cover," arXiv preprint arXiv:1610.09204 (2016).
- Authors: B. K. Iwana, S. T. Raza Rizvi, S. Ahmed, A. Dengel, and S. Uchida.
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.