DATASET
Open Source Community
Book Cover Dataset
This dataset contains 207,572 books from the Amazon marketplace, intended for book cover image classification and data mining tasks. The dataset includes cover images, titles, authors, and categories.
Updated 10/25/2019
github
Description
Dataset Overview
Dataset Name
Book Cover Dataset
Dataset Content
Contains 207,572 books from the Amazon.com, Inc. marketplace.
Dataset Tasks
Task 1: Classification
- Sub‑task A: Book Cover Image to Genre (BookCover30)
- Description: Classify books based on their cover images.
- Data: 57,000 cover images across 30 categories.
- Split: Train and test sets split 90%‑10%.
Task 2: Data Mining
- Sub‑task: Data Mining (Book32)
- Description: Explore the entire book database.
- Data: 207,572 books across 32 categories. Each book includes a cover image, title, author, and category.
Dataset Usage
Image Resources
- Full‑size Images: Not provided due to size constraints; label files contain image URLs.
- (224 × 224 × 3) Images: Resized images for the BookCover30 dataset are available for download.
- Download Link: Google Drive (657 MB)
Citation Information
- Paper: "Judging a Book by its Cover," arXiv preprint arXiv:1610.09204 (2016).
- Authors: B. K. Iwana, S. T. Raza Rizvi, S. Ahmed, A. Dengel, and S. Uchida.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Image Classification
Data Mining
Source
Organization: github
Created: 10/25/2019
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.