JUHE API Marketplace
DATASET
Open Source Community

Book Cover Dataset

This dataset contains 207,572 books from the Amazon marketplace, intended for book cover image classification and data mining tasks. The dataset includes cover images, titles, authors, and categories.

Updated 10/25/2019
github

Description

Dataset Overview

Dataset Name

Book Cover Dataset

Dataset Content

Contains 207,572 books from the Amazon.com, Inc. marketplace.

Dataset Tasks

Task 1: Classification

  • Sub‑task A: Book Cover Image to Genre (BookCover30)
    • Description: Classify books based on their cover images.
    • Data: 57,000 cover images across 30 categories.
    • Split: Train and test sets split 90%‑10%.

Task 2: Data Mining

  • Sub‑task: Data Mining (Book32)
    • Description: Explore the entire book database.
    • Data: 207,572 books across 32 categories. Each book includes a cover image, title, author, and category.

Dataset Usage

Image Resources

  • Full‑size Images: Not provided due to size constraints; label files contain image URLs.
  • (224 × 224 × 3) Images: Resized images for the BookCover30 dataset are available for download.

Citation Information

  • Paper: "Judging a Book by its Cover," arXiv preprint arXiv:1610.09204 (2016).
  • Authors: B. K. Iwana, S. T. Raza Rizvi, S. Ahmed, A. Dengel, and S. Uchida.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Image Classification
Data Mining

Source

Organization: github

Created: 10/25/2019

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.