JUHE API Marketplace
DATASET
Open Source Community

ksabeh/openbrand

This dataset contains multiple features such as category, title, brand, ASIN, image URL, position index, token count, title length, and title category. The dataset is divided into several subsets, including training, testing, and category‑specific subsets such as automotive, cellphones, clothes, electronics, grocery, pets, sports, toys, and a validation set. Each subset provides its byte size and number of examples. Total download size and overall size are also provided.

Updated 8/27/2023
hugging_face

Description

Dataset Overview

Features

  • category: string
  • title: string
  • brand: string
  • asin: string
  • imageURL: string
  • position_index: integer
  • num_tokens: integer
  • title_length: integer
  • title_category: string

Data Splits

  • train: 68,007,488 bytes, 181,551 samples
  • test: 18,875,793 bytes, 50,432 samples
  • automotive: 4,523,220 bytes, 12,891 samples
  • cellphones: 51,882,096 bytes, 78,478 samples
  • clothes: 37,489,496 bytes, 85,052 samples
  • electronics: 4,820,108 bytes, 9,568 samples
  • grocery: 1,567,047 bytes, 4,475 samples
  • new_cat: 93,547,671 bytes, 174,381 samples
  • pets: 4,175,961 bytes, 10,851 samples
  • sports: 3,804,172 bytes, 10,841 samples
  • toys: 4,161,246 bytes, 12,657 samples
  • val: 7,583,420 bytes, 20,172 samples

Dataset Size

  • Download size: 110,231,234 bytes
  • Dataset size: 300,437,718 bytes

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Product Classification
Market Analysis

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.