Dataset asset | Open Source Community | Market Analysis | Product Classification
ksabeh/openbrand
Each record in this dataset has nine features: category, title, brand, ASIN, image URL, position index, token count, title length, and title category. The dataset is divided into twelve subsets: train, test, and val splits, category-specific subsets for automotive, cellphones, clothes, electronics, grocery, pets, sports, and toys, and a new_cat subset. The byte size and number of examples are listed for each subset, along with the total download size and the overall dataset size.
Source
hugging_face
Created
Nov 28, 2025
Updated
Aug 27, 2023
Signals
174 views
Availability
Linked source ready
Dataset Overview
Features
- category: string
- title: string
- brand: string
- asin: string
- imageURL: string
- position_index: integer
- num_tokens: integer
- title_length: integer
- title_category: string
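The feature list above can be mirrored as a typed record for checking rows before downstream use. This is a minimal sketch using only the Python standard library: the class name `OpenBrandRecord`, the helper `validate_record`, and the example row values are hypothetical and not taken from the dataset; only the field names and types come from the list.

```python
from dataclasses import dataclass, fields


@dataclass
class OpenBrandRecord:
    """One row, with field names and types as given in the Features list."""
    category: str
    title: str
    brand: str
    asin: str
    imageURL: str
    position_index: int
    num_tokens: int
    title_length: int
    title_category: str


def validate_record(row: dict) -> OpenBrandRecord:
    """Raise if a listed field is missing or has the wrong type."""
    expected = {f.name: f.type for f in fields(OpenBrandRecord)}
    missing = expected.keys() - row.keys()
    if missing:
        raise ValueError(f"missing fields: {sorted(missing)}")
    for name, typ in expected.items():
        if not isinstance(row[name], typ):
            raise TypeError(f"{name} should be {typ.__name__}")
    return OpenBrandRecord(**{name: row[name] for name in expected})


# Made-up values for illustration only; not an actual row from the dataset.
example_row = {
    "category": "Electronics",
    "title": "ExampleBrand wireless mouse",
    "brand": "ExampleBrand",
    "asin": "B000000000",
    "imageURL": "https://example.com/image.jpg",
    "position_index": 0,
    "num_tokens": 3,
    "title_length": 27,
    "title_category": "short",
}

record = validate_record(example_row)
print(record.brand)
```

The meaning of fields such as position_index and title_category is not described in the listing, so the example values for them are placeholders.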
Data Splits
- train: 68,007,488 bytes, 181,551 samples
- test: 18,875,793 bytes, 50,432 samples
- automotive: 4,523,220 bytes, 12,891 samples
- cellphones: 51,882,096 bytes, 78,478 samples
- clothes: 37,489,496 bytes, 85,052 samples
- electronics: 4,820,108 bytes, 9,568 samples
- grocery: 1,567,047 bytes, 4,475 samples
- new_cat: 93,547,671 bytes, 174,381 samples
- pets: 4,175,961 bytes, 10,851 samples
- sports: 3,804,172 bytes, 10,841 samples
- toys: 4,161,246 bytes, 12,657 samples
- val: 7,583,420 bytes, 20,172 samples
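The split figures above lend themselves to a few quick derived numbers, such as the average bytes per sample and the relative sizes of the train, val, and test splits. The sketch below copies the values from the list; treating train, val, and test as one partition of the same pool is an assumption, since the listing does not say how the subsets relate to each other.

```python
# (bytes, samples) per subset, copied from the Data Splits list.
SPLITS = {
    "train": (68_007_488, 181_551),
    "test": (18_875_793, 50_432),
    "automotive": (4_523_220, 12_891),
    "cellphones": (51_882_096, 78_478),
    "clothes": (37_489_496, 85_052),
    "electronics": (4_820_108, 9_568),
    "grocery": (1_567_047, 4_475),
    "new_cat": (93_547_671, 174_381),
    "pets": (4_175_961, 10_851),
    "sports": (3_804_172, 10_841),
    "toys": (4_161_246, 12_657),
    "val": (7_583_420, 20_172),
}

total_samples = sum(n for _, n in SPLITS.values())

# Average size of one example in each subset.
avg_bytes = {name: b / n for name, (b, n) in SPLITS.items()}

# Assumption: train, val, and test together form the main partition.
main = ("train", "val", "test")
main_total = sum(SPLITS[s][1] for s in main)
fractions = {s: SPLITS[s][1] / main_total for s in main}

for name, (b, n) in SPLITS.items():
    print(f"{name:<12} {n:>8,} samples  {avg_bytes[name]:7.1f} bytes/sample")
print(f"total samples: {total_samples:,}")
print({s: round(f, 3) for s, f in fractions.items()})
```

Under that assumption, the three splits come out at roughly 72% train, 8% val, and 20% test.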
Dataset Size
- Download size: 110,231,234 bytes
- Dataset size: 300,437,718 bytes
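As a consistency check, the per-subset byte counts in Data Splits can be summed and compared with the reported dataset size, and the download size can be expressed as a fraction of it. The sketch below uses only the figures on this page. Reading the download size as the size of the compressed files to fetch and the dataset size as the size of the prepared data follows the usual Hugging Face convention, which is assumed here rather than stated in the listing.

```python
# Byte counts per subset, in the order given in the Data Splits list.
SPLIT_BYTES = [
    68_007_488,  # train
    18_875_793,  # test
    4_523_220,   # automotive
    51_882_096,  # cellphones
    37_489_496,  # clothes
    4_820_108,   # electronics
    1_567_047,   # grocery
    93_547_671,  # new_cat
    4_175_961,   # pets
    3_804_172,   # sports
    4_161_246,   # toys
    7_583_420,   # val
]
DOWNLOAD_SIZE = 110_231_234
DATASET_SIZE = 300_437_718

total_split_bytes = sum(SPLIT_BYTES)
sizes_match = total_split_bytes == DATASET_SIZE

# Download size as a fraction of the dataset size.
ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(f"sum of subsets: {total_split_bytes:,} bytes (matches: {sizes_match})")
print(f"download / dataset size: {ratio:.3f}")
```

The subset sizes add up exactly to the reported dataset size, and the download is a little over a third of that size.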