ksabeh/openbrand
This dataset contains multiple features such as category, title, brand, ASIN, image URL, position index, token count, title length, and title category. The dataset is divided into several subsets, including training, testing, and category‑specific subsets such as automotive, cellphones, clothes, electronics, grocery, pets, sports, toys, and a validation set. Each subset provides its byte size and number of examples. Total download size and overall size are also provided.
Description
Dataset Overview
Features
- category: string
- title: string
- brand: string
- asin: string
- imageURL: string
- position_index: integer
- num_tokens: integer
- title_length: integer
- title_category: string
Data Splits
- train: 68,007,488 bytes, 181,551 samples
- test: 18,875,793 bytes, 50,432 samples
- automotive: 4,523,220 bytes, 12,891 samples
- cellphones: 51,882,096 bytes, 78,478 samples
- clothes: 37,489,496 bytes, 85,052 samples
- electronics: 4,820,108 bytes, 9,568 samples
- grocery: 1,567,047 bytes, 4,475 samples
- new_cat: 93,547,671 bytes, 174,381 samples
- pets: 4,175,961 bytes, 10,851 samples
- sports: 3,804,172 bytes, 10,841 samples
- toys: 4,161,246 bytes, 12,657 samples
- val: 7,583,420 bytes, 20,172 samples
Dataset Size
- Download size: 110,231,234 bytes
- Dataset size: 300,437,718 bytes
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.