Back to datasets
Dataset assetOpen Source CommunityHistorical ResearchAcademic Literature
ZhJiHo/CBRN-JSTOR-History
This dataset contains 47 training samples, each comprising two features: doi and fullText. The doi is a string representing the document's unique identifier; fullText is a sequence of strings representing the document's complete textual content. The total size of the dataset is 20,030,598 bytes, and the download size is 11,690,115 bytes. The default configuration of the dataset specifies the path to the training data file.
Source
hugging_face
Created
Nov 28, 2025
Updated
Jul 9, 2024
Signals
86 views
Availability
Linked source ready
Overview
Dataset description and usage context
Dataset Overview
Dataset Information
- Features:
doi: Data type – string.fullText: Data type – sequence of strings.
Data Split
- Training Set:
- Name:
train - Bytes: 20030598
- Samples: 47
- Name:
Dataset Size
- Download Size: 11690115 bytes
- Dataset Size: 20030598 bytes
Configuration
- Configuration Name:
default- Data Files:
- Split:
train - Path:
data/train-*
- Split:
- Data Files:
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.