Explore high-quality datasets for your AI and machine learning projects.
This dataset contains 47 training samples, each comprising two features: doi and fullText. The doi is a string representing the document's unique identifier; fullText is a sequence of strings representing the document's complete textual content. The total size of the dataset is 20,030,598 bytes, and the download size is 11,690,115 bytes. The default configuration of the dataset specifies the path to the training data file.