Dataset asset, Open Source Community, University Textbooks, Academic Texts

P1ayer-1/college-texts-annas-v1

---
dataset_info:
  features:
  - name: author
    dtype: int64
  - name: cover_url
    dtype: string
  - name: date_added
    dtype: string
  - name: date_modified
    dtype: string
  - name: description
    dtype: float64
  - name: edition
    dtype: int64
  - name: extension
    dtype: string
  - name: filesize
    dtype: string
  - name: filesize_reported
    dtype: string
  - name: in_libgen
    dtype: string
  - name: language
    dtype: string
  - name: md5
    dtype: string
  - name: md5_reported
    dtype: string
  - name: pages
    dtype: string
  - name: pilimi_torrent
    dtype: string
  - name: publisher
    dtype: string
  - name: series
    dtype: string
  - name: title
    dtype: string
  - name: unavailable
    dtype: string
  - name: volume
    dtype: int64
  - name: year
    dtype: string
  - name: zlibrary_id
    dtype: int64
  splits:
  - name: train
    num_bytes: 43134412
    num_examples: 43206
  download_size: 20108980
  dataset_size: 43134412
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
# Dataset Card for "college-texts-annas-v1"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

Source: hugging_face
Created: Nov 28, 2025
Updated: Aug 6, 2023
Signals: 98 views
Availability: Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Features

  • author: integer
  • cover_url: string
  • date_added: string
  • date_modified: string
  • description: float
  • edition: integer
  • extension: string
  • filesize: string
  • filesize_reported: string
  • in_libgen: string
  • language: string
  • md5: string
  • md5_reported: string
  • pages: string
  • pilimi_torrent: string
  • publisher: string
  • series: string
  • title: string
  • unavailable: string
  • volume: integer
  • year: string
  • zlibrary_id: integer
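
The feature list above can be written down as a plain mapping and used to check that a loaded copy still has the same columns and dtypes. This is a minimal sketch, not part of the dataset card: `EXPECTED_DTYPES` and `schema_mismatches` are hypothetical names introduced here, and the dtype strings are the ones given in the card's `dataset_info` block (`int64`, `float64`, `string`).

```python
# Column dtypes copied from the dataset card's `dataset_info.features` block.
EXPECTED_DTYPES = {
    "author": "int64",
    "cover_url": "string",
    "date_added": "string",
    "date_modified": "string",
    "description": "float64",
    "edition": "int64",
    "extension": "string",
    "filesize": "string",
    "filesize_reported": "string",
    "in_libgen": "string",
    "language": "string",
    "md5": "string",
    "md5_reported": "string",
    "pages": "string",
    "pilimi_torrent": "string",
    "publisher": "string",
    "series": "string",
    "title": "string",
    "unavailable": "string",
    "volume": "int64",
    "year": "string",
    "zlibrary_id": "int64",
}


def schema_mismatches(actual):
    """Return {column: (expected, found)} for every column that differs.

    `actual` maps column name -> dtype string. A column missing on either
    side is reported with None in the corresponding position.
    """
    mismatches = {}
    for name in sorted(set(EXPECTED_DTYPES) | set(actual)):
        expected = EXPECTED_DTYPES.get(name)
        found = actual.get(name)
        if expected != found:
            mismatches[name] = (expected, found)
    return mismatches
```

Assuming the Hugging Face `datasets` library is installed and `ds` is the loaded train split, the `actual` mapping could be built with `{name: feature.dtype for name, feature in ds.features.items()}`; an empty result from `schema_mismatches` would then mean the copy matches the card.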

Dataset Splits

  • train:
    • Size: 43134412 bytes (about 43.1 MB)
    • Records: 43206

Dataset Size

  • Download size: 20108980 bytes (about 20.1 MB)
  • Dataset size: 43134412 bytes (about 43.1 MB)
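
The size figures above give two quick planning numbers: the average record size and how much smaller the download is than the materialized dataset. The sketch below only does arithmetic on the values stated in the card; `size_summary` is a hypothetical helper name, and megabytes are taken as 1,000,000 bytes.

```python
# Values taken from the dataset card.
NUM_BYTES = 43_134_412      # dataset_size / train num_bytes
NUM_EXAMPLES = 43_206       # train num_examples
DOWNLOAD_SIZE = 20_108_980  # download_size


def size_summary(num_bytes, num_examples, download_size):
    """Derive per-record and download-versus-dataset figures."""
    return {
        "avg_bytes_per_record": num_bytes / num_examples,
        "download_ratio": download_size / num_bytes,
        "dataset_mb": num_bytes / 1_000_000,
        "download_mb": download_size / 1_000_000,
    }


summary = size_summary(NUM_BYTES, NUM_EXAMPLES, DOWNLOAD_SIZE)
for key, value in summary.items():
    print(f"{key}: {value:.2f}")
```

Each record averages just under 1 KB of metadata, and the download is a little under half the size of the dataset once loaded.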

Configuration

  • config_name: default
    • data_files:
      • split: train
      • path: data/train-*
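
The configuration above says that the `default` config has a single `train` split built from every file matching `data/train-*`. The sketch below illustrates that selection and shows one way to load the split, assuming the Hugging Face `datasets` library is installed and the repository is reachable. `files_for_train` and `load_train` are hypothetical helper names, the file names in the usage example are made up for illustration, and `fnmatch` is used only as an approximation of how the pattern is resolved.

```python
from fnmatch import fnmatch

REPO_ID = "P1ayer-1/college-texts-annas-v1"
TRAIN_PATTERN = "data/train-*"  # `path` from the card's `configs` block


def files_for_train(paths):
    """Keep only the repository paths that match the train split pattern."""
    return [p for p in paths if fnmatch(p, TRAIN_PATTERN)]


def load_train():
    """Load the train split of the default config (needs network access)."""
    # Imported lazily so the pattern helper works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name="default", split="train")


# Hypothetical repository listing, for illustration only.
example_paths = [
    "README.md",
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
]
print(files_for_train(example_paths))
```

Calling `load_train()` should return a dataset whose length can be compared against the 43206 records stated in the card.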