
Elise-hf/PwC

Each record in this dataset describes one paper through the following fields: an ID (uid), paper URL, arXiv ID, title, abstract, abstract and PDF URLs, proceeding, authors, tasks, date, and methods. The dataset is split into a training set of 149,495 samples and a test set of 37,108 samples, 186,603 samples in total. The total dataset size is 547,449,614 bytes.

Updated 4/18/2023
hugging_face

Description

Dataset Overview

Dataset Features

  • uid: Data type is int64
  • paper_url: Data type is string
  • arxiv_id: Data type is string
  • title: Data type is string
  • abstract: Data type is string
  • url_abs: Data type is string
  • url_pdf: Data type is string
  • proceeding: Data type is string
  • authors: Data type is sequence of string
  • tasks: Data type is sequence of string
  • date: Data type is float64
  • methods: Data type is list, containing the following sub‑features:
    • code_snippet_url: Data type is string
    • description: Data type is string
    • full_name: Data type is string
    • introduced_year: Data type is int64
    • main_collection: Data type is struct, containing the following sub‑features:
      • area: Data type is string
      • description: Data type is string
      • name: Data type is string
      • parent: Data type is string
    • name: Data type is string
    • source_title: Data type is string
    • source_url: Data type is string
  • __index_level_0__: Data type is int64
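
The nested methods feature is the least obvious part of the schema, so a minimal sketch of walking it may help. It assumes each record arrives as a plain Python dict whose methods field is a list of dicts, and that main_collection can be missing or null. The record below is hypothetical and invented for illustration; it is not taken from the dataset, and summarize_methods is an illustrative helper, not part of any library.

```python
from typing import Any, Dict, List, Optional, Tuple

# Hypothetical record shaped like the feature list above.
# Every value is invented for illustration only.
example_record: Dict[str, Any] = {
    "uid": 0,
    "title": "An Example Paper",
    "tasks": ["Image Classification"],
    "methods": [
        {
            "name": "ExampleNet",
            "full_name": "Example Network",
            "introduced_year": 2020,
            "main_collection": {
                "area": "Computer Vision",
                "description": "",
                "name": "Example Collection",
                "parent": "",
            },
        },
        {
            "name": "ExampleNorm",
            "full_name": "Example Normalization",
            "introduced_year": 2019,
            # Assumption: the struct may be null for some methods.
            "main_collection": None,
        },
    ],
}


def summarize_methods(record: Dict[str, Any]) -> List[Tuple[Optional[str], Optional[str]]]:
    """Return (method name, collection area) pairs for one record."""
    pairs = []
    # Treat a missing or null methods field as an empty list.
    for method in record.get("methods") or []:
        # Treat a missing or null main_collection as an empty struct.
        collection = method.get("main_collection") or {}
        pairs.append((method.get("name"), collection.get("area")))
    return pairs


print(summarize_methods(example_record))
```

Falling back to an empty list or dict keeps the helper usable on records where methods or main_collection is absent, at the cost of reporting None for the area.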

Dataset Splits

  • train: Size 437,349,959 bytes, containing 149,495 samples
  • test: Size 110,099,655 bytes, containing 37,108 samples

Dataset Size

  • Download size: 183,963,479 bytes
  • Total size: 547,449,614 bytes
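
As a quick consistency check on the figures above, the following sketch recomputes the totals and a few derived ratios. The numbers are copied from the split and size lists; the variable names are illustrative.

```python
# Figures copied from the split and size lists above.
SPLITS = {
    "train": {"bytes": 437_349_959, "samples": 149_495},
    "test": {"bytes": 110_099_655, "samples": 37_108},
}
DOWNLOAD_BYTES = 183_963_479
TOTAL_BYTES = 547_449_614

total_samples = sum(split["samples"] for split in SPLITS.values())
total_split_bytes = sum(split["bytes"] for split in SPLITS.values())

# The per-split sizes should add up to the reported total size.
assert total_split_bytes == TOTAL_BYTES

for name, split in SPLITS.items():
    share = split["samples"] / total_samples
    avg_bytes = split["bytes"] / split["samples"]
    print(f"{name}: {share:.1%} of samples, about {avg_bytes:,.0f} bytes per sample")

print(f"download size / total size: {DOWNLOAD_BYTES / TOTAL_BYTES:.1%}")
```

The split works out to roughly 80/20, each sample averages just under 3 KB, and the download is about a third of the total size, presumably because the hosted files are compressed.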

Topics

Academic Papers
Research Methods

Source

Organization: hugging_face

Created: Unknown
