Dataset asset, Open Source Community, Legal Research, Legal Case Summaries

CaseSumm

The CaseSumm dataset comprises U.S. Supreme Court cases from 1815 to 2019 along with their official syllabi, which are summaries of majority opinions written by court‑hired attorneys and approved by the Justices. The syllabi serve as gold‑standard summaries for evaluating other summarization methods. Case opinions are sourced from the Public.Resource.Org archive, while the syllabi are extracted from the United States Reports and official opinions hosted by the Library of Congress. The dataset is released under a CC BY‑NC 4.0 license for use by the research community.

Source: Hugging Face
Created: Nov 9, 2024
Updated: Nov 9, 2024

CaseSumm Dataset Overview

Basic Information

  • Dataset Name: CaseSumm
  • License: CC BY‑NC 3.0
  • Task Category: Summarization
  • Language: English
  • Tags: Law

Dataset Description

CaseSumm contains U.S. Supreme Court cases from 1815 to 2019 together with their official syllabi (summaries of majority opinions). These syllabi, drafted by court‑hired attorneys and approved by the Justices, are treated as gold‑standard reference summaries against which other summarization approaches can be evaluated.
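
To make the "gold‑standard" role concrete, the following is a minimal sketch of how a candidate summary might be scored against a syllabus using unigram overlap (a simplified ROUGE‑1‑style F1). It is an illustration under stated assumptions, not the evaluation procedure used by the dataset authors: the function names `tokenize` and `rouge1_f1` are hypothetical, and the two example texts are invented rather than drawn from CaseSumm.

```python
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a candidate summary and a reference.

    A simplified stand-in for ROUGE-1: no stemming, no stopword handling.
    """
    cand = Counter(tokenize(candidate))
    ref = Counter(tokenize(reference))
    # Counter intersection keeps the minimum count per token (clipped matches).
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical texts for illustration only; not taken from the dataset.
syllabus = "The court held that the statute does not apply to state officers."
candidate = "The court held the statute applies to federal officers."

score = rouge1_f1(candidate, syllabus)
print(f"Unigram-overlap F1: {score:.3f}")
```

In practice the syllabus text for each case would take the place of `syllabus`, and a model‑generated summary would take the place of `candidate`. Overlap metrics of this kind measure wording similarity only, so a high score does not by itself indicate that a summary is legally accurate.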

Data Sources

  • Case Opinions: Public.Resource.Org archive.
  • Official Syllabi: United States Reports and official opinions hosted by the Library of Congress.

License & Usage

The dataset is provided under a CC BY‑NC 4.0 license, which permits sharing and adaptation with attribution for non‑commercial purposes such as research.
