CaseSumm
The CaseSumm dataset comprises U.S. Supreme Court cases from 1815 to 2019 along with their official syllabi, which are summaries of majority opinions authored by court‑hired attorneys and approved by judges. The syllabi serve as gold‑standard summaries for evaluating other summary methods. Case opinions are sourced from the Public Resource Org archive, while the syllabi are extracted from the United States Reports and official opinions hosted by the Library of Congress. The dataset is released under a CC BY‑NC 4.0 license, providing rich resources for the research community.
Dataset description and usage context
CaseSumm Dataset Overview
Basic Information
- Dataset Name: CaseSumm
- License: CC BY‑NC 3.0
- Task Category: Summarization
- Language: English
- Tags: Law
Dataset Description
CaseSumm contains U.S. Supreme Court cases from 1815 to 2019 together with their official syllabi (summaries of majority opinions). These syllabi, drafted by court‑hired attorneys and approved by judges, are considered gold‑standard summaries for evaluating other summarization approaches.
Data Sources
- Case Opinions: Public Resource Org archive.
- Official Syllabi: United States Reports and official opinions hosted by the Library of Congress.
License & Usage
The dataset is provided under a CC BY‑NC 4.0 license, offering a valuable resource for the research community.
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.