Explore high-quality datasets for your AI and machine learning projects.
The CaseSumm dataset comprises U.S. Supreme Court cases from 1815 to 2019 along with their official syllabi, which are summaries of majority opinions authored by court‑hired attorneys and approved by judges. The syllabi serve as gold‑standard summaries for evaluating other summary methods. Case opinions are sourced from the Public Resource Org archive, while the syllabi are extracted from the United States Reports and official opinions hosted by the Library of Congress. The dataset is released under a CC BY‑NC 4.0 license, providing rich resources for the research community.