Dataset asset, Open Source Community, Political Analysis, US Congress

c4lliope/us-congress

This dataset provides detailed legislative data from the US Congress, including key information about bills: title, summaries, plain text, sponsor, actions, amendments, committees, cosponsors, index, related bills, subjects, text versions, and titles. Each feature is listed with its name and data type, and several features are nested structures or lists. The dataset defines a training split of 6,433 examples (42,798,980 bytes).

Source: hugging_face
Created: Nov 28, 2025
Updated: Jun 6, 2023
Signals: 179 views
Availability: Linked source ready

Dataset Overview

Dataset Name

"us-congress"

Dataset Structure

Main Features

  • key: string
  • title: string
  • summaries: structured, includes sub‑features
    • pagination: structured, contains count (int64)
    • request: structured, includes fields like billNumber, billType, billUrl, congress, contentType, format (all strings)
    • summaries: list of records with actionDate, actionDesc, text, updateDate, versionCode (all strings)
  • plaintext: string
  • sponsor: string
  • actions: structured, includes sub‑features
    • actions: list with fields such as actionCode, actionDate, actionTime, calendarNumber (structured), committees (list), recordedVotes (list), sourceSystem (structured), text, type (strings)
    • pagination: count (int64)
    • request: same as above
  • amendments: structured, includes sub‑features
    • amendments: list with fields like congress, description, latestAction (structured), number, purpose, type, updateDate, url (strings)
    • pagination: count (int64)
    • request: same as above
  • committees: structured, includes sub‑features
    • committees: list with activities, chamber, name, subcommittees, systemCode, type, url (lists/strings)
    • request: same as above
  • cosponsors: structured, includes sub‑features
    • cosponsors: list with bioguideId, district, firstName, fullName, isOriginalCosponsor, lastName, middleName, party, sponsorshipDate, sponsorshipWithdrawnDate, state, url (strings)
    • pagination: count and countIncludingWithdrawnCosponsors (int64)
    • request: same as above
  • index: structured, includes sub‑features
    • bill: structured, contains many fields such as actions, amendments, cosponsors, introducedDate, latestAction, number, originChamber, policyArea, relatedBills, sponsors, subjects, summaries, textVersions, title, titles, type, updateDate, updateDateIncludingText (mixed structured/string)
    • request: same as above
  • relatedbills: structured, includes sub‑features
    • pagination: count (int64)
    • relatedBills: list with congress, latestAction (structured), number, relationshipDetails (list), title, type, url (strings)
    • request: same as above
  • subjects: structured, includes sub‑features
    • pagination: count (int64)
    • request: same as above
    • subjects: structured, includes legislativeSubjects (list) and policyArea (structured)
  • text: structured, includes sub‑features
    • pagination: count (int64)
    • request: same as above
    • textVersions: list with date, formats (list), type (strings)
  • titles: structured, includes sub‑features
    • pagination: count (int64)
    • request: same as above
    • titles: list with billTextVersionCode, billTextVersionName, chamberCode, chamberName, title, titleType (strings)
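
The nesting in the feature list can be easier to follow with a small example. The following is a minimal sketch, not the dataset author's code. It assumes the repository id c4lliope/us-congress shown above, a split named "train", and that each "list of records" arrives as a list of dictionaries. The helper names and the example record are hypothetical, and the record's values are invented for illustration.

```python
from collections import Counter


def load_train_split():
    """Load the training split from the Hugging Face Hub.

    Needs the `datasets` package and network access. The split name
    "train" is an assumption based on the listing.
    """
    from datasets import load_dataset

    return load_dataset("c4lliope/us-congress", split="train")


def summary_texts(record):
    """Return the text of every summary nested under record["summaries"]."""
    block = record.get("summaries") or {}
    return [item["text"] for item in block.get("summaries") or []]


def cosponsor_party_counts(record):
    """Count a bill's cosponsors by party."""
    block = record.get("cosponsors") or {}
    return Counter(item["party"] for item in block.get("cosponsors") or [])


# Hypothetical record: field names follow the feature list above,
# values are invented for illustration.
example = {
    "key": "hypothetical-key",
    "title": "An example bill",
    "summaries": {
        "pagination": {"count": 1},
        "summaries": [{"actionDate": "2023-01-01", "text": "Example summary."}],
    },
    "cosponsors": {
        "pagination": {"count": 2, "countIncludingWithdrawnCosponsors": 2},
        "cosponsors": [{"party": "D"}, {"party": "R"}],
    },
}

print(summary_texts(example))
print(cosponsor_party_counts(example))
```

Note that the outer feature and the inner list often share a name (summaries.summaries, cosponsors.cosponsors), so each helper reaches through two levels before it gets to the records.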

Dataset Size

  • Download Size: 6,439,766 bytes
  • Dataset Size: 42,798,980 bytes
  • Training Set Size: 42,798,980 bytes
  • Number of Training Examples: 6,433
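
As a quick sanity check on these figures, the short sketch below derives the average example size and the ratio between the dataset size and the download size. The variable names are illustrative. The gap between the two sizes presumably reflects compression of the downloaded files, though the listing does not say so.

```python
# Figures copied from the "Dataset Size" list above.
download_bytes = 6_439_766
dataset_bytes = 42_798_980
num_examples = 6_433

# Average size of one training example.
bytes_per_example = dataset_bytes / num_examples

# How much larger the dataset is than the download.
expansion_ratio = dataset_bytes / download_bytes

print(f"{bytes_per_example:.0f} bytes per example on average")
print(f"dataset is {expansion_ratio:.2f}x the download size")
```

This works out to roughly 6.7 KB per example, with the dataset about 6.6 times the size of the download.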