Dataset asset · Open Source Community · Olympic Games · Sports Performance Analysis
Olympics-Dataset
This data comes from olympedia.org and was scraped with the Python Beautiful Soup library (see scrape_data.py). athletes/bios.csv contains the raw biographical information on each athlete; results/results.csv contains a row-by-row breakdown of each event athletes competed in and their results in that event. Note: while scraping this dataset, temporary CSV files were created to checkpoint progress. For simplicity, these checkpoint files have since been removed from the repository.
Source
github
Created
Apr 2, 2024
Updated
Apr 20, 2024
Signals
816 views
Availability
Linked source ready
Overview
Dataset description and usage context
Olympics‑Dataset Overview
Dataset Content
- Athlete Information: The raw biographical information for each athlete, located at athletes/bios.csv.
- Competition Results: Detailed records of each athlete's participation and outcome in each event, located at results/results.csv.
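As a minimal sketch of inspecting the two files, the snippet below reads each CSV with Python's standard `csv` module and reports its header and row count. It assumes the source repository has been cloned into the current directory and that the files are UTF-8 encoded; the column names are not documented on this page, so the code reads them from the header row rather than assuming any. The helper names `load_rows` and `describe` are illustrative only.

```python
import csv
from pathlib import Path


def load_rows(csv_path):
    """Read a CSV file into a list of dicts keyed by the header row."""
    # Assumption: the file is UTF-8 encoded.
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def describe(csv_path):
    """Return (column_names, row_count) for a CSV file."""
    rows = load_rows(csv_path)
    columns = list(rows[0].keys()) if rows else []
    return columns, len(rows)


if __name__ == "__main__":
    # Paths as listed in the dataset description, relative to the repository root.
    for path in ("athletes/bios.csv", "results/results.csv"):
        if Path(path).exists():
            columns, count = describe(path)
            print(f"{path}: {count} rows, columns = {columns}")
        else:
            print(f"{path}: not found (clone the source repository first)")
```

Checking the header first is a cheap way to confirm which fields are available before joining athletes to their results.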
Data Source and Collection Method
- Data sourced from olympedia.org.
- Web scraped using Python's Beautiful Soup library; the script is scrape_data.py.
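The description mentions that scraping progress was checkpointed to temporary CSV files. The sketch below shows one way such checkpointing could work, using only the standard library. It is not the code in scrape_data.py: the function names, the `id` column, and the `fetch_record` callable (which stands in for whatever downloads and parses one page) are all hypothetical.

```python
import csv
from pathlib import Path


def load_done_ids(checkpoint_path):
    """Return the set of ids already saved in the checkpoint file."""
    path = Path(checkpoint_path)
    if not path.exists():
        return set()
    with path.open(newline="", encoding="utf-8") as handle:
        return {row["id"] for row in csv.DictReader(handle)}


def scrape_with_checkpoint(ids, fetch_record, checkpoint_path, fieldnames):
    """Fetch each id not yet in the checkpoint, appending rows as we go.

    fetch_record is a hypothetical callable mapping an id to a dict whose
    keys match fieldnames (which must include "id").
    """
    done = load_done_ids(checkpoint_path)
    path = Path(checkpoint_path)
    write_header = not path.exists() or path.stat().st_size == 0
    with path.open("a", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        if write_header:
            writer.writeheader()
        for item_id in ids:
            if str(item_id) in done:
                continue  # already scraped in an earlier run
            writer.writerow(fetch_record(item_id))
            handle.flush()  # keep progress on disk if the run is interrupted
    return load_done_ids(checkpoint_path)
```

Rerunning the function with the same checkpoint path skips every id already on disk, so an interrupted scrape can resume without refetching earlier pages.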
Dataset Updates
- The data covers athletes and results from the Summer and Winter Olympic Games from 1896 to 2022; 2024 data is planned to be added after the Paris 2024 Olympics.