Olympics-Dataset
This data comes from olympedia.org and was web scraped with the Python Beautiful Soup library (see scrape_data.py). athletes/bios.csv contains the raw biographical information on each athlete. results/results.csv contains a row-by-row breakdown of each event athletes competed in and their results in that event. Note: while scraping this dataset, temporary CSV files were created to checkpoint scraping progress. For simplicity, these checkpoint files have since been removed from the repository.
Description
Olympics‑Dataset Overview
Dataset Content
- Athlete Information: Contains the original biographical information of each athlete, located at athletes/bios.csv.
- Competition Results: Detailed records of each athlete's participation and outcomes for each event, located at results/results.csv.
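A quick way to see what each file holds is to read its header row and count its records. The sketch below uses only the Python standard library. The helper name `summarize` is hypothetical, and UTF-8 encoding with a header row is an assumption, since the column layout is not documented here.

```python
import csv


def summarize(path):
    """Return (row_count, column_names) for a CSV file with a header row.

    Assumes the file is UTF-8 encoded; adjust `encoding` if it is not.
    """
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        rows = list(reader)  # reading the rows also populates reader.fieldnames
        return len(rows), list(reader.fieldnames or [])
```

Run from the repository root, `summarize("athletes/bios.csv")` and `summarize("results/results.csv")` would report the size and columns of each file, which is worth checking before relying on any particular column name.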
Data Source and Collection Method
- Data sourced from olympedia.org.
- Web scraped using Python's Beautiful Soup library; the specific script can be found in scrape_data.py.
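The README notes that scraping progress was checkpointed to temporary CSV files. The sketch below illustrates one way such a checkpoint could work. It is not the logic in scrape_data.py. The function `scrape_with_checkpoint`, the `fetch_row` callback, and the convention that the first column is a unique id are all assumptions made for illustration.

```python
import csv
from pathlib import Path


def scrape_with_checkpoint(ids, fetch_row, checkpoint_path, fieldnames):
    """Fetch one row per id, appending each to a CSV so a rerun can resume.

    `fetch_row(id)` is a hypothetical callback returning a dict keyed by
    `fieldnames`; the first field is assumed to hold the unique id.
    Returns the number of rows fetched during this call.
    """
    path = Path(checkpoint_path)
    key = fieldnames[0]

    # Collect ids already saved by an earlier, interrupted run.
    done = set()
    if path.exists():
        with path.open(newline="", encoding="utf-8") as f:
            done = {row[key] for row in csv.DictReader(f)}

    # Only write the header when starting a fresh checkpoint file.
    write_header = not path.exists() or path.stat().st_size == 0

    fetched = 0
    with path.open("a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        if write_header:
            writer.writeheader()
        for item_id in ids:
            if str(item_id) in done:
                continue  # skip work that is already checkpointed
            writer.writerow(fetch_row(item_id))
            f.flush()  # push the row to the OS before fetching the next one
            fetched += 1
    return fetched
```

Appending one row at a time means an interrupted run loses at most the request in flight, at the cost of rereading the checkpoint file on each restart.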
Dataset Updates
- The data covers athletes and results from the Summer and Winter Olympic Games from 1896 to 2022. Data for 2024 is planned to be added after the Paris 2024 Olympics.
Source
Organization: github
Created: 4/2/2024