JUHE API Marketplace
DATASET
Open Source Community

osv5m/osv5m

OpenStreetView‑5M is the first large‑scale open street‑view image geolocation benchmark. It provides a global visual geolocation challenge, allowing users to experience the difficulty of the benchmark through the supplied demo. The dataset contains training and test splits, downloadable via the Hugging Face Hub.

Updated 4/27/2024
hugging_face

Description

OpenStreetView‑5M

Dataset Overview

OpenStreetView‑5M is a large‑scale open street‑view image geolocation benchmark dataset.

Structure

  • Config Name: default
  • Data Files:
    • Training Set:
      • File path: "train.csv"
      • Image directory: "images/train"
    • Test Set:
      • File path: "test.csv"
      • Image directory: "images/test"

Download

Full Dataset

from huggingface_hub import snapshot_download
snapshot_download(repo_id="osv5m/osv5m", local_dir="datasets/osv5m", repo_type=dataset)

Extract

import os, zipfile
for root, dirs, files in os.walk("datasets/osv5m"):
    for file in files:
        if file.endswith(".zip"):
            with zipfile.ZipFile(os.path.join(root, file), r) as zip_ref:
                zip_ref.extractall(root)
                os.remove(os.path.join(root, file))

Load Directly

from datasets import load_dataset
dataset = load_dataset(osv5m/osv5m, full=False)

The full flag indicates whether to load the complete metadata (default False).

Download Test Set Only

from huggingface_hub import hf_hub_download
for i in range(5):
    hf_hub_download(repo_id="osv5m/osv5m", filename=str(i).zfill(2)+.zip, subfolder="images/test", repo_type=dataset, local_dir="datasets/OpenWorld")
    hf_hub_download(repo_id="osv5m/osv5m", filename="README.md", repo_type=dataset, local_dir="datasets/OpenWorld")

Citation

@article{osv5m,
    title = {{OpenStreetView‑5M}: {T}he Many Roads to Global Visual Geolocation},
    author = {Astruc, Guillaume and Dufour, Nicolas and Siglidis, Ioannis and Aronssohn, Constantin and Bouia, Nacim and Fu, Stephanie and Loiseau, Romain and Nguyen, Van Nguyen and Raude, Charles and Vincent, Elliot and Xu, Lintao and Zhou, Hongyu and Landrieu, Loic},
    journal = {CVPR},
    year = {2024},
}

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Street View Images
Visual Geolocation

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.