JUHE API Marketplace
DATASET
Open Source Community

sagecontinuum/solarirradiancedataset

This dataset is used to estimate solar irradiance from ground‑level images to support solar power generation, weather forecasting, climate‑change research, and smart‑home management. The data consist of images captured by the top camera of a Sage Waggle Node and corresponding solar irradiance values recorded by a tower at Argonne National Laboratory. During preprocessing, a CSV file was created to match images with their irradiance values, and nighttime photos were excluded; only summer photos were retained. During training, images were resized to 224 × 224 and augmented with random flips and rotations to improve model generalization. ResNet‑50 achieved the best performance with a mean absolute error of 82. Future work includes adding more training data to reduce error, improving estimation under thin clouds and during sunrise/sunset, and implementing pattern‑based irradiance prediction. ### Dataset Overview ## Dataset Information - **Features**: - `image`: image data, type `image` - `irradiance`: irradiance data, type `float32` - **Splits**: - `full`: complete dataset with 1 000 samples, total size 13,466,250 bytes - **Download Size**: 14,234,112 bytes - **Dataset Size**: 13,466,250 bytes - **Labels**: - `climate` - **License**: MIT ## Pre‑processing - A CSV file was generated to pair each image with its corresponding solar irradiance value. - Images were taken from the top camera of a Sage Waggle Node, while irradiance values came from Argonne National Laboratory tower measurements. - Nighttime photos were filtered out; only summer photos were kept, and original 2,000 × 2,000‑pixel images were downscaled to 500 × 500. ## Training and Model - Images were resized to 224 × 224 before conversion to tensors, with random flips and rotations applied for augmentation. - Pre‑trained ResNet and VGG‑16 models were compared, with the final fully‑connected layer replaced to output a continuous value. - ResNet‑50 yielded the best results with a mean absolute error of 82. ## Future Directions - Increase training data to lower MAE. - Address errors caused by thin cloud layers. - Improve irradiance prediction during sunrise and sunset. - Develop pattern‑based irradiance level prediction.

Updated 9/11/2023
hugging_face

Description

数据集概述

数据集信息

  • 特征:
    • image: 图像数据,类型为 image
    • irradiance: 辐照度数据,类型为 float32
  • 分割:
    • full: 完整数据集,包含 1000 个样本,总大小为 13466250 字节
  • 下载大小: 14234112 字节
  • 数据集大小: 13466250 字节
  • 标签:
    • climate
  • 许可证: MIT

数据预处理

  • 创建了一个 CSV 文件,将图像与其对应的太阳辐照度值进行匹配。
  • 图像来自 Sage Waggle Node 的顶部相机,辐照度值来自 Argonne 国家实验室的塔读数。
  • 排除了夜间照片,仅使用夏季照片,并将原始 2000x2000 像素的图像缩小到 500x500 像素。

训练和模型

  • 图像在转换为张量之前被调整大小到 224x224 像素,并随机翻转和旋转以提高模型的泛化能力。
  • 比较了预训练的 ResNet 模型和 VGG-16 模型,替换了最后一层全连接层以输出连续值。
  • 发现 ResNet 50 模型表现最佳,平均绝对误差为 82。

未来方向

  • 增加训练数据以降低平均绝对误差(MAE)。
  • 解决薄云层导致的模型误差问题。
  • 改进日出和日落时的辐照度值预测。
  • 实现基于收集数据模式的太阳辐照度水平预测功能。

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Solar Irradiance
Image Regression

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.