JUHE API Marketplace
DATASET
Open Source Community

nimashoghi/wbm

The dataset contains multiple material‑science features such as chemical formula, number of sites, volume, energy, band gap, etc., which can be used for material property research and prediction. It includes 256,963 samples (total size 725 MB, download size 156 MB).

Updated 7/21/2024
hugging_face

Description

Dataset Overview

Dataset Features

  • formula: chemical formula (string).
  • n_sites: number of sites (float).
  • volume: volume (float).
  • uncorrected_energy: uncorrected energy (float).
  • e_form_per_atom_wbm: formation energy per atom (WBM) (float).
  • e_above_hull_wbm: energy above hull (WBM) (float).
  • bandgap_pbe: PBE band gap (float).
  • wyckoff_spglib_initial_structure: Wyckoff symbol of the initial structure (string).
  • uncorrected_energy_from_cse: uncorrected energy from CSE (float).
  • e_correction_per_atom_mp2020: MP2020 per‑atom correction energy (float).
  • e_correction_per_atom_mp_legacy: MP legacy per‑atom correction energy (float).
  • e_form_per_atom_uncorrected: uncorrected formation energy per atom (float).
  • e_form_per_atom_mp2020_corrected: MP2020 corrected formation energy per atom (float).
  • e_above_hull_mp2020_corrected_ppd_mp: MP2020 corrected energy above hull (float).
  • site_stats_fingerprint_init_final_norm_diff: normalized fingerprint difference between initial and final structures (float).
  • wyckoff_spglib: Wyckoff symbol (string).
  • unique_prototype: boolean indicating a unique prototype.
  • formula_from_cse: chemical formula from CSE (string).
  • initial_structure: nested object describing the initial crystal structure (class, module, charge, lattice parameters, sites, etc.).
  • id: identifier (string).
  • material_id: material identifier (string).
  • frac_pos, cart_pos, pos, cell: positional arrays (float).
  • num_atoms: number of atoms (integer).
  • atomic_numbers: atomic numbers (integer array).
  • composition: composition (integer array).

Dataset Split

  • all: 256,963 samples, total size 725 MB.

Size

  • Download size: 156 MB.
  • Dataset size: 725 MB.

Configuration

  • default: Includes all data files under data/all-*.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Materials Science
Crystal Structure

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.