Back to datasets
Dataset assetOpen Source CommunityAutomotive DesignEngineering Simulation

DrivAerNet dataset

The DrivAerNet dataset includes a parametric model of the DrivAer fastback, developed using ANSA® software to enable extensive exploration of automotive design variations. This model is defined by 50 geometric parameters, allowing the generation of 4000 unique car designs through Optimal Latin Hypercube sampling and the Enhanced Stochastic Evolutionary Algorithm.

Source
github
Created
Mar 5, 2024
Updated
Mar 16, 2024
Signals
580 views
Availability
Linked source ready
Overview

Dataset description and usage context

DrivAerNet++ Dataset Overview

Dataset Description

  • Scale and Content: DrivAerNet++ is the largest multimodal automotive aerodynamic design dataset, containing 8,000 diverse car designs modeled via high‑fidelity computational fluid dynamics (CFD) simulations.
  • Design Diversity: The dataset covers various body types such as fastback, hatchback, and wagon, as well as different chassis and wheel designs.

Design Parameters

  • Parameter Selection: The generation of the dataset considered multiple geometric parameters that significantly affect aerodynamics, and their ranges were set to avoid values that are difficult to manufacture or aesthetically unpleasing.
  • Parametric Model: Each 3D car geometry is fully described by 26 parameters, using two primary deformation methods: deformation boxes and direct deformation.

Dataset Content

  • CFD simulation data: High‑fidelity CFD simulation data for each car design, including 3D flow fields.
  • 3D car mesh: Detailed 3D meshes suitable for various machine learning applications.
  • Parametric model: Parameter tables enabling extensive exploration of automotive design variables.
  • Aerodynamic coefficients: Key metrics such as drag coefficient (Cd), lift coefficient (Cl), etc.
  • Segmentation parts: Segmented components of the car model for classification tasks.
  • Point cloud data: Point clouds for each car design.

Application Domains

  • Data‑driven design optimization
  • Generative AI
  • Surrogate model training
  • CFD simulation acceleration
  • Geometry classification

Computational Resources

  • Computational Cost: CFD simulations were performed on MIT Supercloud using 60 nodes (total 2,880 CPU cores), with each CFD case using 256 cores and 1,000 GB memory.
  • Storage Requirements: The full dataset requires 39 TB of storage.
  • Computation Time: Simulations consumed approximately 3 × 10⁶ CPU‑hours.
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio