JUHE API Marketplace
DATASET
Open Source Community

LongVALE

LongVALE: Time-Aware Omni-Modal Perception Benchmark for Vision-Audio-Language-Event in Long-duration Videos

Updated 12/6/2024
github

Description

LongVALE

Dataset Overview

  • Name: LongVALE
  • Full Name: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos
  • Description: This dataset aims to provide a time-aware omni-modal perception benchmark for long-duration videos, covering visual, audio, language, and event modalities.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Long Video Analysis
Multimodal Perception

Source

Organization: github

Created: 12/6/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.