DATASET
Open Source Community
LongVALE
LongVALE: Time-Aware Omni-Modal Perception Benchmark for Vision-Audio-Language-Event in Long-duration Videos
Updated 12/6/2024
github
Description
LongVALE
Dataset Overview
- Name: LongVALE
- Full Name: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos
- Description: This dataset aims to provide a time-aware omni-modal perception benchmark for long-duration videos, covering visual, audio, language, and event modalities.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Access Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Long Video Analysis
Multimodal Perception
Source
Organization: github
Created: 12/6/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.