Movie Knowledge Graph Dataset
This is a movie knowledge‑graph dataset prepared for NebulaGraph, sourced from OMDB and MovieLens, intended for movie recommendation systems.
Updated 4/29/2024
Description
Dataset Overview
Data Sources
- Actor and movie genre data: sourced from OMDB.
- User‑movie interaction records: sourced from MovieLens.
Dataset Structure
- Vertex Types:
  - User (user_id)
  - Movie (name)
  - Person (name, birthdate)
  - Genre (name)
- Edge Types:
  - watched (rate(double))
  - belongs_to_genre
  - directed_by
  - acted_in
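The structure above can be written down as data and rendered into nGQL schema statements. The following is a minimal sketch, not the dataset's actual DDL: the lowercase tag names, the string type chosen for user_id, name and birthdate, and the helper names SchemaItem, to_ngql and render_schema are all assumptions made for illustration. Only the rate property's double type comes from the description.

```python
from dataclasses import dataclass, field


@dataclass
class SchemaItem:
    kind: str                      # "TAG" for vertex types, "EDGE" for edge types
    name: str
    props: dict = field(default_factory=dict)  # property name -> nGQL type


# Property types other than rate(double) are assumptions, not taken from the dataset.
SCHEMA = [
    SchemaItem("TAG", "user", {"user_id": "string"}),
    SchemaItem("TAG", "movie", {"name": "string"}),
    SchemaItem("TAG", "person", {"name": "string", "birthdate": "string"}),
    SchemaItem("TAG", "genre", {"name": "string"}),
    SchemaItem("EDGE", "watched", {"rate": "double"}),
    SchemaItem("EDGE", "belongs_to_genre"),
    SchemaItem("EDGE", "directed_by"),
    SchemaItem("EDGE", "acted_in"),
]


def to_ngql(item: SchemaItem) -> str:
    """Render one CREATE TAG / CREATE EDGE statement, backtick-quoting identifiers."""
    cols = ", ".join(f"`{name}` {typ}" for name, typ in item.props.items())
    return f"CREATE {item.kind} IF NOT EXISTS `{item.name}`({cols});"


def render_schema(schema=SCHEMA) -> str:
    """Render the whole schema, one statement per line."""
    return "\n".join(to_ngql(item) for item in schema)


if __name__ == "__main__":
    print(render_schema())
```

Identifiers are backtick-quoted so that names such as user cannot collide with nGQL keywords.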
Data Processing Workflow
- Raw data organization
- Load the data into a data warehouse (Postgres)
- Transform data into a format suitable for property‑graph models (dbt) and export as CSV
- Load CSV files into NebulaGraph (Nebula‑Importer)
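The transform step in this workflow is done with dbt. As a rough stand-in that only shows the shape of the output, the sketch below turns MovieLens-style rating rows into rows for the watched edge. The u_ and m_ vertex ID prefixes, the function name ratings_to_watched_edges, and the sample rows are hypothetical; the input header follows the MovieLens ratings.csv layout (userId, movieId, rating, timestamp).

```python
import csv
import io


def ratings_to_watched_edges(ratings_csv: str) -> str:
    """Convert rating rows into edge rows: source vertex, destination vertex, rate."""
    reader = csv.DictReader(io.StringIO(ratings_csv))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in reader:
        src = f"u_{row['userId']}"    # assumed vertex ID scheme for User
        dst = f"m_{row['movieId']}"   # assumed vertex ID scheme for Movie
        writer.writerow([src, dst, float(row["rating"])])
    return out.getvalue()


# Made-up sample rows for illustration only; not real MovieLens data.
SAMPLE = "userId,movieId,rating,timestamp\n1,10,4.5,0\n2,10,3.0,0\n"

if __name__ == "__main__":
    print(ratings_to_watched_edges(SAMPLE), end="")
```

The timestamp column is dropped because the watched edge in this dataset carries only the rate property.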
Dataset Usage
- The dataset is used to build a movie knowledge graph in the NebulaGraph graph database.
- For detailed usage, refer to this link.
Dataset Schema Mapping
- The schema mapping details how the two tabular data sources are mapped to NebulaGraph's property‑graph model.
Dataset Validation
- After importing into NebulaGraph, run SUBMIT JOB STATS; and, once the job finishes, SHOW STATS; to check that the vertex and edge counts match the source data and that everything loaded correctly.
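One way to make this check repeatable is to compare the statistics rows against the counts expected from the source tables. The sketch below assumes the rows have already been fetched as (Type, Name, Count) tuples, which is the column layout SHOW STATS reports. The function name find_count_mismatches and every number shown are placeholders, not real counts from this dataset.

```python
def find_count_mismatches(stats_rows, expected):
    """Return {(type, name): (expected, actual)} for every count that differs.

    stats_rows: iterable of (type, name, count) tuples from SHOW STATS.
    expected:   dict mapping (type, name) to the count expected from the source data.
    A key missing from stats_rows is reported with an actual value of None.
    """
    actual = {(typ, name): count for typ, name, count in stats_rows}
    mismatches = {}
    for key, want in expected.items():
        got = actual.get(key)
        if got != want:
            mismatches[key] = (want, got)
    return mismatches


if __name__ == "__main__":
    # Placeholder numbers for illustration only.
    rows = [("Tag", "genre", 3), ("Edge", "watched", 10), ("Space", "vertices", 20)]
    expected = {("Tag", "genre"): 3, ("Edge", "watched"): 12, ("Edge", "acted_in"): 5}
    for key, (want, got) in find_count_mismatches(rows, expected).items():
        print(key, "expected", want, "got", got)
```

An empty result means every expected count was found with the expected value.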
Topics
Movie Recommendation
Knowledge Graph
Source
Organization: github
Created: 11/6/2022