DATASET

Open Source Community

imdb-5000-movie-dataset

This dataset contains over 5,000 randomly selected movie records from IMDb, with 28 attributes per record.

Updated 6/23/2023

Description

Dataset Overview

Dataset Name

Name: imdb-5000-movie-dataset
Source: Kaggle

Dataset Content

Record Count: Over 5,000
Attribute Count: 28
File Format: CSV
File Name: movie_metadata.csv

Data Processing

Cleaning: The dataset was cleaned for analysis and visualization purposes.
Analysis:
- linechart.py: Cleans and analyzes director_name, genres, title_year, and imdb_score, and counts the movies released between 1916 and 2016.
- histogram.py: Cleans and analyzes title_year, num_critic_for_reviews, num_user_for_reviews, and director_facebook_likes, and counts review frequencies and director Facebook likes.
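
The counting step described for linechart.py could look roughly like the sketch below. It is not the repository's actual code: the function names count_movies_per_year and load_rows are hypothetical, and the cleaning rule (skip rows whose title_year is blank or not numeric) is an assumption. Only the column name title_year, the file name movie_metadata.csv, and the 1916 to 2016 range come from the description above.

```python
import csv
import io
from collections import Counter


def count_movies_per_year(rows, first_year=1916, last_year=2016):
    """Count rows per title_year, skipping blank or non-numeric years.

    `rows` is any iterable of dicts keyed by column name (hypothetical helper,
    not part of the original scripts).
    """
    counts = Counter()
    for row in rows:
        raw = (row.get("title_year") or "").strip()
        if not raw:
            continue  # assumed cleaning rule: drop rows with no year
        try:
            year = int(float(raw))  # accepts "2009" as well as "2009.0"
        except (ValueError, OverflowError):
            continue  # assumed cleaning rule: drop unparseable years
        if first_year <= year <= last_year:
            counts[year] += 1
    return dict(sorted(counts.items()))


def load_rows(path="movie_metadata.csv"):
    """Read the CSV into a list of dicts (hypothetical helper)."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


# Tiny inline sample so the sketch runs without the real file.
sample_csv = io.StringIO(
    "director_name,title_year,imdb_score\n"
    "A,2009,7.9\n"
    "B,,6.1\n"
    "C,2009,5.5\n"
    "D,1915,6.0\n"
)
print(count_movies_per_year(csv.DictReader(sample_csv)))
```

With the real file, the same function would be called as count_movies_per_year(load_rows()), assuming movie_metadata.csv is in the working directory.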

Visualization

Tool: matplotlib.pyplot
Output Files:
- linechart.py:
  - linechart.png
  - linechart1.png
  - linechart2.png
  - linechart3.png
  - linechart4.png
- histogram.py:
  - histogram.png
  - histogram1.png
  - histogram2.png
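
As an illustration of how PNG outputs like those listed above might be produced with matplotlib.pyplot, here is a minimal sketch. The function names, axis labels, bin count, and output file names are all assumptions for illustration and are not taken from linechart.py or histogram.py.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt


def plot_movies_per_year(counts, out_path):
    """Draw a line chart from a {year: movie_count} mapping and save it as PNG."""
    years = sorted(counts)
    totals = [counts[year] for year in years]
    fig, ax = plt.subplots(figsize=(8, 4))
    ax.plot(years, totals, marker="o")
    ax.set_xlabel("title_year")
    ax.set_ylabel("Number of movies")
    ax.set_title("Movies released per year")
    fig.tight_layout()
    fig.savefig(out_path)
    plt.close(fig)  # release the figure's memory
    return out_path


def plot_histogram(values, out_path, column_name, bins=20):
    """Draw a histogram of one numeric column and save it as PNG."""
    fig, ax = plt.subplots(figsize=(8, 4))
    ax.hist(values, bins=bins)  # bin count is an arbitrary illustrative choice
    ax.set_xlabel(column_name)
    ax.set_ylabel("Frequency")
    ax.set_title("Distribution of " + column_name)
    fig.tight_layout()
    fig.savefig(out_path)
    plt.close(fig)
    return out_path
```

For example, plot_histogram(review_counts, "histogram_sketch.png", "num_critic_for_reviews") would write one chart, where review_counts is a list of numbers already cleaned from that column.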

Topics

Movie Data

IMDB

Source

Organization: github

Created: 12/31/2016
