imdb-5000-movie-dataset

This dataset contains over 5,000 randomly selected movie records from IMDB, with 28 attributes for each record.

Updated 6/23/2023
github

Description

Dataset Overview

Dataset Name

  • Name: imdb-5000-movie-dataset
  • Source: Kaggle

Dataset Content

  • Record Count: Over 5,000
  • Attribute Count: 28
  • File Format: CSV
  • File Name: movie_metadata.csv
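
A minimal sketch of reading the CSV with Python's standard csv module. The helper name load_movies is hypothetical, and the in-memory sample only stands in for movie_metadata.csv; its values are made up for illustration and show just three of the 28 columns.

```python
import csv
import io


def load_movies(source):
    """Read movie records from an open text file into a list of dicts.

    Returns the rows plus the column names taken from the header line.
    """
    reader = csv.DictReader(source)
    rows = list(reader)
    return rows, reader.fieldnames


# Illustrative stand-in for movie_metadata.csv (made-up values, 3 columns only).
sample = io.StringIO(
    "director_name,title_year,imdb_score\n"
    "Director A,2009,7.9\n"
    "Director B,,6.1\n"
)
rows, columns = load_movies(sample)
print(len(rows), "records,", len(columns), "columns")

# With the real file (assumed to be in the working directory):
# with open("movie_metadata.csv", newline="", encoding="utf-8") as f:
#     rows, columns = load_movies(f)
```

On the full file, len(columns) should match the attribute count listed above. Missing values arrive as empty strings, so they need handling before any numeric analysis.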

Data Processing

  • Cleaning: The dataset was cleaned for analysis and visualization purposes.
  • Analysis:
    • linechart.py: Cleans and analyzes director_name, genres, title_year, and imdb_score, and counts the number of movies released between 1916 and 2016.
    • histogram.py: Cleans and analyzes title_year, num_critic_for_reviews, num_user_for_reviews, and director_facebook_likes, and counts review frequencies and director Facebook likes.
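
The counting step described for linechart.py might look roughly like the sketch below. This is not the repository's actual script: the function name count_movies_per_year, the per-year grouping, and the rule of skipping rows with a blank or non-numeric title_year are all assumptions made for illustration.

```python
from collections import Counter


def count_movies_per_year(rows, first_year=1916, last_year=2016):
    """Count records per release year, skipping rows without a usable year."""
    counts = Counter()
    for row in rows:
        raw = (row.get("title_year") or "").strip()
        if not raw:
            continue  # cleaning step (assumed): drop records with no year
        try:
            # Tolerate float-like strings such as "2009.0".
            year = int(float(raw))
        except (ValueError, OverflowError):
            continue  # not a number, or NaN/inf
        if first_year <= year <= last_year:
            counts[year] += 1
    return dict(sorted(counts.items()))


# Made-up rows shaped like csv.DictReader output.
sample_rows = [
    {"title_year": "2009"},
    {"title_year": "2009.0"},
    {"title_year": ""},
    {"title_year": "1916"},
    {"title_year": "2020"},
]
yearly_counts = count_movies_per_year(sample_rows)
print(yearly_counts)
```

The returned dict is sorted by year, so its keys and values can be passed directly as the x and y series of a line chart.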

Visualization

  • Tool: matplotlib.pyplot
  • Output Files:
    • linechart.py:
      • linechart.png
      • linechart1.png
      • linechart2.png
      • linechart3.png
      • linechart4.png
    • histogram.py:
      • histogram.png
      • histogram1.png
      • histogram2.png
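
As a rough sketch of how one of the histogram images could be produced with matplotlib.pyplot: the helper names numeric_column and save_histogram, the bin count, and the axis labels are hypothetical and not taken from histogram.py.

```python
import math


def numeric_column(rows, column):
    """Collect the finite numeric values of one column, skipping blanks."""
    values = []
    for row in rows:
        raw = (row.get(column) or "").strip()
        try:
            value = float(raw)
        except ValueError:
            continue  # blank or non-numeric cell
        if math.isfinite(value):
            values.append(value)
    return values


def save_histogram(values, path, xlabel, bins=20):
    """Draw a histogram of values and write it to path as an image."""
    # Imported here so numeric_column works without matplotlib installed.
    import matplotlib
    matplotlib.use("Agg")  # render to file, no display needed
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.hist(values, bins=bins)
    ax.set_xlabel(xlabel)
    ax.set_ylabel("Number of movies")
    fig.savefig(path)
    plt.close(fig)


# Made-up rows shaped like csv.DictReader output.
sample_rows = [
    {"num_critic_for_reviews": "120"},
    {"num_critic_for_reviews": ""},
    {"num_critic_for_reviews": "35"},
]
critic_reviews = numeric_column(sample_rows, "num_critic_for_reviews")

# Example call (writes histogram.png to the working directory):
# save_histogram(critic_reviews, "histogram.png", "Critic reviews per movie")
```

Filtering out blank and non-finite values first matters because a histogram cannot derive its bin range from data that contains NaN.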

Topics

Movie Data
IMDB

Source

Organization: github

Created: 12/31/2016
