MovieLens 32M and MovieLens 1B Synthetic Dataset

This project uses the MovieLens 32M and MovieLens 1B Synthetic Datasets to demonstrate an advanced recommendation system developed for a media streaming platform (inspired by Netflix). The system employs a hybrid approach that combines collaborative filtering, content‑based filtering, and graph‑based recommendation to provide personalized movie suggestions.

Updated 8/12/2024

Description

MovieLens Recommendation System

This project showcases a hybrid recommendation system for a media streaming platform (inspired by Netflix), built on the MovieLens 1B synthetic dataset. It combines collaborative filtering, content-based filtering, and graph-based recommendation to deliver personalized movie suggestions.
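One common way to combine several recommenders is a weighted blend of their per-movie scores. The sketch below illustrates that idea only; it is not taken from the project's code. The function names, the min-max normalization step, and the weights (0.5, 0.3, 0.2) are all assumptions chosen for illustration.

```python
from typing import Dict, List, Sequence

Scores = Dict[int, float]  # movie_id -> score from one recommender


def min_max_normalize(scores: Scores) -> Scores:
    """Rescale scores to [0, 1] so different recommenders are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are equal, so this recommender expresses no preference.
        return {item: 0.0 for item in scores}
    return {item: (s - lo) / (hi - lo) for item, s in scores.items()}


def hybrid_scores(
    collab: Scores,
    content: Scores,
    graph: Scores,
    weights: Sequence[float] = (0.5, 0.3, 0.2),  # hypothetical weights
) -> Scores:
    """Blend three score sets; a movie missing from a source contributes 0."""
    sources = [min_max_normalize(s) for s in (collab, content, graph)]
    items = set().union(*sources)
    return {
        item: sum(w * src.get(item, 0.0) for w, src in zip(weights, sources))
        for item in items
    }


def top_n(scores: Scores, n: int = 10) -> List[int]:
    """Return the n highest-scoring movie ids, ties broken by id."""
    return sorted(scores, key=lambda item: (-scores[item], item))[:n]


# Toy scores for one user (made-up values, not real MovieLens output).
collab = {1: 4.5, 2: 3.0, 3: 1.0}   # e.g. predicted ratings
content = {2: 0.9, 3: 0.1, 4: 0.5}  # e.g. genre similarity
graph = {1: 0.2, 4: 0.8}            # e.g. graph proximity

blended = hybrid_scores(collab, content, graph)
print(top_n(blended, n=2))  # [2, 1]
```

Normalizing before blending keeps one recommender from dominating simply because its scores live on a larger scale. In practice the weights would be tuned on held-out data rather than fixed by hand.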

Project Structure

├── LICENSE            <- Open-source license (if selected)
├── Makefile           <- Makefile with convenient commands such as make data or make train
├── README.md          <- Top-level README for developers using this project.
├── data
│   ├── external       <- Data from third-party sources.
│   ├── interim        <- Intermediate data that has been transformed.
│   ├── processed      <- Final curated dataset for modeling.
│   └── raw            <- Immutable raw data dump.
├── environment.yml    <- Environment file for reproducing the analysis environment, e.g., generated via conda env export > environment.yml
├── models             <- Trained serialized models, model predictions, or model summaries
├── notebooks          <- Jupyter notebooks. Naming convention: a number (for ordering), the author's initials, and a short hyphen-separated description, e.g., 1.0-jqp-initial-data-exploration
└── src                <- Source code for this project
    ├── __init__.py    <- Makes src a Python module
    ├── dataset.py     <- Scripts for downloading or generating data
    ├── features.py    <- Code for creating modeling features
    ├── modeling
    │   ├── __init__.py
    │   ├── predict.py <- Code for running model inference with a trained model
    │   └── train.py   <- Code for training a model
    └── plots.py       <- Code for creating visualizations
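As a rough illustration of how data placed under data/raw might be read before feature creation, the sketch below parses a ratings file into a per-user dictionary. It assumes a CSV with the header userId,movieId,rating,timestamp, as in the MovieLens ratings.csv files; the 1B synthetic dataset is distributed in a different format and would need its own loader. The function names and the example path are hypothetical and do not come from the project's dataset.py.

```python
import csv
from collections import defaultdict
from pathlib import Path
from typing import Dict, Iterable, List, Tuple

Rating = Tuple[int, int, float]  # (user_id, movie_id, rating)


def parse_ratings(lines: Iterable[str]) -> List[Rating]:
    """Parse CSV lines with a userId,movieId,rating,timestamp header."""
    reader = csv.DictReader(lines)
    return [
        (int(row["userId"]), int(row["movieId"]), float(row["rating"]))
        for row in reader
    ]


def load_ratings(path: Path) -> List[Rating]:
    """Read a ratings CSV from disk (the path layout is an assumption)."""
    with open(path, newline="", encoding="utf-8") as f:
        return parse_ratings(f)


def to_user_item(ratings: Iterable[Rating]) -> Dict[int, Dict[int, float]]:
    """Group ratings as user_id -> {movie_id: rating}."""
    matrix: Dict[int, Dict[int, float]] = defaultdict(dict)
    for user_id, movie_id, rating in ratings:
        matrix[user_id][movie_id] = rating
    return dict(matrix)


# In-memory sample so the sketch runs without downloading anything.
sample_lines = [
    "userId,movieId,rating,timestamp",
    "1,17,4.0,944249077",
    "1,25,1.0,944250228",
    "2,17,5.0,835355493",
]
user_item = to_user_item(parse_ratings(sample_lines))
print(user_item)  # {1: {17: 4.0, 25: 1.0}, 2: {17: 5.0}}
```

With a real download, a call such as load_ratings(Path("data/raw/ratings.csv")) would replace the in-memory sample; that file name is only an example. At the scale of tens of millions of ratings, a columnar or sparse-matrix representation would likely be more practical than nested dictionaries.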

Topics

Recommender Systems
Movie Recommendation

Source

Organization: github

Created: 8/5/2024
