README Documentation
🤖 AI/ML Resources
Curated collection of AI/ML resources and tools to jumpstart your artificial intelligence and machine learning journey. Explore tutorials, datasets, libraries, and more!
PRs with additional resources or suggestions are much welcome!
Resources 📕
Learning Resources 📚
- Neural Networks: Zero to Hero - From Andrej Karpathy, former Director of AI at Tesla and now at OpenAI.
- Full Stack LLM Bootcamp - From the creators of the Full Stack Deep Learning course and book.
- All about LLMs - A gist with great curated learning resources.
- Transformers - Introduction to Transformers with Andrej Karpathy.
- The Illustrated Transformer - A great visual explanation of the Transformer architecture.
- How ChatGPT really works - A great initial explanation of how ChatGPT works.
- ChatGPT Prompt Engineering for Developers! - Great course by OpenAI employees.
- Learn Prompting - Another great course (text-based) about prompting.
- Prompt Engineering Guide - Guides, papers, lectures, and resources for prompt engineering.
- Prompt Engineering - Great post from Lilian Weng, Head of Applied AI Research at OpenAI.
- MLOps Guide - A guide on MLOps.
- MLOps Zoomcamp - A great course on MLOps.
- Gandalf - A fun way to learn about prompt injection.
- Practical Deep Learning - A course designed for people with some coding experience who want to learn how to apply deep learning and machine learning to practical problems.
- Let's build GPT - From Andrej Karpathy, Let's build GPT: from scratch, in code, spelled out.
- AI Canon - A great curated list of resources to get smarter about modern AI.
- Generative AI Learning Path - This learning path guides you through curated content on Generative AI products and technologies.
- Rules of Machine Learning - Intended to help those with a basic knowledge of machine learning get the benefit of Google's best practices.
- AI Companion App - A tutorial stack to create and host AI companions that you can chat with on a browser or text via SMS.
- Microsoft AI Lab - AI Lab helps a large, fast-growing community of developers get started on AI.
- ML Course Notes - A place to collaborate and share lecture notes on all topics related to machine learning, NLP, and AI.
- Generative AI for Beginners - A 12-lesson course teaching everything you need to know to start building Generative AI applications.
- ML YouTube Courses - An index of some of the best and most recent machine learning courses available on YouTube.
- ML Papers Explained - Explanations of key concepts in machine learning.
- LLM Course - A course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- Open LLMs - A list of open LLMs available for commercial use with their context lengths and licenses.
- Awesome ChatGPT Prompts - A collection of prompt examples to be used with ChatGPT and other LLMs.
- Hugging Face Course - Learn how to apply Transformers to various tasks in natural language processing and beyond.
- Andrew Ng’s Machine Learning - A foundational course on machine learning.
- Machine Learning Roadmap - A roadmap connecting important concepts in machine learning.
- Robert Miles AI Safety - YouTube channel about AI safety.
LLMs
- OpenAI LLMs - OpenAI models.
- Hugging Face - The leading open-source AI community. Find trending models, datasets, and spaces.
- Bloom - An open-source multilingual model similar to GPT-3.
- LLaMA - A foundational large language model by Meta.
- Llama 2 - The next generation of Meta's open-source large language model.
- JARVIS - An interface for LLMs to connect numerous AI models.
- Open LLM Leaderboard - Evaluate and rank open-source LLMs.
Modern LLM Models
Commercial Models
- Claude - Anthropic's family of frontier AI systems, known for long context windows and reasoning capabilities.
- Gemini - Google DeepMind's multimodal AI system available in Ultra, Pro, and Nano sizes.
- GPT-4 - OpenAI's most advanced system, with variants including GPT-4o (omni) and GPT-4 Turbo.
Open Source Models
- Llama 3 - Meta's latest open foundation language model family, available in 8B and 70B parameters.
- Mistral - A family of open-source large language models, including Mistral 7B and Mixtral 8x7B.
- Qwen - Alibaba's advanced LLM series with strong multilingual capabilities, available in various sizes.
- DeepSeek - A powerful open-source language model trained on 2T tokens with strong coding capabilities.
- Vicuna-13B - An open-source chatbot fine-tuned from LLaMA.
- Yi - A series of large language models trained from scratch by 01.AI, available in 6B to 34B parameters.
- Phi-3 - Microsoft's small yet powerful models (3.8B and 14B) with strong reasoning capabilities.
- Falcon - Technology Innovation Institute's open-source LLM trained on 1 trillion tokens.
- Falcon 2 - Technology Innovation Institute's latest open-source LLM with enhanced capabilities.
- Orca 2 - Microsoft's smaller models that demonstrate strong reasoning capabilities.
- Gemma 2 - Google's lightweight models (9B, 27B) optimized for high-speed inference across hardware.
- Command R/R+ - Cohere's models specifically designed for RAG and long-context tasks.
- Granite - IBM's enterprise-focused models with efficient hybrid architectures.
- Phi-4 - Microsoft's 16B parameter multimodal SLM with multilingual support and image understanding.
- Qwen2.5/Qwen3 - Alibaba's updated series (0.5B-72B, 128K context, 29+ languages) with specialized variants.
- Grok-1.5 - xAI's model with vision capabilities and long context support.
- StableLM - Stability AI's language models for various scales and use cases.
- Nemotron 3 - Nvidia's hybrid architecture model for cost-effective scaling.
- Kimi K2 - Chinese model with strong performance and long context capabilities.
- ChatGLM/GLM - Tsinghua University's bilingual conversational model.
- MiniMax - Chinese multimodal model with text and image capabilities.
- Guidance - Control modern language models more effectively and efficiently.
- TheBloke on Hugging Face - Compiles the best open-source models in various formats.
- DemoGPT - Create 🦜️🔗 LangChain apps by just using prompts.
- Llama2 Web UI - Run Llama 2 with Gradio web UI on GPU or CPU from anywhere.
- llama.cpp - Run LLaMA model using 4-bit integer quantization on a MacBook and more.
- LocalAI - Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required.
- LocalAGI - A small virtual assistant you can run locally, powered by LocalAI.
- Ollama - A backend that allows you to run large language models locally.
- KoboldCpp - An easy-to-use AI text-generation software for GGML and GGUF models.
- GPT4All - Open-source large language models that run locally on your CPU and nearly any GPU.
- vLLM - A high-throughput and memory-efficient inference and serving engine for LLMs.
- ExLlama - A fast inference library for running LLMs locally on modern consumer-class GPUs.
- Stable Beluga 2 - A fine-tuned Llama2 70B model.
- Claude - An AI assistant from Anthropic with advanced reasoning and extended context.
Chat and Agents
- ChatGPT - The leading chatbot built on GPT-3.5 and GPT-4.
- Bing Chat - A conversational AI language model powered by Microsoft Bing.
- Open-Assistant - Open-source chat agent that interacts with external sources.
- Auto-GPT - An experimental open-source attempt to make GPT-4 fully autonomous.
- LoopGPT - A modular reimplementation of Auto-GPT.
- ThinkGPT - Implementing Chain-of-Thought reasoning for Large Language Models.
- Transformers Agents - Provides a natural language API on top of transformers.
- MetaGPT - The Multi-Agent Framework: Given one-line requirement, return PRD, design, tasks, repo.
- GPT-Engineer - Specify what you want it to build; the AI asks for clarification and then builds it.
- Khoj - An AI personal assistant for your digital brain.
- Danswer - Open-source enterprise question-answering.
- simpleaichat - Python package for easily interfacing with chat apps.
- RealChar - Create and chat with AI characters.
- ChatGPT AutoExpert - Supercharged Custom Instructions for ChatGPT.
- Bee Agent Framework - Framework for building scalable agentic applications.
- Local AI - A desktop app for local, private, secured AI experimentation.
- Serge - A chat interface crafted with llama.cpp for running GGUF models.
- SillyTavern - A chat UI for interacting with text generation AIs and roleplay characters.
- TavernAI - An atmospheric frontend for chat and story writing.
- Maid - A cross-platform Flutter app for interfacing with GGUF/llama.cpp models locally.
- AgentGPT - Configure and deploy autonomous AI agents.
- SuperAGI - A dev-first open-source autonomous AI agent framework.
- BabyAGI - AI-powered task management system using OpenAI.
- Open Interpreter - Open-source, locally running implementation of OpenAI's Code Interpreter.
- AutoPR - Automated pull request workflow powered by AI.
- Phind - An AI search engine and assistant for programmers.
- Character.AI - Create and chat with AI-powered characters.
- Agent2Agent - Google DeepMind's framework for multi-agent collaboration with LLMs.
- CrewAI - Framework for orchestrating role-playing, autonomous AI agents with collaborative intelligence.
- CopilotKit - Framework for building agent-native applications with Generative UI and human-in-the-loop workflows.
- LangGraph - Build resilient language agents as graphs with advanced state management.
- Open-AutoGLM - Open phone agent model and framework for unlocking AI phone capabilities.
- Mastra - Framework for building AI-powered applications and agents with modern TypeScript stack.
- deer-flow - ByteDance's community-driven Deep Research framework combining LLMs with web search and tools.
- openai-agents-python - OpenAI's lightweight, powerful framework for multi-agent workflows.
- 12-factor-agents - Principles for building production-ready LLM-powered software agents.
- DB-GPT - AI Native Data App Development framework with AWEL and Agents.
- elizaOS/eliza - Autonomous agents for everyone with Discord, Slack, Telegram integration.
- Microsoft AutoGen - Programming framework for agentic AI with multi-agent conversations.
AI-Powered IDEs
- Cursor - AI-first code editor based on VSCode with powerful chat, edit, and generation capabilities.
- Windsurf - AI agent-powered IDE that integrates with existing workflows, by Codeium.
- Trae - Adaptive AI IDE that transforms how you work, collaborating with you to run faster.
- Zed - High-performance, multiplayer code editor with AI features.
Development
- LangChain - Framework for developing applications powered by LLMs.
- Pinecone - Long-term memory for models with a vector database.
- Chroma - The open-source alternative to Pinecone.
- Plug-in-Play - Simplify the integration of plugins into open-source LLMs.
- GPTCache - Caching for LLM responses. Useful for cost savings.
- OpenAI Cookbook - A collection of examples and best practices for building with OpenAI.
- GPTCache - Semantic cache to store responses from LLM queries for cost savings.
- Vercel AI SDK - An open source library for building AI-powered user interfaces with React, Svelte, and Vue.
- How to build an agent with LangChain - Great Jupyter notebook from OpenAI.
- Mojo - A new programming language combining Python syntax with systems programming.
- Semantic Kernel - SDK that enables integration of AI LLMs with conventional programming languages.
- Langcorn - API server for serving LangChain models with FastAPI.
- smol developer - Your own personal junior developer :)
- smol plugin - Automatically generate OpenAI plugins by specifying your API in markdown.
- Kor - A thin wrapper on top of LLMs to extract structured data.
- tiktoken - A fast BPE tokenizer used with OpenAI's models.
- OpenAI Function Calling - Standardize LLM output.
- Vercel AI SDK - Build AI-powered applications with React, Svelte, and Vue.
- Code Interpreter API - Open-source implementation of ChatGPT Code Interpreter.
- Unsloth - Framework for fine-tuning Large Language Models.
- Pezzo - Developer-first LLMOps platform to streamline prompt design and version management.
- Lunary - Production toolkit for LLMs focusing on observability and evaluations.
- Ludwig - Low-code framework for building custom AI models.
- Langroid - Lightweight Python framework to build LLM-powered applications.
- LLMware - Unified framework for developing LLM-based application patterns.
- LLM App - Production framework for building and serving AI applications.
- LlamaIndex - A data framework for building LLM applications over external data.
- LMQL - A query language for large language models.
- Haystack - Framework for building NLP applications with language models.
- Prediction Guard - Integrate private, controlled, and compliant LLM functionality.
- Portkey - LLMOps platform to monitor, manage, and improve LLM-based apps.
- OpenRouter - A unified API to access 100+ LLMs from different providers through a single interface.
- Cline - Autonomous coding agent right in your IDE, capable of creating/editing files and executing commands.
- Screenpipe - Screen and voice AI recording for context-aware AI applications.
- Axolotl - Tool for fine-tuning LLMs with YAML configs supporting LoRA/QLoRA/full tuning.
- LLaMA-Factory - Web UI for fine-tuning LLaMA models with multi-GPU support.
RAG (Retrieval-Augmented Generation)
- RAGFlow - Leading open-source RAG engine fusing cutting-edge RAG with Agent capabilities.
- GraphRAG - Microsoft's modular graph-based RAG system for enhanced reasoning.
- LightRAG - Simple and fast Retrieval-Augmented Generation framework.
- R2R - Production-ready AI retrieval system with agentic RAG and RESTful API.
- Verba - RAG chatbot powered by Weaviate with customizable pipelines.
- AutoRAG - RAG evaluation and optimization framework with AutoML-style automation.
- Cognita - Modular, open-source RAG framework for production applications.
- RAG_Techniques - Showcase of advanced RAG techniques and implementations.
Model Context Protocol (MCP)
- fastmcp - Fast, Pythonic way to build MCP servers and clients for AI applications.
- mcp-for-beginners - Microsoft's open-source curriculum for MCP in multiple languages (.NET, Java, TypeScript, Python, Rust).
- claude-flow - Leading agent orchestration platform for Claude with MCP support and swarm intelligence.
- fastapi_mcp - Expose FastAPI endpoints as Model Context Protocol tools with Auth.
- mcp-chrome - Chrome extension-based MCP server for browser automation and content analysis.
- mcp-use - Easiest way to interact with MCP servers with custom agents.
- mcp-go - Go implementation of Model Context Protocol for seamless LLM integration.
- mcp-agent - Build effective agents using Model Context Protocol and workflow patterns.
- awesome-mcp-servers - Curated list of Model Context Protocol servers.
AI Workflow Automation
- n8n - Fair-code workflow automation platform with native AI capabilities and 400+ integrations.
- Dify - Production-ready platform for agentic workflow development with low-code interface.
- activepieces - AI Agents & MCPs & AI Workflow Automation with 400 MCP servers.
- sim - Open-source platform to build and deploy AI agent workflows.
AI Video Generation
- Sora - OpenAI's text-to-video model with narrative intelligence for creating realistic videos.
- Kling AI - Filmmaker-friendly video generation with collaboration features, 1080p, 10s videos.
- Luma AI (Dream Machine) - 4K HDR/EXR support for professional post-production quality video generation.
- Runway Gen-4 - Filmmaking tools with camera controls, Act-One, and comprehensive editing suite.
- Google Veo - Best all-arounder with strong prompt adherence, realism with audio, 1080p, 8s.
- Pika - Cost-effective video generation with creative controls and editing features.
- Seedance Pro - Most reliable video generation with stunning realism and 12s native generations.
- Minimax Hailuo - Good for object transformation with creative video effects.
AI Search Engines
- Perplexity AI - Conversational AI search with superior source citations and focus modes for different domains.
- You.com - Productivity engine with multiple AI models (GPT-4, Claude 3, Gemini) and customizable search modes.
- Andi Search - Ranked #1 for factual accuracy in 2024 benchmarks with conversational interface.
- Brave Search - Independent search with AI Answer feature, zero tracking, and ad-free results.
- Arc Search - Mobile-first with "Browse for Me" feature that compiles information into custom webpages.
- Consensus - Specialized in scientific and academic research search.
- Wolfram Alpha - Computational knowledge engine for scientific and mathematical queries.
- Phind - AI search engine for developers with multiple model choices and technical focus.
Tools
- Vault AI - Tool for uploading documents and asking questions about their content.
- LangFlow - Visual prototyping and experimentation with LangChain.
- Flowise - Visual tool to build your customized LLM flow.
- PentestGPT - A GPT-empowered penetration testing tool 🕵️.
- TypingMind - A better UI for ChatGPT.
- privateGPT - Ask questions to your documents without an internet connection.
- Quivr - Dump all your files and thoughts into your Generative AI second brain and chat with it.
- Stable Diffusion Web UI - A browser interface based on Gradio for Stable Diffusion.
- h2oGPT - Like privateGPT, but with GPU inference supported.
- localGPT - Inspired by privateGPT, using Vicuna-7b and InstructorEmbeddings.
- Promptflow - Create executable flowcharts linking LLMs, prompts, and functions.
- Dify - An open-source LLM app development platform.
- txtai - Semantic search and workflows powered by language models.
- Unofficial OpenAI Status - An in-depth OpenAI status page.
- gpt-prompt-engineer - Generates, tests, and ranks prompts for your task.
- rag-stack - Deploy a private ChatGPT alternative hosted within your VPC.
- AnythingLLM - Open-source ChatGPT equivalent for open and closed-source LLMs.
- DocsGPT - Streamlines finding information in project documentation.
- Dialoqbase - Facilitate the creation of custom chatbots using a knowledge base.
- FastGPT - Knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities.
ChatGPT Extensions
- WebChatGPT - Augment ChatGPT prompts with relevant web results.
- GPT for Sheets and Docs - ChatGPT extension for Google Sheets and Docs.
- YouTube Summary with ChatGPT - Summarize YouTube videos with ChatGPT.
- ChatGPT Prompt Genius - Discover and share the best prompts for ChatGPT.
- ChatGPT for Search Engines - Display ChatGPT responses alongside search results.
Code Assistants
- Refact - Open-source AI coding assistant with code completion and chat.
- Draw a UI - Draw a mockup and generate HTML using AI.
- Continue - Open-source autopilot for VS Code and JetBrains.
- Sweep AI - AI junior developer that turns bugs and requests into code changes.
- Cody - Free, open-source AI coding assistant for code completion and Q&A.
- Aider - Pair program with GPT-3.5/GPT-4 to edit code in your git repo.
- AutoPR - Automated pull request workflow powered by AI.
- bloop - ChatGPT for your codebase; search and generate patches.
- GitHub Copilot - GitHub Copilot.
- Ghostwriter - AI-powered pair programmer by Replit.
- Amazon CodeWhisperer - Build applications faster with ML-powered coding companion.
- MutableAI - AI-accelerated software development.
- GPT-Code UI - Open-source implementation of ChatGPT Code Interpreter.
Local AI Coding Agents & Assistants
Terminal/CLI-Based Coding Agents
- OpenClaw - Personal AI assistant that runs locally with autonomous task execution, plugin system, and multi-platform messaging integration.
- Codex - OpenAI's lightweight coding agent that runs in your terminal for autonomous code editing and execution.
- Claude Code - Anthropic's autonomous coding assistant with terminal integration and advanced reasoning.
- Goose - Open-source, extensible AI agent from Block that goes beyond code suggestions with install, execute, edit, and test capabilities.
- Roo Code - VS Code extension providing a whole dev team of AI agents with multiple modes, semantic indexing, and MCP support.
- pi-mono - AI agent toolkit with coding agent CLI, unified LLM API, TUI & web UI libraries, and vLLM pods.
Desktop Applications & Unified Clients
- Cherry Studio - Agentic AI desktop with 300+ assistants, autonomous coding, and unified access to frontier LLMs.
- AionUi - Free, local, open-source UI for multiple coding agents including Codex, Claude Code, Goose CLI, and more.
- cc-switch - Cross-platform All-in-One assistant tool for Claude Code, Codex, OpenCode & Gemini CLI with skills management.
- nanobot - Ultra-lightweight OpenClaw alternative with minimal resource requirements.
- nanoclaw - Lightweight alternative to OpenClaw running in Apple containers for security with WhatsApp integration.
Agent Management & Orchestration
- claude-squad - Manage multiple AI terminal agents like Claude Code, Aider, Codex, and Amp simultaneously.
- vibe-kanban - Kanban-style task management to get 10X more out of coding agents.
- humanlayer - Human-in-the-loop framework for AI coding agents solving hard problems in complex codebases.
- oh-my-opencode - The best agent harness for orchestrating multiple coding agents.
Skills & Extensions
- awesome-claude-skills - Curated list of Claude Skills, resources, and tools for customizing Claude AI workflows.
- awesome-openclaw-skills - Collection of OpenClaw Skills (formerly Moltbot/Clawdbot skills).
- awesome-agent-skills - 300+ agent skills from official teams and community, compatible with Codex, Claude Code, Cursor, and others.
- obsidian-skills - Agent skills for Obsidian note-taking integration.
- marketingskills - Marketing skills for Claude Code and AI agents including CRO, copywriting, SEO, and analytics.
Infrastructure & Utilities
- 1Panel - Web interface for Linux servers to manage OpenClaw agents, LLMs, websites, databases, and containers.
- LangBot - Production-grade platform for building agentic IM bots across Discord, Slack, Telegram, WeChat, and more.
- memU - Memory system for 24/7 proactive agents like openclaw with persistent context and scheduled jobs.
- moltworker - Run OpenClaw on Cloudflare Workers for serverless deployment.
- happy - Mobile and web client for Codex and Claude Code with realtime voice and encryption.
- ccusage - CLI tool for analyzing Claude Code/Codex usage from local JSONL files.
Vector Databases
- Weaviate - An open-source vector database that's robust and scalable.
- Milvus - Open-source vector database for embedding similarity search.
- Qdrant - Vector similarity search engine and database.
- Deep Lake - Database for AI optimized for deep-learning applications.
- Chroma - Open-source embedding database for AI applications.
- LanceDB - Developer-friendly vector database for AI applications.
Stable Diffusion
- Stable Diffusion Web UI - Browser interface for Stable Diffusion.
- Midjourney - Independent research lab exploring new mediums of thought.
- InvokeAI - Creative engine for Stable Diffusion models.
- ComfyUI - Powerful and modular Stable Diffusion GUI and backend.
- Lama Cleaner - Image inpainting tool powered by SOTA AI models.
- ControlNet - Neural network structure to control diffusion models by adding extra conditions.
- Stable Diffusion XL - Stability AI's advanced text-to-image model with improved quality and features.
Audio Generation
AI Voice Cloning
- Eleven Labs - AI voice generator with lifelike voices.
- Resemble AI - AI voice generator and voice cloning for text-to-speech.
- Murf AI - Create voiceovers with lifelike AI voices.
- Bark - Transformer-based text-to-audio model. #opensource
- AudioCraft - A library for audio processing and generation with deep learning, including MusicGen for music generation.
- Whisper - OpenAI's robust speech recognition model for transcription and translation.
Music Generation
- Harmonai - Open-source generative audio tools for music production.
- Mubert - Royalty-free music ecosystem powered by AI.
- MusicLM - Google's model for generating high-fidelity music from text.
Marketing AI Tools
- Jasper AI - AI-powered tool for generating marketing content like blogs, emails, and ad copy.
- Mutiny - Personalization platform to improve website conversions using AI.
- Clearbit - Lead enrichment and data intelligence platform.
- Adzooma - AI-powered PPC campaign management platform.
- Phrasee - AI tool that generates optimized marketing copy.
- Rupert AI - AI tools for designers and marketers.
- Persuva - AI-driven platform to create persuasive, high-converting ad copy at scale.
Other
- PromptBase - Marketplace for buying and selling quality prompts for AI models.
- Have I Been Trained? - Check if your image has been used to train AI art models.
- GummySearch - AI-based customer research via Reddit.
- Taplio - AI-powered LinkedIn tool.
- PromptPal - Search for prompts and bots, then use them with your favorite AI.
- Code to Flow - Visualize code logic as flowcharts using AI.
- AI-Flow - Connect multiple AI models easily.
- Architecture Helper - Analyze building architecture and generate custom styles.
- LM Studio - Discover, download, and run local LLMs with a desktop app.
- Ollama - Run, create, and share large language models locally.
- Jan - Open-source ChatGPT alternative that runs 100% offline on your computer.
- PrivateGPT - Interact privately with your documents using the power of LLMs, 100% privately, no data leaves your execution environment.
- LocalAI - Self-hosted, community-driven, local OpenAI-compatible API.
Quick Actions
Key Features
Model Context Protocol
Secure Communication
Real-time Updates
Open Source