JUHE API Marketplace
MCP Server

ScreenPilot

Enables an AI model to control GUI interactions by providing mouse and keyboard tools. Suited to general automation, education, and experimentation.

GitHub Stars: 50
Last Updated: 11/22/2025

README Documentation

ScreenPilot

An MCP server that lets an LLM take full control of your device by providing a screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education, and having fun.

Main Features

  • 📷 Screen capture and analysis
  • 🖱️ Mouse control (clicking, positioning)
  • ⌨️ Keyboard input (typing, key presses, hotkeys)

Watch the demo:

https://github.com/user-attachments/assets/c18380c0-b3dd-4b7c-925d-28ef205ca11f

Installation

  1. Install Python 3.12.
  2. Clone the repository:
    git clone https://github.com/Mtehabsim/ScreenPilot.git

  3. Create a virtual environment:
    python -m venv venv

  4. Activate the environment (Windows):
    venv\Scripts\activate

  5. Install the required packages (run this from the repository folder, where requirements.txt lives):
    pip install -r requirements.txt

  6. Open Claude Desktop.
  7. Go to File → Settings → Developer → Edit Config.
  8. Open the config file and paste the following:
{
    "mcpServers": {
        "device-controll": {
            "command": "pathToEnv\\venv\\Scripts\\python.exe",
            "args": [
                "pathToProject\\ScreenPilot\\main.py"
            ]
        }
    }
}

  9. Replace the two placeholder paths, keeping every backslash doubled (\\) because a single backslash starts an escape sequence in JSON:
    "pathToEnv\\venv\\Scripts\\python.exe" → the full path to the python.exe inside your virtual environment
    "pathToProject\\ScreenPilot\\main.py" → the full path to your main.py file

  10. Save the config file.

  11. In Claude Desktop, go to File → Exit so the application quits completely rather than just closing its window.

  12. Reopen Claude Desktop. It reads the updated config on launch, and ScreenPilot is ready to use.
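The path substitution in the config step is easy to get wrong by hand, mostly because of the doubled backslashes. The following is a minimal sketch that generates the same JSON from two folder paths. The function name `build_mcp_config` and the example paths are assumptions made for illustration, and the sketch assumes the folder layout implied by the placeholders (a `venv` folder under one path, a `ScreenPilot` checkout under the other). It is not part of ScreenPilot itself.

```python
import json
from pathlib import PureWindowsPath


def build_mcp_config(env_dir: str, project_dir: str,
                     server_name: str = "device-controll") -> dict:
    """Build the Claude Desktop config entry for a ScreenPilot checkout.

    env_dir     -- folder that contains the `venv` directory (assumed layout)
    project_dir -- folder that contains the `ScreenPilot` checkout (assumed layout)
    """
    # PureWindowsPath joins with backslashes regardless of the host OS.
    python_exe = PureWindowsPath(env_dir) / "venv" / "Scripts" / "python.exe"
    main_py = PureWindowsPath(project_dir) / "ScreenPilot" / "main.py"
    return {
        "mcpServers": {
            server_name: {
                "command": str(python_exe),
                "args": [str(main_py)],
            }
        }
    }


if __name__ == "__main__":
    # Example paths only; substitute your own locations.
    config = build_mcp_config(r"C:\Users\me", r"C:\Users\me\projects")
    # json.dumps escapes each backslash as \\ automatically.
    print(json.dumps(config, indent=4))
```

Paste the printed JSON into the config file opened through Edit Config. Letting `json.dumps` handle the escaping avoids the most common failure here, a config that no longer parses because of a single unescaped backslash.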

Available Tools

  • Screen Capture: Take screenshots and get screen information
  • Mouse Control: Move the mouse and perform clicks
  • Keyboard Actions: Type text, press keys, and use hotkey combinations
  • Scrolling: Scroll in different directions and to specific positions
  • Element Detection: Check if elements exist on screen and wait for them to appear
  • Action Sequences: Perform multiple actions in sequence
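To make the "Action Sequences" idea concrete, here is a minimal sketch of how a list of actions might be dispatched in order. The action format, the `run_sequence` function, and the handler names are hypothetical and chosen for illustration; they are not ScreenPilot's actual tool names or signatures, which this page does not document. The handlers below only record calls, so the example runs without touching the mouse or keyboard.

```python
from typing import Any, Callable, Dict, List

# Hypothetical action format: {"action": <name>, ...keyword arguments}.
Action = Dict[str, Any]
Handler = Callable[..., Any]


def run_sequence(actions: List[Action], handlers: Dict[str, Handler]) -> List[Any]:
    """Run each action in order and collect the handlers' return values.

    Unknown action names raise ValueError before anything runs, so a typo
    cannot leave the GUI in a half-completed state.
    """
    unknown = [a["action"] for a in actions if a["action"] not in handlers]
    if unknown:
        raise ValueError(f"unknown actions: {unknown}")

    results = []
    for step in actions:
        # Everything except the action name is passed as keyword arguments.
        kwargs = {k: v for k, v in step.items() if k != "action"}
        results.append(handlers[step["action"]](**kwargs))
    return results


if __name__ == "__main__":
    log = []
    # Stand-in handlers that only record calls; a real server would
    # drive the mouse and keyboard here instead.
    handlers = {
        "click": lambda x, y: log.append(("click", x, y)),
        "type": lambda text: log.append(("type", text)),
    }
    run_sequence(
        [
            {"action": "click", "x": 100, "y": 200},
            {"action": "type", "text": "hello"},
        ],
        handlers,
    )
    print(log)
```

Validating the whole sequence up front is a deliberate choice in this sketch: GUI actions are hard to undo, so rejecting a bad sequence before the first click is safer than failing midway.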

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
