contop

agent
Guvenlik Denetimi
Basarisiz
Health Uyari
  • License — License: MIT
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Basarisiz
  • network request — Outbound network request in contop-cli-proxy/src/codex-direct-session.ts
  • execSync — Synchronous shell command execution in contop-cli-proxy/src/index.ts
Permissions Gecti
  • Permissions — No dangerous permissions requested
Purpose
This agent acts as an AI-powered remote desktop interface, allowing you to control a desktop computer autonomously from a mobile phone using typed or spoken commands.

Security Assessment
This tool carries a High overall security risk. Its core functionality involves controlling a desktop, which is reflected in the code analysis: it utilizes synchronous shell command execution (`execSync`) and makes outbound network requests to facilitate communication between the phone and the desktop. By design, the application has deep access to your operating system and can execute sensitive actions. While no hardcoded secrets or explicitly dangerous repository permissions were found, granting a remote tool the ability to run shell commands inherently requires the user to place a massive amount of trust in the software.

Quality Assessment
The project is relatively new but actively maintained, with repository pushes occurring as recently as today. It uses a standard permissive MIT license and includes clear documentation. However, it currently suffers from extremely low community visibility, evidenced by having only 5 GitHub stars. Because the codebase has not been widely reviewed or battle-tested by the open-source community, its underlying reliability and security remain unproven.

Verdict
Use with caution—while functional, the combination of low community trust and high-risk shell execution capabilities means you should thoroughly inspect the source code before relying on it.
SUMMARY

Your Desktop, From Anywhere. AI-powered remote desktop control from your phone.

README.md

Contop

Contop

Your Desktop, From Anywhere

AI-powered remote desktop control from your phone

WebsiteDocsReleases

Latest mobile release Latest desktop release License Python TypeScript (mobile) TypeScript (desktop) React Native Expo Tauri FastAPI Google ADK PinchTab v0.8.2 Platform


Contop turns your phone into an AI remote control for any desktop. Speak or type a command on your mobile, and an autonomous agent on your computer observes your screen, runs CLI commands, clicks buttons, fills forms, automates browsers, and streams progress back - all in real time over a peer-to-peer WebRTC tunnel.

No port forwarding. No VPN. No SSH. Scan a QR code and start working.


Install

Desktop App

macOS (Homebrew - recommended):

brew install slopedrop/contop/contop

# Update to latest version
brew update && brew upgrade contop

No security warnings. Python dependencies install automatically on first launch.

macOS (manual):
Download the .dmg from Releases, open it, and drag to Applications.

First launch: right-click the app → Open → click Open in the dialog. This is standard for open-source apps without code signing.

Windows (Scoop - recommended):

scoop bucket add contop https://github.com/slopedrop/scoop-contop
scoop install contop

# Update to latest version
scoop update && scoop update contop

No SmartScreen warnings. Python dependencies install automatically on first launch. NVIDIA GPU with CUDA is auto-detected.

Windows (manual):
Download the .exe installer from Releases and run it.

SmartScreen may show a warning - click More infoRun anyway. This is standard for open-source apps without code signing.

Linux:

# AppImage (any distro)
chmod +x Contop*.AppImage && ./Contop*.AppImage

# Debian / Ubuntu
sudo dpkg -i contop-desktop_*.deb

Download from Releases.

Mobile App

Download the Android .apk from Releases.

Enable Install from unknown sources when prompted.

iOS is not yet available for public alpha.


How It Works

How It Works - Phone to Agent to Screen

Demo

https://github.com/user-attachments/assets/98705f2a-72f3-4e26-ae48-bbb889ec9c97

Screenshots

Manual control - direct touch control of the remote desktop
Manual Control - direct touch control of the remote screen
Landscape split view - live video beside the execution thread
Landscape Split View - live video beside the execution thread
Session history - persisted conversation list
Session History
Session history - restored conversation with full execution thread
Restored Session
Settings - models and API keys
Settings - Models
Settings - security rules and system prompts
Settings - Security

Features

Autonomous AI Agent

  • 30+ execution tools - CLI, GUI automation, file operations, browser control, window management, document processing, app lifecycle, and more
  • Smart vision routing - 9 backends: OmniParser V2, Gemini Computer Use, Accessibility Tree, and 6 OpenRouter vision models (UI-TARS, Kimi, Qwen, Phi, Molmo, Holotron)
  • Multi-step planning - plan-generation tool with research sub-agent, tool chaining, and up to 50 iterations per task
  • Multi-provider LLM - Gemini, OpenAI, Anthropic, and OpenRouter (100+ models including Groq, Mistral, DeepSeek, and more) via LiteLLM
  • Subscription mode - use your existing Claude Pro/Max, Gemini Pro, or ChatGPT Plus/Pro subscription instead of API keys via the built-in CLI proxy (Claude Code, Gemini CLI, Codex CLI - text-only, no LLM vision fallback)
  • Skills system - extensible via SKILL.md standard with YAML workflows and Python tool loading
  • Real-time feedback - step-by-step progress, screenshots, and model/backend transparency streamed to your phone

Security

  • Dual-Tool Evaluator - every command classified and routed through a security gate before execution
  • Destructive action approval - dangerous operations require explicit user confirmation
  • Sandboxed execution - high-risk commands run in an isolated Docker container
  • Restricted path isolation - prevents agent from accessing protected directories
  • JSONL audit log - every tool call logged with timestamps, commands, and outcomes
  • Away Mode - PIN-locked secure overlay with auto-engage on idle (Windows)

Connectivity

  • QR code pairing - scan to connect with 30-day persistent tokens, no IP configuration needed
  • Cloudflare Tunnel - automatic public URL, zero port forwarding
  • WebRTC P2P - dual data channels (reliable + unreliable) with live video streaming
  • Paired device management - geo-location tracking, connection path visibility, per-device revoke, OS notifications
  • Connection loss resilience - automatic execution kill on disconnect, chat-only fallback mode

Desktop App (Tauri v2)

  • Lightweight native shell (Rust) with settings GUI
  • Manages the Python server as a sidecar process
  • API key and subscription mode configuration, security rules, system prompts
  • CLI proxy lifecycle management - auto-start, health monitoring, and watchdog restart
  • Cross-platform: Windows, macOS, Linux

Mobile App (Expo / React Native)

  • Adaptive layouts: split-view, side-by-side, fullscreen video, thread-focus
  • Real-time execution thread with tool outputs and screenshots
  • Session history with persistence and restore
  • Model selection, extended thinking toggle, custom instructions

Architecture

Architecture - Mobile, Desktop Host, External Services

Tech Stack

Layer Technology
Mobile React Native 0.83, Expo 55, TypeScript, NativeWind v4, Zustand
Desktop Tauri v2 (Rust + Vite), Win32 APIs for Away Mode
Server Python 3.12, FastAPI, asyncio, aiortc
AI Agent Google ADK, LiteLLM (multi-provider routing)
AI Models Gemini, OpenAI, Anthropic, Any model on OpenRouter (API keys or CLI subscriptions)
Vision OmniParser V2, Gemini Computer Use, Accessibility Tree, 6 OpenRouter models
Automation PyAutoGUI, platform adapters (Win/Mac/Linux), PinchTab CDP
Networking WebRTC (aiortc), Cloudflare Tunnels, DTLS encryption
Security Dual-Tool Evaluator, Docker sandbox

Development Setup

Prerequisites

  • Python 3.12+ with uv
  • Node.js 18+
  • At least one LLM API key: Gemini, OpenAI, Anthropic, or OpenRouter - or an existing Claude Pro/Max, Gemini Pro, or ChatGPT Plus/Pro subscription via the built-in CLI proxy
  • Android / iOS device with Expo dev build

1. Start the Server

cd contop-server
uv sync
uv run uvicorn main:app --host 0.0.0.0 --port 8000

2. Run the Desktop App (optional)

cd contop-desktop
npm install
npm run tauri dev

3. Run the Mobile App

cd contop-mobile
npm install
npx expo run:android   # or: npx expo run:ios

4. Pair and Go

  1. Open the desktop app (or visit http://localhost:8000) to see the QR code
  2. Scan the QR code from the mobile app
  3. Start speaking or typing - the agent observes your screen and executes your commands

For detailed setup, platform-specific instructions, and configuration options, see the full documentation.

Project Structure

contop/
├── contop-server/           # Python FastAPI server + AI agent
│   ├── core/                # Agent, evaluator, signaling, pairing, skills engine
│   ├── tools/               # Vision backends, Docker sandbox, browser automation
│   ├── platform_adapters/   # OS-specific automation (Win / Mac / Linux)
│   ├── skills/              # Built-in skills (web research, IDE chat, CLI patterns)
│   ├── prompts/             # Agent system prompts
│   └── tests/               # pytest (unit + ATDD)
├── contop-mobile/           # Expo / React Native mobile client
│   ├── app/                 # Expo Router screens
│   ├── components/          # ExecutionThread, ExecutionInputBar, RemoteScreen
│   ├── hooks/               # useWebRTC, useConversation
│   ├── stores/              # Zustand state management
│   └── services/            # AI settings, session storage
├── contop-cli-proxy/        # CLI subscription proxy (Node.js / TypeScript)
│   └── src/                 # OpenAI-compatible proxy wrapping Claude/Gemini/Codex CLIs
├── contop-desktop/          # Tauri v2 desktop app
│   ├── src/                 # Vite frontend (HTML/CSS/JS)
│   └── src-tauri/           # Rust backend, Away Mode, sidecar + proxy management
├── website/                 # Next.js 15 marketing site
└── docs/                    # Docusaurus 3 documentation

Testing

cd contop-server && uv run pytest                # all server tests
cd contop-mobile && npx jest                     # all mobile tests

Links

Resource URL
Website contop.app
Documentation docs.contop.app
Releases GitHub Releases
Issues GitHub Issues

License

MIT

Yorumlar (0)

Sonuc bulunamadi