sidebutton

mcp
Guvenlik Denetimi
Uyari
Health Uyari
  • License — License: Apache-2.0
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Gecti
  • Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Gecti
  • Permissions — No dangerous permissions requested

Bu listing icin henuz AI raporu yok.

SUMMARY

Open-source browser automation for AI agents. MCP server + Chrome extension + YAML workflow engine + knowledge packs. Give your AI agent a browser and domain expertise.

README.md

SideButton

Local workflow automation for your browser, terminal, and AI.

License
Website
Docs

Website · Documentation · GitHub

Define workflows in YAML and execute them through browser automation, shell commands, and LLM integration.

Who is this for?

  • Power users who repeat the same browser tasks daily (data entry, form filling, content publishing)
  • Developers who want to automate workflows without writing code
  • AI agent builders who need browser automation via MCP
  • Teams who want shareable, version-controlled automation

Why SideButton?

Reusable Workflows Define once in YAML, run forever. No re-prompting AI every time.
Recording Mode Click through any task once, export as reusable automation.
Embed Buttons Inject one-click action buttons directly into any web page.
AI-Powered Steps LLM classification and generation built into workflows.
MCP Server Expose workflows to AI agents like Claude Code, Cursor, VS Code.
REST API JSON endpoints for mobile and external integrations.

Quick Start

# Install dependencies
pnpm install

# Build all packages
pnpm build

# Start the server
pnpm start

# Open http://localhost:9876 in your browser

Or run directly with the CLI:

pnpm cli serve          # Start server with dashboard
pnpm cli serve --stdio  # Start with stdio transport (for Claude Desktop)
pnpm cli list           # List available workflows
pnpm cli status         # Check server status

# Skill pack management
pnpm cli registry add <path|url>   # Install skill packs from a registry
pnpm cli registry update [name]    # Update installed packs
pnpm cli registry remove <name>    # Uninstall packs and remove registry
pnpm cli search [query]            # Search available skill packs

# Creating skill packs
pnpm cli init [domain]             # Scaffold a new skill pack
pnpm cli validate [path]           # Validate pack structure
pnpm cli publish [source]          # Publish to a registry (--registry <path>)
pnpm cli publish --registry <path> # Reindex all packs in a registry (in-place mode)

Features

  • Config-first workflows - Define actions in YAML files
  • Browser automation - Navigate, click, type, scroll, extract via Chrome extension
  • Shell execution - Run bash commands with output capture
  • Terminal workflows - Execute commands in visible terminal windows (macOS)
  • LLM integration - Text classification and generation via OpenAI/Anthropic
  • Control flow - Conditionals, retries, and nested workflows
  • Recording mode - Capture user actions to generate selectors
  • Skill packs - Install domain-specific workflows, roles, and targets from registries
  • MCP Server - Expose workflows to AI agents
  • REST API - JSON endpoints for mobile and external integrations

Creating Workflows

Workflows are defined as YAML files in the workflows/ directory.

Basic Structure

id: hello_world
title: "Hello World"
steps:
  - type: shell.run
    cmd: "echo 'Hello from SideButton!'"

Step Types

Type Description
Browser
browser.navigate Open a URL
browser.click Click an element by selector
browser.type Type text into an element
browser.scroll Scroll the page
browser.extract Extract text from an element into variable
browser.extractAll Extract all matching elements
browser.wait Wait for element or fixed time delay
browser.exists Check if element exists
browser.hover Position cursor on element
browser.key Send keyboard keys
browser.snapshot Capture accessibility snapshot
browser.extractMap Extract structured data from repeated elements
browser.injectCSS Inject CSS styles into page
browser.injectJS Execute JavaScript in page
browser.select_option Select dropdown option
browser.scrollIntoView Scroll element into view
browser.fill Fill input value (React-compatible)
Shell
shell.run Execute a bash command
terminal.open Open a visible terminal window (macOS)
terminal.run Run command in terminal window
LLM
llm.classify Structured classification with categories
llm.generate Free-form text generation
Control Flow
control.if Conditional branching
control.retry Retry with backoff
control.stop End workflow with message
workflow.call Call another workflow with parameters
Data
data.first Extract first item from list

Variable Interpolation

Use {{variable}} syntax to reference extracted values or parameters:

steps:
  - type: browser.extract
    selector: ".username"
    as: user

  - type: shell.run
    cmd: "echo 'Hello, {{user}}!'"

Architecture

┌──────────────────────────────────────────────────────────────────────────┐
│                          @sidebutton/server                               │
│                                                                          │
│  ┌─────────────────────┐   ┌──────────────────────────────────────────┐  │
│  │  stdio Transport    │   │    Fastify HTTP + WebSocket (port 9876)  │  │
│  │  ─────────────────  │   │    ────────────────────────────────────  │  │
│  │  stdin → JSON-RPC   │   │  GET  /        → Dashboard (Svelte)      │  │
│  │  stdout ← JSON-RPC  │   │  GET  /ws      → Chrome Extension WS     │  │
│  │  (Claude Desktop)   │   │  POST /mcp     → MCP JSON-RPC (SSE)      │  │
│  └──────────┬──────────┘   │  GET  /api/*   → REST API                │  │
│             │              └──────────────────────┬───────────────────┘  │
│             │                                     │                      │
│             └──────────────────┬──────────────────┘                      │
│                                ▼                                         │
│  ┌────────────────────────────────────────────────────────────────────┐  │
│  │                       @sidebutton/core                              │  │
│  │                                                                     │  │
│  │  - Workflow types & parser (YAML)                                  │  │
│  │  - Step executors (37 step types)                                  │  │
│  │  - Variable interpolation                                          │  │
│  │  - Execution context & events                                      │  │
│  └────────────────────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────────────────────┘
      ▲                ▲                      ▲                      ▲
      │ stdio          │ WebSocket            │ HTTP POST            │ REST
      ▼                ▼                      ▼                      ▼
┌──────────────┐ ┌─────────────────┐ ┌─────────────────┐ ┌───────────────────┐
│Claude Desktop│ │ Chrome Extension│ │   Claude Code   │ │   Mobile App      │
│ (MCP stdio)  │ │ (Browser Auto)  │ │   (MCP SSE)     │ │   (REST Client)   │
└──────────────┘ └─────────────────┘ └─────────────────┘ └───────────────────┘

Project Structure

sidebutton/
├── packages/
│   ├── core/              # @sidebutton/core
│   │   └── src/
│   │       ├── types.ts       # Workflow types
│   │       ├── parser.ts      # YAML loader
│   │       ├── executor.ts    # Workflow runner
│   │       └── steps/         # Step implementations
│   ├── server/            # @sidebutton/server
│   │   ├── bin/               # CLI entry point
│   │   └── src/
│   │       ├── server.ts      # Fastify HTTP server
│   │       ├── stdio-mode.ts  # stdio transport entry point
│   │       ├── extension.ts   # WebSocket client
│   │       ├── mcp/           # MCP handlers
│   │       │   ├── handler.ts    # MCP JSON-RPC logic
│   │       │   ├── stdio.ts      # stdio transport adapter
│   │       │   └── tools.ts      # Tool definitions
│   │       └── cli.ts         # Commander CLI
│   └── dashboard/         # Svelte web UI
│       └── src/
│           ├── App.svelte
│           └── lib/
├── extension/             # Chrome extension
├── workflows/             # Public workflow library
├── actions/               # User-created workflows
├── skills/                # Installed skill packs (per-domain)
└── run_logs/              # Execution history

Browser Extension

Install the Chrome extension to enable browser automation:

Install from Chrome Web Store

After installing:

  1. Navigate to any website
  2. Click the SideButton extension icon
  3. Click "Connect This Tab"

MCP Server (AI Agent Integration)

SideButton supports both stdio and SSE transports for MCP, implementing protocol version 2025-11-25 with full resumability support.

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "sidebutton": {
      "command": "npx",
      "args": ["sidebutton", "--stdio"]
    }
  }
}

The --stdio flag uses stdin/stdout for MCP communication. The HTTP server still runs in the background for browser extension connectivity.

Claude Code

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "sidebutton": {
      "type": "sse",
      "url": "http://localhost:9876/mcp"
    }
  }
}

Cursor

Add to ~/.cursor/mcp.json:

{
  "mcpServers": {
    "sidebutton": {
      "url": "http://localhost:9876/mcp"
    }
  }
}

Available MCP Tools

Tool Description
run_workflow Execute a workflow by ID
list_workflows List all available workflows
get_workflow Get workflow YAML definition
get_run_log Get execution log for a run
list_run_logs List recent workflow executions
get_browser_status Check browser extension connection
capture_page Capture selectors from current page
navigate Navigate browser to URL
snapshot Get page accessibility snapshot
click Click an element
type Type text into an element
scroll Scroll the page
screenshot Capture page screenshot
hover Hover over element
extract Extract text from element
select_option Select dropdown option
evaluate Execute JavaScript in browser

Environment Variables

Variable Required For Description
OPENAI_API_KEY llm.* steps OpenAI API key for LLM workflows
ANTHROPIC_API_KEY llm.* steps Anthropic API key (alternative)

Development

# Install dependencies
pnpm install

# Build all packages
pnpm build

# Start server locally
pnpm start

# CLI commands
pnpm cli list          # List workflows
pnpm cli status        # Check status
pnpm cli serve         # Start server

Watch Mode

# Full dev mode (all packages with watch rebuild)
pnpm dev

# Individual components
pnpm dev:server        # Server with auto-restart on :9876
pnpm dev:dashboard     # Dashboard watch build (outputs to server)
pnpm dev:core          # Core library watch build

In dev mode:

  • Server auto-restarts on code changes via tsx watch
  • Dashboard continuously rebuilds into the server package
  • Access everything at http://localhost:9876

Platform Automation Disclaimer

SideButton is a general-purpose browser automation framework. When automating third-party platforms:

  • Review Terms of Service: Many platforms prohibit or restrict automation. You are responsible for complying with the terms of any platform you automate.
  • Account Risk: Automation may result in account restrictions or suspension on some platforms.
  • Use Responsibly: Only automate actions you would perform manually. Respect rate limits and platform guidelines.

The authors do not endorse or encourage violations of third-party terms of service.

Legal

License

This project uses mixed licensing. See LICENSING.md for details.

Yorumlar (0)

Sonuc bulunamadi