SideButton

Name: sidebutton
Author: sidebutton

Local workflow automation for your browser, terminal, and AI.

Website · Documentation · GitHub

Define workflows in YAML and execute them through browser automation, shell commands, and LLM integration.

Who is this for?

Power users who repeat the same browser tasks daily (data entry, form filling, content publishing)
Developers who want to automate workflows without writing code
AI agent builders who need browser automation via MCP
Teams who want shareable, version-controlled automation

Why SideButton?


Reusable Workflows	Define once in YAML, run forever. No re-prompting AI every time.
Recording Mode	Click through any task once, export as reusable automation.
Embed Buttons	Inject one-click action buttons directly into any web page.
AI-Powered Steps	LLM classification and generation built into workflows.
MCP Server	Expose workflows to AI agents like Claude Code, Cursor, VS Code.
REST API	JSON endpoints for mobile and external integrations.

Quick Start

# Install dependencies
pnpm install

# Build all packages
pnpm build

# Start the server
pnpm start

# Open http://localhost:9876 in your browser

Or run directly with the CLI:

pnpm cli serve          # Start server with dashboard
pnpm cli serve --stdio  # Start with stdio transport (for Claude Desktop)
pnpm cli list           # List available workflows
pnpm cli status         # Check server status

# Skill pack management
pnpm cli registry add <path|url>   # Install skill packs from a registry
pnpm cli registry update [name]    # Update installed packs
pnpm cli registry remove <name>    # Uninstall packs and remove registry
pnpm cli search [query]            # Search available skill packs

# Creating skill packs
pnpm cli init [domain]             # Scaffold a new skill pack
pnpm cli validate [path]           # Validate pack structure
pnpm cli publish [source]          # Publish to a registry (--registry <path>)
pnpm cli publish --registry <path> # Reindex all packs in a registry (in-place mode)

Features

Config-first workflows - Define actions in YAML files
Browser automation - Navigate, click, type, scroll, extract via Chrome extension
Shell execution - Run bash commands with output capture
Terminal workflows - Execute commands in visible terminal windows (macOS)
LLM integration - Text classification and generation via OpenAI/Anthropic
Control flow - Conditionals, retries, and nested workflows
Recording mode - Capture user actions to generate selectors
Skill packs - Install domain-specific workflows, roles, and targets from registries
MCP Server - Expose workflows to AI agents
REST API - JSON endpoints for mobile and external integrations

Creating Workflows

Workflows are defined as YAML files in the workflows/ directory.

Basic Structure

id: hello_world
title: "Hello World"
steps:
  - type: shell.run
    cmd: "echo 'Hello from SideButton!'"

Step Types

Type	Description
Browser
`browser.navigate`	Open a URL
`browser.click`	Click an element by selector
`browser.type`	Type text into an element
`browser.scroll`	Scroll the page
`browser.extract`	Extract text from an element into variable
`browser.extractAll`	Extract all matching elements
`browser.wait`	Wait for element or fixed time delay
`browser.exists`	Check if element exists
`browser.hover`	Position cursor on element
`browser.key`	Send keyboard keys
`browser.snapshot`	Capture accessibility snapshot
`browser.extractMap`	Extract structured data from repeated elements
`browser.injectCSS`	Inject CSS styles into page
`browser.injectJS`	Execute JavaScript in page
`browser.select_option`	Select dropdown option
`browser.scrollIntoView`	Scroll element into view
`browser.fill`	Fill input value (React-compatible)
Shell
`shell.run`	Execute a bash command
`terminal.open`	Open a visible terminal window (macOS)
`terminal.run`	Run command in terminal window
LLM
`llm.classify`	Structured classification with categories
`llm.generate`	Free-form text generation
Control Flow
`control.if`	Conditional branching
`control.retry`	Retry with backoff
`control.stop`	End workflow with message
`workflow.call`	Call another workflow with parameters
Data
`data.first`	Extract first item from list

Variable Interpolation

Use {{variable}} syntax to reference extracted values or parameters:

steps:
  - type: browser.extract
    selector: ".username"
    as: user

  - type: shell.run
    cmd: "echo 'Hello, {{user}}!'"

Architecture

┌──────────────────────────────────────────────────────────────────────────┐
│                          @sidebutton/server                               │
│                                                                          │
│  ┌─────────────────────┐   ┌──────────────────────────────────────────┐  │
│  │  stdio Transport    │   │    Fastify HTTP + WebSocket (port 9876)  │  │
│  │  ─────────────────  │   │    ────────────────────────────────────  │  │
│  │  stdin → JSON-RPC   │   │  GET  /        → Dashboard (Svelte)      │  │
│  │  stdout ← JSON-RPC  │   │  GET  /ws      → Chrome Extension WS     │  │
│  │  (Claude Desktop)   │   │  POST /mcp     → MCP JSON-RPC (SSE)      │  │
│  └──────────┬──────────┘   │  GET  /api/*   → REST API                │  │
│             │              └──────────────────────┬───────────────────┘  │
│             │                                     │                      │
│             └──────────────────┬──────────────────┘                      │
│                                ▼                                         │
│  ┌────────────────────────────────────────────────────────────────────┐  │
│  │                       @sidebutton/core                              │  │
│  │                                                                     │  │
│  │  - Workflow types & parser (YAML)                                  │  │
│  │  - Step executors (37 step types)                                  │  │
│  │  - Variable interpolation                                          │  │
│  │  - Execution context & events                                      │  │
│  └────────────────────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────────────────────┘
      ▲                ▲                      ▲                      ▲
      │ stdio          │ WebSocket            │ HTTP POST            │ REST
      ▼                ▼                      ▼                      ▼
┌──────────────┐ ┌─────────────────┐ ┌─────────────────┐ ┌───────────────────┐
│Claude Desktop│ │ Chrome Extension│ │   Claude Code   │ │   Mobile App      │
│ (MCP stdio)  │ │ (Browser Auto)  │ │   (MCP SSE)     │ │   (REST Client)   │
└──────────────┘ └─────────────────┘ └─────────────────┘ └───────────────────┘

Project Structure

sidebutton/
├── packages/
│   ├── core/              # @sidebutton/core
│   │   └── src/
│   │       ├── types.ts       # Workflow types
│   │       ├── parser.ts      # YAML loader
│   │       ├── executor.ts    # Workflow runner
│   │       └── steps/         # Step implementations
│   ├── server/            # @sidebutton/server
│   │   ├── bin/               # CLI entry point
│   │   └── src/
│   │       ├── server.ts      # Fastify HTTP server
│   │       ├── stdio-mode.ts  # stdio transport entry point
│   │       ├── extension.ts   # WebSocket client
│   │       ├── mcp/           # MCP handlers
│   │       │   ├── handler.ts    # MCP JSON-RPC logic
│   │       │   ├── stdio.ts      # stdio transport adapter
│   │       │   └── tools.ts      # Tool definitions
│   │       └── cli.ts         # Commander CLI
│   └── dashboard/         # Svelte web UI
│       └── src/
│           ├── App.svelte
│           └── lib/
├── extension/             # Chrome extension
├── workflows/             # Public workflow library
├── actions/               # User-created workflows
├── skills/                # Installed skill packs (per-domain)
└── run_logs/              # Execution history

Browser Extension

Install the Chrome extension to enable browser automation:

Install from Chrome Web Store

After installing:

Navigate to any website
Click the SideButton extension icon
Click "Connect This Tab"

MCP Server (AI Agent Integration)

SideButton supports both stdio and SSE transports for MCP, implementing protocol version 2025-11-25 with full resumability support.

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "sidebutton": {
      "command": "npx",
      "args": ["sidebutton", "--stdio"]
    }
  }
}

The --stdio flag uses stdin/stdout for MCP communication. The HTTP server still runs in the background for browser extension connectivity.

Claude Code

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "sidebutton": {
      "type": "sse",
      "url": "http://localhost:9876/mcp"
    }
  }
}

Cursor

Add to ~/.cursor/mcp.json:

{
  "mcpServers": {
    "sidebutton": {
      "url": "http://localhost:9876/mcp"
    }
  }
}

Available MCP Tools

Tool	Description
`run_workflow`	Execute a workflow by ID
`list_workflows`	List all available workflows
`get_workflow`	Get workflow YAML definition
`get_run_log`	Get execution log for a run
`list_run_logs`	List recent workflow executions
`get_browser_status`	Check browser extension connection
`capture_page`	Capture selectors from current page
`navigate`	Navigate browser to URL
`snapshot`	Get page accessibility snapshot
`click`	Click an element
`type`	Type text into an element
`scroll`	Scroll the page
`screenshot`	Capture page screenshot
`hover`	Hover over element
`extract`	Extract text from element
`select_option`	Select dropdown option
`evaluate`	Execute JavaScript in browser

Environment Variables

Variable	Required For	Description
`OPENAI_API_KEY`	`llm.*` steps	OpenAI API key for LLM workflows
`ANTHROPIC_API_KEY`	`llm.*` steps	Anthropic API key (alternative)

Development

# Install dependencies
pnpm install

# Build all packages
pnpm build

# Start server locally
pnpm start

# CLI commands
pnpm cli list          # List workflows
pnpm cli status        # Check status
pnpm cli serve         # Start server

Watch Mode

# Full dev mode (all packages with watch rebuild)
pnpm dev

# Individual components
pnpm dev:server        # Server with auto-restart on :9876
pnpm dev:dashboard     # Dashboard watch build (outputs to server)
pnpm dev:core          # Core library watch build

In dev mode:

Server auto-restarts on code changes via tsx watch
Dashboard continuously rebuilds into the server package
Access everything at http://localhost:9876

Platform Automation Disclaimer

SideButton is a general-purpose browser automation framework. When automating third-party platforms:

Review Terms of Service: Many platforms prohibit or restrict automation. You are responsible for complying with the terms of any platform you automate.
Account Risk: Automation may result in account restrictions or suspension on some platforms.
Use Responsibly: Only automate actions you would perform manually. Respect rate limits and platform guidelines.

The authors do not endorse or encourage violations of third-party terms of service.

Legal

License

This project uses mixed licensing. See LICENSING.md for details.

Engine, server, CLI, dashboard — Apache-2.0
Browser extension — FSL-1.1-Apache-2.0 (converts to Apache-2.0 on 2029-03-15)