sidebutton
Health Uyari
- License — License: Apache-2.0
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Low visibility — Only 5 GitHub stars
Code Gecti
- Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Gecti
- Permissions — No dangerous permissions requested
Bu listing icin henuz AI raporu yok.
Open-source browser automation for AI agents. MCP server + Chrome extension + YAML workflow engine + knowledge packs. Give your AI agent a browser and domain expertise.
SideButton
Local workflow automation for your browser, terminal, and AI.
Define workflows in YAML and execute them through browser automation, shell commands, and LLM integration.
Who is this for?
- Power users who repeat the same browser tasks daily (data entry, form filling, content publishing)
- Developers who want to automate workflows without writing code
- AI agent builders who need browser automation via MCP
- Teams who want shareable, version-controlled automation
Why SideButton?
| Reusable Workflows | Define once in YAML, run forever. No re-prompting AI every time. |
| Recording Mode | Click through any task once, export as reusable automation. |
| Embed Buttons | Inject one-click action buttons directly into any web page. |
| AI-Powered Steps | LLM classification and generation built into workflows. |
| MCP Server | Expose workflows to AI agents like Claude Code, Cursor, VS Code. |
| REST API | JSON endpoints for mobile and external integrations. |
Quick Start
# Install dependencies
pnpm install
# Build all packages
pnpm build
# Start the server
pnpm start
# Open http://localhost:9876 in your browser
Or run directly with the CLI:
pnpm cli serve # Start server with dashboard
pnpm cli serve --stdio # Start with stdio transport (for Claude Desktop)
pnpm cli list # List available workflows
pnpm cli status # Check server status
# Skill pack management
pnpm cli registry add <path|url> # Install skill packs from a registry
pnpm cli registry update [name] # Update installed packs
pnpm cli registry remove <name> # Uninstall packs and remove registry
pnpm cli search [query] # Search available skill packs
# Creating skill packs
pnpm cli init [domain] # Scaffold a new skill pack
pnpm cli validate [path] # Validate pack structure
pnpm cli publish [source] # Publish to a registry (--registry <path>)
pnpm cli publish --registry <path> # Reindex all packs in a registry (in-place mode)
Features
- Config-first workflows - Define actions in YAML files
- Browser automation - Navigate, click, type, scroll, extract via Chrome extension
- Shell execution - Run bash commands with output capture
- Terminal workflows - Execute commands in visible terminal windows (macOS)
- LLM integration - Text classification and generation via OpenAI/Anthropic
- Control flow - Conditionals, retries, and nested workflows
- Recording mode - Capture user actions to generate selectors
- Skill packs - Install domain-specific workflows, roles, and targets from registries
- MCP Server - Expose workflows to AI agents
- REST API - JSON endpoints for mobile and external integrations
Creating Workflows
Workflows are defined as YAML files in the workflows/ directory.
Basic Structure
id: hello_world
title: "Hello World"
steps:
- type: shell.run
cmd: "echo 'Hello from SideButton!'"
Step Types
| Type | Description |
|---|---|
| Browser | |
browser.navigate |
Open a URL |
browser.click |
Click an element by selector |
browser.type |
Type text into an element |
browser.scroll |
Scroll the page |
browser.extract |
Extract text from an element into variable |
browser.extractAll |
Extract all matching elements |
browser.wait |
Wait for element or fixed time delay |
browser.exists |
Check if element exists |
browser.hover |
Position cursor on element |
browser.key |
Send keyboard keys |
browser.snapshot |
Capture accessibility snapshot |
browser.extractMap |
Extract structured data from repeated elements |
browser.injectCSS |
Inject CSS styles into page |
browser.injectJS |
Execute JavaScript in page |
browser.select_option |
Select dropdown option |
browser.scrollIntoView |
Scroll element into view |
browser.fill |
Fill input value (React-compatible) |
| Shell | |
shell.run |
Execute a bash command |
terminal.open |
Open a visible terminal window (macOS) |
terminal.run |
Run command in terminal window |
| LLM | |
llm.classify |
Structured classification with categories |
llm.generate |
Free-form text generation |
| Control Flow | |
control.if |
Conditional branching |
control.retry |
Retry with backoff |
control.stop |
End workflow with message |
workflow.call |
Call another workflow with parameters |
| Data | |
data.first |
Extract first item from list |
Variable Interpolation
Use {{variable}} syntax to reference extracted values or parameters:
steps:
- type: browser.extract
selector: ".username"
as: user
- type: shell.run
cmd: "echo 'Hello, {{user}}!'"
Architecture
┌──────────────────────────────────────────────────────────────────────────┐
│ @sidebutton/server │
│ │
│ ┌─────────────────────┐ ┌──────────────────────────────────────────┐ │
│ │ stdio Transport │ │ Fastify HTTP + WebSocket (port 9876) │ │
│ │ ───────────────── │ │ ──────────────────────────────────── │ │
│ │ stdin → JSON-RPC │ │ GET / → Dashboard (Svelte) │ │
│ │ stdout ← JSON-RPC │ │ GET /ws → Chrome Extension WS │ │
│ │ (Claude Desktop) │ │ POST /mcp → MCP JSON-RPC (SSE) │ │
│ └──────────┬──────────┘ │ GET /api/* → REST API │ │
│ │ └──────────────────────┬───────────────────┘ │
│ │ │ │
│ └──────────────────┬──────────────────┘ │
│ ▼ │
│ ┌────────────────────────────────────────────────────────────────────┐ │
│ │ @sidebutton/core │ │
│ │ │ │
│ │ - Workflow types & parser (YAML) │ │
│ │ - Step executors (37 step types) │ │
│ │ - Variable interpolation │ │
│ │ - Execution context & events │ │
│ └────────────────────────────────────────────────────────────────────┘ │
└──────────────────────────────────────────────────────────────────────────┘
▲ ▲ ▲ ▲
│ stdio │ WebSocket │ HTTP POST │ REST
▼ ▼ ▼ ▼
┌──────────────┐ ┌─────────────────┐ ┌─────────────────┐ ┌───────────────────┐
│Claude Desktop│ │ Chrome Extension│ │ Claude Code │ │ Mobile App │
│ (MCP stdio) │ │ (Browser Auto) │ │ (MCP SSE) │ │ (REST Client) │
└──────────────┘ └─────────────────┘ └─────────────────┘ └───────────────────┘
Project Structure
sidebutton/
├── packages/
│ ├── core/ # @sidebutton/core
│ │ └── src/
│ │ ├── types.ts # Workflow types
│ │ ├── parser.ts # YAML loader
│ │ ├── executor.ts # Workflow runner
│ │ └── steps/ # Step implementations
│ ├── server/ # @sidebutton/server
│ │ ├── bin/ # CLI entry point
│ │ └── src/
│ │ ├── server.ts # Fastify HTTP server
│ │ ├── stdio-mode.ts # stdio transport entry point
│ │ ├── extension.ts # WebSocket client
│ │ ├── mcp/ # MCP handlers
│ │ │ ├── handler.ts # MCP JSON-RPC logic
│ │ │ ├── stdio.ts # stdio transport adapter
│ │ │ └── tools.ts # Tool definitions
│ │ └── cli.ts # Commander CLI
│ └── dashboard/ # Svelte web UI
│ └── src/
│ ├── App.svelte
│ └── lib/
├── extension/ # Chrome extension
├── workflows/ # Public workflow library
├── actions/ # User-created workflows
├── skills/ # Installed skill packs (per-domain)
└── run_logs/ # Execution history
Browser Extension
Install the Chrome extension to enable browser automation:
After installing:
- Navigate to any website
- Click the SideButton extension icon
- Click "Connect This Tab"
MCP Server (AI Agent Integration)
SideButton supports both stdio and SSE transports for MCP, implementing protocol version 2025-11-25 with full resumability support.
Claude Desktop
Add to ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"sidebutton": {
"command": "npx",
"args": ["sidebutton", "--stdio"]
}
}
}
The --stdio flag uses stdin/stdout for MCP communication. The HTTP server still runs in the background for browser extension connectivity.
Claude Code
Add to ~/.claude/settings.json:
{
"mcpServers": {
"sidebutton": {
"type": "sse",
"url": "http://localhost:9876/mcp"
}
}
}
Cursor
Add to ~/.cursor/mcp.json:
{
"mcpServers": {
"sidebutton": {
"url": "http://localhost:9876/mcp"
}
}
}
Available MCP Tools
| Tool | Description |
|---|---|
run_workflow |
Execute a workflow by ID |
list_workflows |
List all available workflows |
get_workflow |
Get workflow YAML definition |
get_run_log |
Get execution log for a run |
list_run_logs |
List recent workflow executions |
get_browser_status |
Check browser extension connection |
capture_page |
Capture selectors from current page |
navigate |
Navigate browser to URL |
snapshot |
Get page accessibility snapshot |
click |
Click an element |
type |
Type text into an element |
scroll |
Scroll the page |
screenshot |
Capture page screenshot |
hover |
Hover over element |
extract |
Extract text from element |
select_option |
Select dropdown option |
evaluate |
Execute JavaScript in browser |
Environment Variables
| Variable | Required For | Description |
|---|---|---|
OPENAI_API_KEY |
llm.* steps |
OpenAI API key for LLM workflows |
ANTHROPIC_API_KEY |
llm.* steps |
Anthropic API key (alternative) |
Development
# Install dependencies
pnpm install
# Build all packages
pnpm build
# Start server locally
pnpm start
# CLI commands
pnpm cli list # List workflows
pnpm cli status # Check status
pnpm cli serve # Start server
Watch Mode
# Full dev mode (all packages with watch rebuild)
pnpm dev
# Individual components
pnpm dev:server # Server with auto-restart on :9876
pnpm dev:dashboard # Dashboard watch build (outputs to server)
pnpm dev:core # Core library watch build
In dev mode:
- Server auto-restarts on code changes via tsx watch
- Dashboard continuously rebuilds into the server package
- Access everything at
http://localhost:9876
Platform Automation Disclaimer
SideButton is a general-purpose browser automation framework. When automating third-party platforms:
- Review Terms of Service: Many platforms prohibit or restrict automation. You are responsible for complying with the terms of any platform you automate.
- Account Risk: Automation may result in account restrictions or suspension on some platforms.
- Use Responsibly: Only automate actions you would perform manually. Respect rate limits and platform guidelines.
The authors do not endorse or encourage violations of third-party terms of service.
Legal
License
This project uses mixed licensing. See LICENSING.md for details.
- Engine, server, CLI, dashboard — Apache-2.0
- Browser extension — FSL-1.1-Apache-2.0 (converts to Apache-2.0 on 2029-03-15)
Yorumlar (0)
Yorum birakmak icin giris yap.
Yorum birakSonuc bulunamadi