the-dev-squad

agent
Security Audit
Warn
Health Pass
  • License — License: MIT
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Community trust — 22 GitHub stars
Code Warn
  • process.env — Environment variable access in pipeline/orchestrator.ts
  • process.env — Environment variable access in src/app/api/chat/route.ts
  • fs module — File system access in src/app/api/chat/route.ts
Permissions Pass
  • Permissions — No dangerous permissions requested
Purpose
This agent orchestrates five separate Claude Code sessions to automatically plan, review, code, and test software projects. It acts as an automated development pipeline that runs entirely on a local Claude subscription without requiring external API keys.

Security Assessment
Overall risk: Medium. The tool requires access to environment variables and the local file system to read code and write generated outputs, which is standard for a development agent. It does not request dangerous system permissions and has no hardcoded secrets. However, because it orchestrates multiple autonomous AI sessions that read and write to your local files, you should carefully monitor exactly what code is generated and executed on your machine.

Quality Assessment
The project is actively maintained, with repository activity as recent as today. It is released under the permissive and standard MIT license. Community trust is currently early-stage but positive, indicated by 22 GitHub stars. The comprehensive README demonstrates clear intent, strict stage-gating, and a well-thought-out architecture. Compatible command-line interfaces are currently unknown.

Verdict
Use with caution.
SUMMARY

5 Claude Code sessions that talk to each other to build software. Bulletproof plans. Bulletproof builds. No API costs.

README.md

The Dev Squad

5 Claude Code sessions that talk to each other to build software.

Version License Claude Opus 4.6 5 Agents Zero API Cost Node 22+

The Dev Squad Demo


One plans. One reviews. One codes. One tests. One supervises. They communicate through structured signals, review each other's work, and loop until every step is right. The result is bulletproof plans that produce bulletproof builds.

No API keys. No per-token costs. All 5 sessions run on your Claude subscription.


Why This Exists

I was spending hours writing build plans. Not specs — full plans with complete, copy-pasteable code for every file. I found that if the plan was bulletproof, the build was bulletproof. No guessing, no improvising, no "I'll figure it out during implementation." The coder just follows the plan.

But writing those plans was brutal. I'd go back and forth with Claude — "is this thorough enough?", "did you verify this package exists?", "what about error handling?" — until the session would lose context. I'd open a new session, paste the plan, keep going. Then I'd open another session for the reviewer, another for the coder, another for testing. I was manually orchestrating 4-5 Claude sessions, copying messages between them.

The Dev Squad is what happens when you automate that. The planner writes the plan with complete code — not descriptions, not pseudocode, actual code. The reviewer tears it apart and loops with the planner until there are zero gaps. Only then does the coder touch it. The coder doesn't think — it follows the plan exactly. The tester doesn't guess — it checks every item against the plan.

The key insight: the plan IS the code. Agent A doesn't write a spec sheet. A writes a plan that contains every line of code the coder will need. The reviewer's job is to make sure that plan is so complete that the coder never has to ask a single question. That's what makes the builds bulletproof — by the time C starts coding, every decision has already been made and verified.

I built a template and checklist that A follows — research, verify from source, write complete code, self-review, fill gaps. A can't skip steps. B can't approve until every question is answered. The pipeline enforces quality at every stage so I don't have to.

My rule: the plan must be 100% bulletproof with zero errors and evidence to verify every decision before I move forward with a build. No "this should work." No "I think this package exists." Every claim is verified from source. Every code block is complete and tested in the planner's head before the coder ever sees it.

The result: builds come out with no errors. I used to spend hours after a build going back and fixing things — missing dependencies, wrong API signatures, broken imports. Now, 99% of the time, the build produces exactly what I asked for. On the rare occasion something needs troubleshooting, every agent still has complete context because we didn't burn through the session going in circles. The planner remembers the concept. The coder remembers what it built. The tester remembers what it tested. Nobody lost context because each agent only did its one job.

This saves me hours every day.


The Agents

Agent Role What It Does
A Planner Chats with you about the concept, researches everything, writes a build plan with complete code for every file
B Reviewer Reads A's plan and tears it apart. Asks hard questions. Loops with A until there are zero gaps.
C Coder Follows the approved plan exactly. Writes every file, installs deps, builds the project. No improvising.
D Tester Reviews C's code against the plan, runs it, catches bugs. Loops with C until everything passes.
S Supervisor Your diagnostic assistant. If something breaks or loops, S reads the event log and helps figure out what went wrong.

Each agent is a separate Claude Code session running Claude Opus 4.6. They communicate through structured JSON signals routed by an orchestrator. Every restriction is enforced by a PreToolUse hook — the agents literally cannot break the rules.

How It Works

1. Open the viewer
2. Chat with Agent A — describe what you want to build
3. Hit START
4. Watch 5 agents build it autonomously
5. Your project is in ~/Builds/

Phase 0: Concept — You talk to Agent A. Describe what you want. A asks clarifying questions until the scope is clear. This is the only human interaction required.

Phase 1: Planning — A reads the build plan template, researches the concept (web searches, docs, source code), and writes plan.md with complete, copy-pasteable code for every file. No placeholders. A self-reviews multiple times before sending to B.

Phase 1b: Plan Review — B reads the plan and sends structured questions back to A. They loop until B is fully satisfied and approves. The plan is locked. No agent can modify it.

Phase 2: Coding — C reads the locked plan and builds exactly what it says. Every file, every dependency, every line of code.

Phase 3: Code Review + Testing — D reads the plan and the code. Checks every item. If anything doesn't match or fails, D sends issues back to C. They loop until D approves and all tests pass.

Phase 4: Deploy — The finished project is ready.

The plan-review loop between A and B catches design gaps before a single line of code is written. The test loop between C and D catches implementation bugs before anything ships. Each loop has no round limit — they keep going until it's right.


The Viewer

A pixel art office where 5 agents sit at desks. You watch them work in real-time:

  • Live Feed — Every event from every agent, timestamped and color-coded
  • Dashboard — Phase progress, elapsed time, file count, errors
  • 5-Panel Grid — S (supervisor) panel on the left, A/B/C/D on the right. Each panel shows that agent's activity with auto-scroll. Click any panel to expand.
  • Per-Panel Chat — Each panel has its own input. Talk directly to any agent.
  • Controls — START, STOP, Reset, View Plan

When idle, agents wander the office, visit the hookah lounge, and play ping pong.


Requirements

  • Claude Code CLI — this is the engine. You must have the claude command installed and working in your terminal. Install it from claude.ai/code.
  • Active Claude subscription — Max, Pro, or Team. All 5 agent sessions run on your subscription. No API key needed.
  • Node.js 22+
  • pnpm

Installation

git clone https://github.com/johnkf5-ops/the-dev-squad.git
cd the-dev-squad
pnpm install
pnpm dev

Open http://localhost:3000.

That's it. The viewer handles everything — spawning agents, running the orchestrator, managing builds.


How to Use

The UI

The screen is split into two sections:

Top half — A pixel art office with 5 agents at desks. They animate in real-time as they work. Below the office is a live feed showing every event from every agent. To the right is a dashboard with phase progress, elapsed time, and controls.

Bottom half — A 5-panel grid. The S (Supervisor) panel spans the left column. The A, B, C, D panels fill the right in a 2x2 grid. Each panel shows that agent's activity and has its own chat input at the bottom.

Step by Step

1. Reset — Click the Reset button to clear any previous session. This gives you a clean slate.

2. Talk to the Planner — Click on Agent A's panel (top-left of the 4-panel grid, labeled "Planner"). Type your concept in the chat input at the bottom of A's panel and hit Send. Tell A what you want to build. A will ask clarifying questions — answer them until A says it's ready.

3. Start the Pipeline — Click START. The orchestrator takes over. A writes the plan, B reviews it, C codes it, D tests it. You watch it all happen in real-time across the panels.

4. Watch — Each panel auto-scrolls as events come in. Click any A-D panel to expand it into a full scrollable modal for more detail. The dashboard shows which phase you're in and tracks progress.

5. Stop (if needed) — Click STOP at any time to abort the pipeline. This kills the orchestrator and all agent sessions immediately.

6. View Plan — Once A has written the plan, a View Plan button appears. Click it to read the full build plan in a modal.

7. Done — When the pipeline completes, your project is in ~/Builds/<project-name>/. The dashboard shows "COMPLETE" and you'll hear a notification chime.

After the Build

Once the build is complete, you can chat directly with any agent for post-build work. Click on C's panel and ask it to fix a bug. Click on D's panel and ask it to run more tests. Each agent retains context from the build.

The Supervisor (S Panel)

The S panel on the left is not part of the pipeline. S is your diagnostic assistant. If something breaks, stalls, or loops, type in the S panel to ask what went wrong. S can read the event log, the plan, the code, and help you figure out the issue. You don't need to use S during a normal build — it's there when things go sideways.

Controls Reference

Button What It Does
START Creates a project directory from your concept, spawns the orchestrator, begins the autonomous pipeline
STOP Kills the orchestrator and all active agent sessions immediately
Reset Stops everything, clears the staging area, resets any stuck projects. Use before starting a new build.
View Plan Opens the current plan.md in a modal (appears after A writes the plan)

Security

Agents are constrained by hooks, not prompts. A PreToolUse hook gates every tool call:

Agent Can Write Can Run Bash Can Spawn Agents
A (Planner) plan.md only No No
B (Reviewer) Nothing No No
C (Coder) Inside ~/Builds/ (except plan.md) Yes (dangerous cmds need approval) No
D (Tester) Nothing Yes (dangerous cmds need approval) No
S (Supervisor) Unrestricted Yes No

Additional protections:

  • No agent can write outside ~/Builds/
  • Plan is locked after B approves — no agent can modify it
  • Safe bash commands auto-approve, dangerous ones require your click
  • All sessions use --permission-mode auto for Claude's built-in safety classifier

How Agents Communicate

Agents communicate via structured JSON — no text parsing:

// B reviewing A's plan
{ "status": "approved" }
{ "status": "questions", "questions": ["What about error handling?"] }

// D reviewing C's code
{ "status": "approved" }
{ "status": "issues", "issues": ["Missing input validation"] }

// D testing
{ "status": "passed" }
{ "status": "failed", "failures": ["PUT /users returns 500"] }

The orchestrator routes these signals between agents and advances the pipeline when an approval is received.


Project Structure

the-dev-squad/
  src/
    app/
      page.tsx                      # Main page — dashboard, panels, controls
      api/                          # API routes (chat, start, stop, reset, state)
    components/
      mission/                      # Pixel art office scene
    lib/
      use-pipeline.ts               # React hook — polls state, exposes actions
  pipeline/
    orchestrator.ts                 # Spawns agents, routes signals, enforces flow
    .claude/hooks/approval-gate.sh  # Per-agent permission enforcement
    role-a.md, role-b.md, etc.      # Agent role context files
    build-plan-template.md          # Template A follows when writing plans
  public/
    sprites/                        # Character and furniture sprites

Contributing

See CONTRIBUTING.md for guidelines.

License

MIT - see LICENSE for details.

Copyright (c) 2026 CrashOverride LLC


Built with Claude Code. Runs on Claude Code. No API required.

Reviews (0)

No results found