little-loops

agent
Security Audit
Warn
Health Warn
  • License — License: MIT
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Pass
  • Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Pass
  • Permissions — No dangerous permissions requested

No AI report is available for this listing yet.

SUMMARY

A decision engine for Claude Code. MIT licensed.

README.md

Little Loops Logo

Version License Python Version PyPI Version Docs

little-loops

The toolkit for long-horizon, eval-gated AI software development. Built as a Claude Code plugin.

Today's agents do small tasks well and ship features poorly. little-loops removes that ceiling with three things they're missing: durability (the run outlives the session), consistency (the toolbelt is the process), and verification (the harness is the spec).

Stop babysitting chats. Start shipping features.

1. Chat sessions are holding you back. Run asynchronous agents until done — scale without limits.

The unit of work is the feature, the sprint, or the overnight optimization — not a single chat. Runs survive terminal close, context exhaustion, and laptop sleep. Parallel sprints fan out across isolated worktrees and complete independently of your terminal.

  • ll-parallel — kick off N concurrent feature implementations in isolated worktrees. Walk away. They converge without you.
  • --background + ll-loop resume — runs survive terminal close, sleep, and reboot. Resume picks up exactly where it stopped, mid-trajectory.
  • harness-optimize — score-gated overnight optimization runs. Accept-or-revert each mutation. If interrupted, resume to the highest-scoring commit.
  • Session handoff — a fresh context picks up mid-issue without losing the thread. Context limits stop being a planning constraint.

Ship features, not sessions.

2. Smart tools create smart processes.

Raw agents re-derive the same structural moves differently each run. The ll- CLI removes the improvisation surface: context gathering, issue lifecycle, sprint moves, and worktree setup all run as typed commands. Two runs of the same feature land in the same shape — by construction, not by prompting.

  • 33 typed CLI tools (ll-issues, ll-sprint, ll-loop, ll-parallel, ll-action, etc.) — structural work runs as commands, not improvised tool calls
  • /ll:manage-issue — composes the CLIs into a fixed plan → implement → verify → complete sequence. The agent reasons inside steps, not about which steps to take
  • Skill harnesses (/ll:ready-issue, /ll:wire-issue, /ll:confidence-check) — same inputs, same gates, same outputs
  • Worktree setup, branch naming, issue ID generation — mechanical operations that produce identical structure across runs

Same feature, same shape, every run.

3. Harness-driven development is awesome. And hard. Auto-generate autonomous harnesses and let your agents go anywhere.

Harness-driven development is TDD's analog for agent-built software: define what "working" looks like first, then iterate until the harness passes. little-loops grades, writes, and improves the harness for you — removing the engineering tax that keeps most teams skipping evals entirely.

The harness grades:

  • Eleven layered gate types (exit code through full agentic simulation), cheapest first — failures route back to execution, not forward
  • Stall detection catches the "already done" no-op that silently burns through iteration budgets

The harness writes itself:

  • /ll:create-eval-from-issues — turn an issue's acceptance criteria into a runnable harness in under a minute
  • /ll:create-loop — auto-derive the full harness from your project config
  • ll-loop validate — dry-run the FSM before paying for a real run

The harness improves itself:

  • harness-optimize — hill-climbing on harness artifacts. One targeted edit per iteration, benchmark, accept on rising score, revert otherwise
  • Prompt optimization loops — point at a prompt, converge to a target score
  • /ll:audit-loop-run — four-valued verdict catches failure modes humans miss

Loops Guide for gate types, FSM authoring, and harness patterns

Point at context. Get a harness.


Install

Prerequisites: Claude Code + Python 3.11+

# Add the GitHub repository as a marketplace
/plugin marketplace add BrennonTWilliams/little-loops

# Install the plugin
/plugin install ll@little-loops

# Install CLI tools (for ll-parallel, ll-loop, ll-auto, etc.)
pip install little-loops

Using Codex CLI? See docs/codex/getting-started.md — run /ll:init --codex and ll-adapt-skills-for-codex --apply to get started.

Alternative install methods

Local development install: see CONTRIBUTING.md.

Manual configuration — add to .claude/settings.local.json:

{
  "extraKnownMarketplaces": {
    "local": {
      "source": {
        "source": "directory",
        "path": "/path/to/little-loops"
      }
    }
  },
  "enabledPlugins": {
    "ll@local": true
  }
}

First 60 seconds

Four ways to feel the difference, top to bottom:

Scan and triage a codebase

/ll:init                  # Auto-detect project type, generate config
/ll:scan-codebase         # Find issues (technical)
/ll:prioritize-issues     # Auto-assign P0–P5 priorities
/ll:map-dependencies      # Cross-issue dependency graph

Ship an issue end-to-end

/ll:manage-issue bug fix BUG-001   # Plan → implement → verify → complete

Fan out a parallel sprint

ll-sprint create v2-launch --issues FEAT-001,FEAT-002,FEAT-003
ll-parallel --workers 3            # Three isolated worktrees, three features, zero babysitting

Eval-driven development

/ll:create-eval-from-issues FEAT-001   # Turn acceptance criteria into a runnable harness
ll-loop validate harness-optimize      # Dry-run the FSM before paying for a real run
ll-loop run harness-optimize -b        # Score-gated hill climbing in the background

What's in the box

  • 28 slash commands — issue discovery, refinement, planning, code quality, git, automation
  • 9 specialized agents — codebase analysis, quality assurance, automation, and research
  • 63 skills — deterministic harnesses for common workflows (confidence checks, issue wiring, loop creation)
  • 33 CLI toolsll-auto, ll-parallel, ll-sprint, ll-loop, ll-action, and more
  • 83 FSM loops — recurring automation workflows (backlog triage, sprint building, eval harnesses)
  • Configuration system — project-type templates for Python, JS/TS, Go, Rust, Java, .NET, and generic
  • Design tokens — WCAG AA palette template set with FSM context injection for artifact-generating loops

Full reference: Command Reference · CLI Reference


Documentation

License

MIT

Reviews (0)

No results found