heimdall

agent
Security Audit
Warn
Health Warn
  • License — License: MIT
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Warn
  • network request — Outbound network request in bin/lib/behavioral_diff.py
Permissions Pass
  • Permissions — No dangerous permissions requested

No AI report is available for this listing yet.

SUMMARY

Autonomous Claude Code agent — 14 specialized agents, 10-parallel execution, project-specific skill learning, mechanical checkpointing, wave-based planning with acceptance gates, caveman ultra (~75% token savings). One prompt → finished project.

README.md

Heimdall 🛡️

Verification gates for AI coding agents — catches what the model misses, proves it did.

Claude Code
License: MIT
Version

Install

curl -fsSL https://raw.githubusercontent.com/randomittin/heimdall/v2.0.5/install.sh | bash

No sudo. No telemetry. Idempotent — re-run to upgrade. Reversible:

hmd uninstall    # removes everything; nothing else was touched

Prefer to inspect first?

curl -fsSL https://raw.githubusercontent.com/randomittin/heimdall/v2.0.5/install.sh -o install.sh
less install.sh  # function-wrapped, no eval, no base64 — what you read is what runs
bash install.sh

Prerequisites: Claude Code 1.0+ · Git · jq (brew install jq)


First run

hmd demo --run

Scaffolds a real full-stack task, builds it, ends with a summary card and a follow-up prompt. Safe to run sight-unseen — hmd demo (without --run) prints the plan and does nothing.


Why Heimdall

  • Catches the silent failures — ordering races, whole-sequence invariants, missing subsystems that pass a naive green suite.
  • Falsifiable gates — every gate is proven able to go red before it is trusted green. The corpus of real failure cases replays on every change; a regression that once shipped can never ship twice.
  • Proof of correctness, not just generation — the delta Heimdall sells is the receipt that proves the proof can fail. Generalizes: 0.50 median reuse across 8 cold repos.
  • Full audit trailhmd report produces a machine-readable telemetry report of every gate, mutation score, and corpus catch-rate from the last run.

What's inside

Capability Command Status
Verification gates (secret-scan, bloat, falsify) /hmd:verify in Claude · bin/falsify Shipped
Demo task runner hmd demo / hmd demo --run Shipped
Issue-resolution loop hmd (auto-retries failures against corpus) Shipped
Telemetry report hmd report Shipped
Design match (visual diff vs spec) hmd designmatch Shipped
Redum / conformance checker heimdall-redum · heimdall-check Shipped
Reuse engine (cold-repo analysis) bin/lib/reuse_analyzer.py Shipped
Debloat scanner heimdall-debloat --report-only Shipped
Parallel workers hmd --team N "task" (N tmux panes, independent — no shared state) Shipped (no coordination layer)
Benchmark suite heimdall-bench Shipped

Running on your own work

cd /path/to/your/project
heimdall --auto "build a real-time dashboard with auth and charts"

--auto runs a background safety classifier that blocks prompt injection and risky escalation. It is the default. --dangerously-skip-permissions exists but is not the default — only use it in a throwaway sandbox.


Failures visible on purpose

Live flagship status: evals/flagship/STATUS.md — the ❌ rows are kept in view. The corpus dip log and golden provenance are at evals/corpus/CORPUS-STATUS.md and evals/oracles/emulator-gb/fixtures/golden/VERIFICATION.md.

A verification system that can't show you its own failures can't be trusted with yours.


Contributing

  • Stack packs (skills/stacks/) — teach Heimdall a framework's conventions and build commands.
  • Oracle packs (evals/oracles/) — add a falsifiable external gate for a new domain.

See CHANGELOG.md for release history.


License

MIT

Reviews (0)

No results found