crucible
Health Warn
- License — License: MIT
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Low visibility — Only 6 GitHub stars
Code Pass
- Code scan — Scanned 4 files during light audit, no dangerous patterns found
Permissions Pass
- Permissions — No dangerous permissions requested
No AI report is available for this listing yet.
Stress-test decisions with a 16-persona council. A Claude Code skill for PMs, founders, and builders facing wicked problems.
Crucible
A council of 17 personas that stress-tests one idea under structured conflict. Built as a Claude Code skill. Useful when you are a product manager, founder, or builder staring at a decision that does not have a right answer.
Wicked problems need multiple lenses. This is seventeen of them, with First Principles and Expansionist as the spine.
Full story: claudecodeguide.dev/crucible
What it does
You give Crucible a question. The chair selects a bench from the roster of 17 based on the problem type. Each persona writes an opening take in parallel, blind to the others. Selected pairs then rebut each other (with the opposing argument anonymized so they attack the argument, not the persona). The chair synthesises the transcript and issues a verdict of PROCEED, REFRAME, or KILL, plus a mandatory minority report.
The central tension of every Decision-mode run is First Principles vs Expansionist: strip the problem to its irreducible facts, then ask what the 100x version looks like with AI-era leverage. The verdict must address where the two landed. This is the friction PMs and builders most often skip and most need.
Transcripts persist to disk so your main thread stays clean.
Install
As a Claude Code skill
# Clone into your Claude Code skills folder
git clone https://github.com/mshadmanrahman/crucible ~/.claude/skills/crucible
That is the whole install. Open Claude Code and type /crucible followed by a question.
Without Claude Code
The persona files in personas.md are portable. You can paste a single persona prompt into ChatGPT or Claude.ai and run that lens manually. You lose the orchestration; you keep the rubrics.
Usage
Five modes depending on the weight of the question.
Solo mode (single-lens exploration, ~10-15 seconds)
/crucible --solo first-principles should we ship this feature now or after the refactor?
Or natural language:
give me the Expansionist read on the morning digest cron
One persona, no debate, just the rubric output. Use for exploration, not decisions.
Council mode (daily reflex, ~30-45 seconds)
/crucible --council should I split this feature into two releases?
Five personas in parallel: Contrarian, First Principles, Expansionist, Outsider, Operator. Five short challenges plus a one-line call. Modelled on Andrej Karpathy's LLM Council. Use this between meetings.
PM Daily mode (bounded ideation, ~60 to 90 seconds)
/crucible --pm what is the sharpest first experiment for this discovery?
Four personas with a short verdict and one concrete next action. Use for feature direction, user research design, ideation-grade questions.
Decision mode (full council, ~30 to 60 seconds per phase)
/crucible should I keep working on X vs pivot to Y?
Nine to eleven personas (Core 6 plus bench specialty). Full duels including the mandatory First Principles vs Expansionist showdown. Written transcript. Use for anything with more than a week of time, real money, or reputation at stake.
Existential mode (full roster of 17)
/crucible --existential should I leave my job to ship this full time?
All 17 personas. Extended verdict. Guardrailed: if you invoke this more than once a quarter, the chair warns you that life decisions do not usually move that fast.
The 17 personas
Core 6 (always convened in Decision and Existential modes):
- Contrarian attacks the premise
- Advocate argues the strongest case for
- Pre-mortem names the failure mechanism
- Operator Monday-morning reality check
- First Principles strips the problem to bedrock facts
- Expansionist finds the 100x version with AI-era leverage
Personal bench (career, moonlight, identity):
- Track Record cites what you have already proven
- Goals Check checks against your stated goals
- Success Vision paints the 12-month win as a scene
- Worst Case Cost quantifies the failure cost
Strategic bench (business, product, market):
- Economist follows the money
- Competitor how the market responds
- Consequences chains of causation
- Historian who has tried this
Creative bench:
- Reframer is this the right question
Conditional:
- Outsider activated for user-facing product calls
- Regulator activated for money, data, or platform ToS
The four phases
- Intake. Chair restates the question in one sentence and asks you to confirm before spending subagents on it.
- Parallel openings. Every selected persona writes its take in parallel. Personas are blind to each other.
- Serial duels with anonymized rebuttals. First Principles vs Expansionist always fires in Decision and Existential modes. Other duels: Advocate vs Contrarian, Success Vision vs Worst Case Cost, Economist vs Consequences, Operator vs Historian, Track Record vs Reframer. The opposing argument is shown without persona labels so the rebuttal attacks the argument, not the persona.
- Verdict. Chair reads all openings and rebuttals, writes the transcript, issues an inline verdict with a mandatory minority report and a mandatory First Principles vs Expansionist resolution line.
Model routing
Each persona runs on the model tier that matches its job.
- Opus: Contrarian, First Principles, Consequences, Chair
- Sonnet: Advocate, Expansionist, Economist, Reframer, Pre-mortem, Success Vision, Track Record, Competitor, Regulator
- Haiku: Operator, Goals Check, Worst Case Cost, Historian, Outsider
Why this matters: running 17 large-model agents in parallel collapses context and burns budget. Routing by role keeps a full run cheap enough to use twice a week, and Council mode cheap enough to use daily.
Guardrails
Three are worth calling out.
Motivated convening check. Before firing Decision or Existential mode, the chair asks: has a similar question been convened in the last 30 days? What concrete action will a verdict produce in the next 14 days? What new information has arrived since the last time you thought about this? Honest answers keep the skill from becoming a sophisticated form of procrastination.
Track Record ownership classification. Track Record has to label every cited project as personal moonlight, client work, collaborative work, or day-job work before drawing any pattern claim across them. This exists because the first run confused a piece of client work for a personal choice, and the correction is now structural.
Anonymized rebuttals. In duels, the opposing argument is shown without persona labels. Prevents deference to dramatic framings (Pre-mortem's failure scenes tend to win on tone if you know who wrote them) and forces argumentative honesty.
Files
SKILL.md, orchestration specpersonas.md, all 17 rubricsbenches.md, problem-type to bench mapping plus all 5 modes
Contributing
The roster is deliberately small so each voice has space. But I will add personas that earn their seat.
If you are a PM, founder, or builder with a blind spot you wish a council would catch, write a persona for it and open a pull request. The bar is a concrete rubric. "Name the hidden assumption and attack it with one argument" is a rubric. "Be contrarian" is not.
Issues and PRs welcome at github.com/mshadmanrahman/crucible.
Acknowledgements
The Council mode and the First-Principles-vs-Expansionist spine are inspired by Andrej Karpathy's LLM Council pattern (five lenses: contrarian, first principles, expansionist, outsider, executor). Crucible extends it with bench-typed Decision mode, mandatory minority reports, and anonymized rebuttals.
License
MIT.
Built by Shadman Rahman. More on the backstory at claudecodeguide.dev/crucible.
Reviews (0)
Sign in to leave a review.
Leave a reviewNo results found