My Claude Code Setup

Actively maintained. A summary of how I use Claude Code for academic work — slides, papers, data analysis, and more — packaged so you can fork it for your own research. See CHANGELOG.md for the latest changes.

Live site: psantanna.com/claude-code-my-workflow

A ready-to-fork foundation for AI-assisted academic work. You describe what you want — lecture slides, a research paper, a data analysis, a replication package — and Claude plans the approach, runs specialized agents, fixes issues, verifies quality, and presents results. Like a contractor who handles the entire job. Extracted from a production PhD course and extended by a growing community.

Quick Start (5–10 minutes, plus ~30 min for first-time installs)

Before you start: Claude Code + git are the minimum. To run the included HelloWorld demos end-to-end you also need XeLaTeX (Beamer sample) and Quarto (Quarto sample). R and the GitHub CLI are recommended. Python 3 is used by a few internal scripts (check-palette-sync.py, check-tikz-prevention.py) and is pre-installed on macOS/Linux. Full list in Prerequisites below. Fastest path: clone first, then run ./scripts/validate-setup.sh — it reports exactly what's missing with install links.

Only need Python/R/markdown? You don't need XeLaTeX or Quarto. The agents, rules, skills, and orchestration patterns work for any text/code artifact. Skip the HelloWorld demos and head straight to /data-analysis, /review-paper, /lit-review, or /review-r.

Session 2 onwards: MEMORY.md (committed) collects generic [LEARN] entries that help all forkers; .claude/state/personal-memory.md (gitignored) is for machine-specific notes. See .claude/rules/meta-governance.md for the distinction.

1. Fork & Clone

# Fork this repo on GitHub (click "Fork" on the repo page), then:
git clone https://github.com/YOUR_USERNAME/claude-code-my-workflow.git my-project
cd my-project
./scripts/validate-setup.sh        # reports missing tools with install links

Replace YOUR_USERNAME with your GitHub username.

2. Start Claude Code and Paste This Prompt

claude

Using VS Code? Open the Claude Code panel instead. Everything works the same — see the full guide for details.

Avoid prompt fatigue. Out of the box, Claude Code asks permission before it edits files or runs shell commands. After the first few approvals, toggle Auto-accept edits mode (a keybinding; see the permission modes section of the guide) or run claude --permission-mode acceptEdits. For fully autonomous runs on a trusted repo, Bypass mode skips prompts entirely. The template's .claude/settings.json pre-approves ~100 common Bash and Edit/Write patterns, so even at default permissions most work is unattended.

Then paste the starter prompt from the guide, filling in your project details:

I am starting to work on [PROJECT NAME] in this repo. [Describe your project in 2–3 sentences.] I've set up the Claude Code academic workflow... Please read the configuration files and adapt them for my project. Enter plan mode and start.

The full guide has the complete starter prompt with all the details.

What this does: Claude reads all the configuration files, fills in your project name, institution, and preferences, then enters contractor mode — planning, implementing, and (within the skill you invoke) running the review + verify loop. You approve the plan, invoke a skill, and the skill handles the rest within its scope.

Heavily adapting CLAUDE.md for a non-academic project? Anthropic's built-in /init command will re-derive a CLAUDE.md from your codebase as a starting point. The pre-shipped CLAUDE.md in this template already covers the academic setup — you only need /init if your fork diverges substantially (e.g., a Python/ML project that doesn't use LaTeX or Quarto).

3. Verify Your Setup

Before building real lectures, confirm your environment works:

./scripts/validate-setup.sh        # Checks XeLaTeX, Quarto, Python, git, etc.

Then inside Claude:

/compile-latex HelloWorld          # Compiles Slides/HelloWorld.tex to PDF
/deploy HelloWorld                 # Renders Quarto/HelloWorld.qmd to HTML

If both succeed, delete Slides/HelloWorld.tex and Quarto/HelloWorld.qmd and start on your real work.

How It Works

Goal-first, gate-enforced (the v2.0 shift)

You don't craft a perfect prompt — you state a goal and let the work loop toward it under gates. Specialist agents do the labor; enforcing gates decide when it's good enough; you adjudicate the disagreements they surface. Three things make that trustworthy:

Real gates, not reminders. A version-controlled pre-commit hook (run ./scripts/install-hooks.sh once) runs the surface-sync + quality (≥80) checks on every commit, so bypassing the skill no longer bypasses the review. A git-guardrails hook blocks destructive git commands (reset --hard, clean -f, push --force, add -A), and the review runtime re-checks any reviewer-introduced "fatal" finding before it counts.
A real orchestration runtime. Reviews fan out to forked specialist agents, reduce over a shared finding schema, judge with a hallucination gate, and loop until dry — see orchestrator-protocol.md.
Ground truth as a process. A mismatch isn't always a failure: a defensible, named alternative is recorded as EXPLAINED and carried into your response-to-referees, while genuine errors stay fail-closed.
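One review round (fan out, reduce, judge) can be sketched roughly as follows. This is a minimal illustration, not the runtime described in orchestrator-protocol.md: the `Finding` fields, the `review_round` function, and the quote-based `quote_is_present` check standing in for the hallucination gate are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Finding:
    """Hypothetical shared finding schema (the real one may differ)."""
    quote: str      # text the reviewer claims appears in the artifact
    issue: str      # what is wrong with it
    severity: str   # assumed labels: "fatal", "major", "minor"


Reviewer = Callable[[str], Iterable[Finding]]


def quote_is_present(artifact: str, finding: Finding) -> bool:
    """Toy hallucination gate: keep a finding only if its quote is really there."""
    return finding.quote in artifact


def review_round(artifact: str, reviewers: List[Reviewer],
                 is_grounded: Callable[[str, Finding], bool]) -> List[Finding]:
    # Fan out: every specialist reviews the same artifact independently.
    raw = [f for reviewer in reviewers for f in reviewer(artifact)]
    # Reduce: identical findings from different reviewers collapse to one
    # (frozen dataclasses are hashable, and dict keys keep first-seen order).
    unique = list(dict.fromkeys(raw))
    # Judge: drop findings that cannot be grounded in the artifact.
    return [f for f in unique if is_grounded(artifact, f)]
```

Because every reviewer emits the same record type, deduplication and judging stay independent of which specialist produced a finding.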

This is not an autonomous daemon — the loop is always you- or skill-initiated, and you stay the auditor. Scheduled Routines handle recurring chores (nightly reproducibility, weekly lit-delta, inbox triage) and notify only when they find something.

Contractor Mode

You describe a task. For complex or ambiguous requests, Claude first creates a requirements specification with MUST/SHOULD/MAY priorities and clarity status (CLEAR/ASSUMED/BLOCKED). You approve the spec, then Claude plans the approach and invokes the right skill (e.g. /create-lecture, /qa-quarto, /review-paper --adversarial). That skill implements the orchestrator runtime internally — implement, verify, review, fix, re-verify, score — and returns a summary when the work meets quality standards. Say "just do it" and it runs the full loop; commits still require an explicit /commit (which the pre-commit hook then gates).
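As a rough illustration of the requirements specification, each item can be modeled as a record carrying a priority and a clarity status. The class names and the rule that a BLOCKED MUST item stops planning are assumptions made for this sketch; the template's own format is in templates/requirements-spec.md.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Priority(Enum):
    MUST = "MUST"
    SHOULD = "SHOULD"
    MAY = "MAY"


class Clarity(Enum):
    CLEAR = "CLEAR"
    ASSUMED = "ASSUMED"
    BLOCKED = "BLOCKED"


@dataclass
class Requirement:
    text: str
    priority: Priority
    clarity: Clarity


def blockers(spec: List[Requirement]) -> List[Requirement]:
    """MUST items still BLOCKED (assumed rule: these stop planning)."""
    return [r for r in spec
            if r.priority is Priority.MUST and r.clarity is Clarity.BLOCKED]


def assumptions(spec: List[Requirement]) -> List[Requirement]:
    """Items the user should confirm before approving the spec."""
    return [r for r in spec if r.clarity is Clarity.ASSUMED]
```

Surfacing the ASSUMED items separately gives the approval step something concrete to confirm or correct.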

Specialized Agents

Instead of one general-purpose reviewer, 18 focused agents each check one dimension. A representative sample:

proofreader — grammar/typos
slide-auditor — visual layout
pedagogy-reviewer — teaching quality
r-reviewer — R code quality
domain-reviewer — field-specific correctness, slides (template — customize for your field)
domain-referee / methods-referee / editor — manuscript peer-review pipeline (/review-paper --peer)

Each is better at its narrow task than a generalist would be. The /slide-excellence skill runs the slide-review agents in parallel; /review-paper --peer runs the paper-review pipeline. The same pattern extends to any academic artifact — manuscripts, data pipelines, proposals.

Adversarial QA

Two agents work in opposition. The critic reads both the Beamer and Quarto versions and produces harsh findings; the fixer implements exactly the fixes the critic calls for. They loop until dry, converging when a round surfaces no new issue (the 5-round cap is a fallback, not the primary stop). This catches errors that single-pass review misses.
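The stopping rule can be sketched as a small loop. This is only an illustration under assumptions: `loop_until_dry`, the toy `toy_critic` and `toy_fixer`, and the idea of tracking already-seen issues in a set are hypothetical stand-ins for the real agents.

```python
from typing import Callable, List, Tuple


def loop_until_dry(artifact: str,
                   critic: Callable[[str], List[str]],
                   fixer: Callable[[str, List[str]], str],
                   max_rounds: int = 5) -> Tuple[str, int, bool]:
    """Return (artifact, fix_rounds_run, converged)."""
    seen = set()
    for round_no in range(1, max_rounds + 1):
        # Only issues not reported in an earlier round count as new.
        new_issues = [i for i in critic(artifact) if i not in seen]
        if not new_issues:
            return artifact, round_no - 1, True   # dry: primary stop
        seen.update(new_issues)
        artifact = fixer(artifact, new_issues)
    return artifact, max_rounds, False            # cap reached: fallback stop


# Toy critic and fixer for demonstration: they only know two typos.
TYPOS = {"teh": "the", "recieve": "receive"}


def toy_critic(text: str) -> List[str]:
    return [wrong for wrong in TYPOS if wrong in text]


def toy_fixer(text: str, issues: List[str]) -> str:
    for wrong in issues:
        text = text.replace(wrong, TYPOS[wrong])
    return text
```

Returning the `converged` flag separately lets a caller distinguish a clean stop from a run that merely hit the cap.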

Quality Review

Every artifact gets a score (0–100). Scores below threshold halt the workflow and surface the findings — the user decides whether to fix or explicitly override:

80 — commit threshold
90 — PR threshold
95 — excellence (aspirational)

Framing honesty: thresholds are advisory at the harness level. Claude Code itself does not enforce them; the /commit skill runs the quality checks and halts on failure. As of v2.0, running ./scripts/install-hooks.sh once also installs a real pre-commit hook (.githooks/pre-commit) that runs the surface-sync + quality (≥80) gates on every commit, so bypassing the skill no longer bypasses the review. Opt out per commit with SKIP_QUALITY_GATE=1 or git commit --no-verify.
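The threshold logic, including the per-commit opt-out, can be sketched in a few lines. The 80/90 values and the SKIP_QUALITY_GATE variable come from the text above; the function name `quality_gate`, its return shape, and the choice to apply the opt-out only to commits are assumptions, and the shipped .githooks/pre-commit may work differently.

```python
import os
from typing import Mapping, Optional, Tuple

# 95 ("excellence") is aspirational, so it is not treated as a gate here.
THRESHOLDS = {"commit": 80, "pr": 90}


def quality_gate(score: int, action: str,
                 env: Optional[Mapping[str, str]] = None) -> Tuple[bool, str]:
    """Return (allowed, reason) for a scored artifact."""
    env = os.environ if env is None else env
    if action not in THRESHOLDS:
        raise ValueError(f"unknown action: {action!r}")
    # Assumed behavior: the opt-out applies to commits only.
    if action == "commit" and env.get("SKIP_QUALITY_GATE") == "1":
        return True, "quality gate skipped via SKIP_QUALITY_GATE=1"
    needed = THRESHOLDS[action]
    if score >= needed:
        return True, f"score {score} meets the {action} threshold ({needed})"
    return False, f"score {score} is below the {action} threshold ({needed})"
```

A hook built on this would exit non-zero whenever `allowed` is False, which is what makes git abort the commit.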

Context Survival

Plans, specifications, and session logs survive auto-compression and session boundaries. The PreCompact hook saves a context snapshot before Claude's auto-compression triggers, ensuring critical decisions are never lost. MEMORY.md accumulates learning across sessions, so patterns discovered in one session inform future work.
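A snapshot-saving step of this kind might look like the sketch below. It is not the template's actual hook: the payload keys (`session_id`, `trigger`, `transcript_path`), the `save_snapshot` name, and the output file naming are all assumptions for illustration.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional


def save_snapshot(payload: dict, out_dir, now: Optional[datetime] = None) -> Path:
    """Write a small JSON snapshot describing the session about to be compacted."""
    now = now or datetime.now(timezone.utc)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    snapshot = {
        "saved_at": now.isoformat(),
        # Assumed payload keys; missing ones fall back to placeholders.
        "session_id": payload.get("session_id", "unknown"),
        "trigger": payload.get("trigger", "unknown"),
        "transcript_path": payload.get("transcript_path"),
    }
    path = out_dir / f"precompact-{now.strftime('%Y%m%dT%H%M%SZ')}.json"
    path.write_text(json.dumps(snapshot, indent=2), encoding="utf-8")
    return path
```

Wired up as a command hook, a script would parse the event payload it receives and pass it to `save_snapshot`; writing to a timestamped file keeps earlier snapshots from being overwritten.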

For forced compression (long pipelines, mid-plan handoffs), /compress-session (v1.9.0) distils the conversation into a structured note (decisions, next actions, and discarded-as-noise) instead of letting auto-compaction truncate it. /promote-memory (v1.9.0) periodically promotes generic learnings from the gitignored personal-memory.md into the committed MEMORY.md via a five-critic council.

Verification Discipline (v1.7.0+)

Multiple complementary verification layers run before submission:

/verify-claims (v1.7.0) — Chain-of-Verification with a forked verifier that cannot self-confirm because it has never seen the draft. v1.9.0 adds HIGH/MED/LOW-WARN severity tiers; HIGH-WARN (fabricated citation, numerical contradiction) gate-refuses /commit.
/audit-reproducibility (v1.7.0; Stata coverage v1.9.0) — every numeric claim in the manuscript is cross-checked against the script output that produced it. v1.9.0 adds passport.yaml — a per-paper YAML state file with PASS/FAIL/STALE/UNVERIFIED status per claim.
/humanize (v1.9.0) — detect AI-voice tells (boilerplate transitions, hedging stacking, sycophancy) before submission. Read-only by design; auto-rewriting degrades quality.
/review-paper --variance N (v1.9.0) — runs N referees with sampled dispositions and reports a decision distribution, not a point estimate. Motivated by AgentReview (ACL 2024) finding 37% of decisions vary purely from disposition sampling.
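To make the per-claim statuses concrete, here is a minimal sketch of how one numeric claim could be classified. The PASS/FAIL/STALE/UNVERIFIED labels come from the list above; the `Claim` fields, the absolute tolerance, and the rule that a changed script output means STALE are assumptions, not the actual /audit-reproducibility logic.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class Claim:
    claim_id: str
    reported: float                # number as printed in the manuscript
    computed: Optional[float]      # value found in script output, if any
    output_changed: bool = False   # assumed STALE trigger: output changed since last check


def claim_status(claim: Claim, abs_tol: float = 0.0005) -> str:
    """Classify one claim; abs_tol=0.0005 allows rounding to three decimals."""
    if claim.computed is None:
        return "UNVERIFIED"        # nothing to compare against
    if claim.output_changed:
        return "STALE"             # the previous verdict can no longer be trusted
    close = math.isclose(claim.reported, claim.computed,
                         rel_tol=0.0, abs_tol=abs_tol)
    return "PASS" if close else "FAIL"
```

An absolute tolerance tied to the number of printed decimals treats ordinary rounding as agreement while still flagging a coefficient that differs in its reported digits.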

The Guide

For a comprehensive walkthrough, read the full guide (or see the source).

It covers:

Why This Workflow Exists — the problem and the vision
Getting Started — fork, paste one prompt, and Claude sets up the rest
The System in Action — specialized agents, adversarial QA, quality scoring
The Building Blocks — CLAUDE.md, rules, skills, agents, hooks, memory
Workflow Patterns — slides, research, reproducibility, presentation rhetoric, sequential adversarial audits, and more
The Ecosystem — extensions by clo-author, claudeblattman, MixtapeTools, autoresearch, ClaudeCodeTools, and a growing community
Customizing for Your Domain — creating your own reviewers and knowledge bases

2026 Features

The guide covers Claude Code's latest capabilities:

Model lineup — Opus 4.8 (claude-opus-4-8) is the newest model and the API default (GA 2026-05-28, $5/$25 per MTok, 1M context, defaults to high effort); Opus 4.7 is the prior generation. Sonnet 4.6 is the workhorse (1M context); Haiku 4.5 the fast tier. Sonnet 4 + original Opus 4 retire 2026-06-15 → migrate to Sonnet 4.6 / Opus 4.8. (Verified against Anthropic docs 2026-05-31.)
Effort levels — /effort sets cost vs. thoroughness (low / medium / high / xhigh / max). Opus 4.8 defaults to high — its high does roughly what 4.7's xhigh did for fewer tokens, so reserve xhigh for extended exploration and ultracode (xhigh + dynamic workflows) for the largest autonomous runs.
/goal <verifiable condition> (v1.9.0; Anthropic May 2026) — keep working across turns until a fast model confirms the condition holds. Pairs with /commit quality gates for verified-end-state runs.
claude agents dashboard (v1.9.0; Anthropic May 2026) — single screen for parallel review work (/review-paper --peer, /slide-excellence).
Cost-Conscious Composition — prompt-cache TTL (5-min default on API keys; 1-hour automatic on Claude subscriptions), 70/20/10 model routing (Haiku/Sonnet/Opus), /cost + /usage monitoring, Agent SDK credit-pool split (2026-06-15).
Skill frontmatter — effort, context: fork, agent, hooks, disable-model-invocation (v1.8.0+), and dynamic content ($ARGUMENTS, !command syntax)
Permission modes — Normal, Auto-accept, Plan, Auto (classifier-gated; on Team / Enterprise / API and rolling out to Max; needs Opus 4.6+ or Sonnet 4.6), Bypass
Hook handler types — command, prompt, and HTTP handlers with 20+ hook events; hooks see effort.level and $CLAUDE_EFFORT (Apr 2026 Week 19)
Advanced agent configuration — model, maxTurns, isolation, tool restrictions; model-routing.md rule codifies per-agent tier (v1.9.0)
Worktree base ref (v1.9.0; Anthropic Apr 2026) — worktree.baseRef setting controls fresh (default; remote default-branch) vs head (local HEAD) for new worktrees
Built-in skills — /fewer-permission-prompts, /team-onboarding, /autofix-pr, /powerup, Ultraplan, /loop (self-pacing)
Plugins — /discover-plugins for third-party extensions

Use Cases

Academic Task	How This Workflow Helps
Lecture slides (Beamer/Quarto)	Full creation, translation, multi-agent review, deployment
Research papers	Literature review, manuscript review, simulated peer review (`/review-paper --peer [journal]`), reviewer-disposition variance reporting (`--variance N`)
Data analysis	End-to-end R pipelines (`/data-analysis`) or Stata pipelines via `stata-mcp` (`/stata-replication`, v1.9.0), replication verification, publication-ready output
Monte Carlo simulations	Reproducible simulation studies (`/simulation-study`, v1.10.0) — parameterized DGP, estimator grid, bias/RMSE/coverage/size/power with Monte Carlo SEs, dedicated `sim-reviewer` review pass
Package development	R package release gate (`/r-package-check`, v1.10.0) — `devtools::document()` + tests + `R CMD check --as-cran` + CRAN-policy triage + `r-package-reviewer` (Stata / Python checks on the roadmap)
Replication packages	AEA-compliant packaging, reproducibility audit trails, `passport.yaml` claims provenance (v1.9.0)
Presentations	Rhetoric of decks principles, visual audit, cognitive load review
Research proposals	Structured drafting with adversarial critique
Preregistration	OSF / AsPredicted / AEA RCT Registry-ready document (`/preregister --style`) — full workflow in Pattern 16
Manuscript submission discipline	`/humanize` (detect AI voice), `/verify-claims` HIGH-WARN gate (block fabricated citations), reviewer-disposition variance

Disciplines preloaded: Economics (top-5 journal profiles, R conventions) and Political Science (APSR / AJPS / JOP profiles, formal-theory + survey-experiment paper types, conjoint/cjoint conventions). Forkers extend for psych / sociology / public-health via journal profiles + paper types + discipline cards.

One repo, many project types

This workflow is designed as a single hub for an entire research program, not one paper at a time. The same CLAUDE.md, rules, agents, and quality gates serve courses and lectures, papers and referee reports, data analysis and replication packages, Monte Carlo simulation studies (/simulation-study + sim-reviewer), and the R package release gate (/r-package-check + r-package-reviewer); the last two are new in v1.10.0. On the roadmap: Stata / Python package checks (SSC / PyPI) and personal-productivity workflows. See .claude/references/v2.0-backlog.md for what's next.

What's Included

18 agents, 51 skills, 32 rules, 7 hooks

Agents (`.claude/agents/`)

Agent	What It Does
`proofreader`	Grammar, typos, overflow, consistency review
`slide-auditor`	Visual layout audit (overflow, font consistency, spacing)
`pedagogy-reviewer`	13-pattern pedagogical review (narrative arc, notation density, pacing)
`r-reviewer`	R code quality, reproducibility, and domain correctness
`tikz-reviewer`	Merciless TikZ diagram visual critique
`beamer-translator`	Beamer-to-Quarto translation specialist
`quarto-critic`	Adversarial QA comparing Quarto against Beamer benchmark
`quarto-fixer`	Implements fixes from the critic agent
`verifier`	End-to-end task completion verification
`domain-reviewer`	Template for your field-specific substance reviewer
`claim-verifier` (v1.7.0)	Chain-of-Verification fact-checker in a forked context
`editor` (v1.5.0)	Journal editor for `/review-paper --peer` (desk review + referee selection + synthesis)
`domain-referee` (v1.5.0)	Disposition-primed substance referee for `--peer` mode
`methods-referee` (v1.5.0+)	Paper-type-aware methodology referee (6 paper types)
`humanize-auditor` (v1.9.0)	Read-only AI-voice auditor invoked by `/humanize`
`promote-memory-council` (v1.9.0)	Five-critic council for `[LEARN]` promotion to MEMORY.md
`sim-reviewer` (v1.10.0)	Monte Carlo simulation reviewer — DGP/estimand match, Monte Carlo SE, coverage-vs-truth, claims↔tables parity
`r-package-reviewer` (v1.10.0)	R package-source reviewer — DESCRIPTION/NAMESPACE hygiene, roxygen completeness, testthat coverage, CRAN-policy red flags

Skills (`.claude/skills/`)

Skill	What It Does
`/compile-latex`	3-pass XeLaTeX compilation with bibtex
`/deploy`	Render Quarto + sync to GitHub Pages
`/extract-tikz`	TikZ diagrams to PDF to SVG pipeline
`/proofread`	Launch proofreader on a file
`/visual-audit`	Launch slide-auditor on a file
`/pedagogy-review`	Launch pedagogy-reviewer on a file
`/review-r`	Launch R code reviewer
`/qa-quarto`	Adversarial critic-fixer loop (loops until dry; 5-round cap is a fallback)
`/slide-excellence`	Combined multi-agent review
`/translate-to-quarto`	Full 11-phase Beamer-to-Quarto translation
`/validate-bib`	Cross-reference citations against bibliography
`/devils-advocate`	Challenge design decisions before committing
`/create-lecture`	Full lecture creation workflow
`/commit`	Stage, commit, create PR, and merge to main
`/lit-review`	Literature search, synthesis, and gap identification
`/research-ideation`	Generate research questions and empirical strategies
`/interview-me`	Interactive interview to formalize a research idea
`/review-paper`	Manuscript review: structure, econometrics, referee objections
`/data-analysis`	End-to-end R analysis with publication-ready output
`/learn`	Extract non-obvious discoveries into persistent skills
`/context-status`	Show session health and context usage
`/deep-audit`	Repository-wide consistency audit
`/permission-check`	Diagnose permission layers when prompts fire unexpectedly
`/audit-reproducibility`	Enforce tolerance thresholds on paper ↔ code numeric claims
`/new-diagram`	Scaffold a TikZ diagram from the snippet gallery with prevention + review
`/respond-to-referees`	R&R response-letter generator (maps referee comments to revisions)
`/seven-pass-review`	Seven-pass adversarial manuscript review (parallel forked subagents)
`/checkpoint`	Structured session-handoff snapshot (state + plan pointers + next actions). Companion to narrative session logs.
`/preregister`	Generate a preregistration document (OSF / AsPredicted / AEA RCT Registry style) from a research spec
`/verify-claims` (v1.7.0)	Chain-of-Verification fact-check (forked verifier, fresh context). HIGH/MED/LOW-WARN severity tiers (v1.9.0); HIGH-WARN gate-refuses `/commit`.
`/humanize` (v1.9.0)	Detect AI-voice tells in academic prose (10 detection categories; read-only, no rewrite)
`/compress-session` (v1.9.0)	Distil current session into structured notes (decisions, next actions, discarded-as-noise) before auto-compaction
`/promote-memory` (v1.9.0)	Five-critic council that votes on which `[LEARN]` entries graduate from personal-memory.md to MEMORY.md
`/stata-replication` (v1.9.0)	End-to-end Stata pipeline via the `stata-mcp` MCP server (the Stata counterpart to `/data-analysis`, which serves R-first projects)
`/simulation-study` (v1.10.0)	Scaffold + run a reproducible Monte Carlo study — parameterized DGP, estimator grid, seeded replications, bias/RMSE/coverage/size/power with Monte Carlo SEs
`/r-package-check` (v1.10.0)	R package release gate — `devtools::document()` + tests + `R CMD check --as-cran`, triage ERROR/WARNING/NOTE vs CRAN policy, `r-package-reviewer` pass
`/replication-package` (v2.0)	Assemble a submission-ready DCAS / openICPSR replication package — standard README, dataset manifest, computational-requirements capture, Table/Figure → script:line map, confidential-data deposit note (blocks on `/audit-reproducibility` FAIL)
`/capture-environment` (v2.0)	Snapshot the computational environment for a replication package — renv.lock + sessionInfo.txt (R), requirements.txt / environment.yml / uv.lock (Python), Stata version + ado list, seeds/RNG, optional pinning Dockerfile
`/did-event-study` (v2.0)	Thin wrapper for staggered DiD / event-study via canonical packages (Callaway–Sant'Anna `did`, Sun–Abraham `fixest::sunab`, HonestDiD sensitivity; Stata equivalents) — surfaces each package's native diagnostics, never reimplements an estimator
`/power-analysis` (v2.0)	Power / required-N / minimum-detectable-effect for study design — two-arm RCT (clustering/ICC, unequal allocation), multi-arm corrections, simulation-based power for non-standard designs; feeds `/preregister`
`/disclosure-check` (v2.0)	Statistical-disclosure-limitation pre-screen for restricted/confidential-data outputs (small cells, complementary-suppression gaps, dominance, PII); CRITICAL/WARNING/OK + gate
`/grant-proposal` (v2.0)	Scaffold an NSF/NIH/ERC/foundation grant proposal by composing primitives (spec → aims/methods, delegated DMP + facilities, coherence pass + requirements checklist)
`/data-management-plan` (v2.0)	Funder-compliant Data Management Plan (NSF / NIH DMS 2023 / ERC / Horizon Europe) — folds in disclosure-avoidance + IRB constraints and a replication-package/environment plan; outputs a draft + funder checklist
`/coauthor-brief` (v2.0)	Collaborator handoff brief — what changed since last brief, per-artifact state, open questions, reproduce-locally + restricted-data access steps
`/triage-inbox` (v2.0)	Schedulable academic inbox + calendar triage via Gmail/Calendar MCP — classifies referee requests, R&R/editor, co-author threads, seminar/conference invites, grant/admin deadlines; proposes one human-gated action each (draft reply, calendar hold, `/new-referee-project`, `/coauthor-brief`, snooze); emits a digest + referee-obligations tracker; degrades gracefully when MCP is absent; never auto-sends
`/diagnose` (v2.0)	Root-cause a wrong/failing empirical result — disciplined reproduce → minimise → hypothesise → instrument → fix loop; tuned for research-code bugs (type coercion, NA/merge blow-ups, clustering/SE choice, seed/package-version drift); `--no-fix` localizes without editing
`/syllabus` (v2.0)	Build/restructure a course syllabus from a topic or reading list — course description + prerequisites, week-by-week schedule (topic→readings→deliverables), measurable learning objectives, assessment scheme + rubric, standard policies (late work / AI use / integrity / accessibility), and a per-week work-list mapping weeks to `/create-lecture` decks; economics-aware (PhD metrics/micro/macro sequences, undergrad)
`/teach-from-paper` (v2.0)	Reads a paper end-to-end and pitches it to a stated audience level — lecture outline (motivation → setup → key result → method → takeaways), the 3-5 results worth presenting with intuition, a slide skeleton for `/create-lecture`, discussion questions, and a problem-set brief for `/scaffold-exercises`
`/respond-to-eval` (v2.0)	Teaching analogue of `/respond-to-referees` — clusters course-eval comments into themes, weights by frequency (signal vs noise), classifies Keep / Change / Investigate / Out-of-scope, and drafts concrete changes mapped to the syllabus + slide decks; saves the plan to `quality_reports/teaching/`
`/scaffold-exercises` (v2.0)	Scaffold a graded problem set across analytical/empirical/coding types, with worked solutions and "why this matters" explainers emitted to a separate solution key
`/new-skill` (v2.0)	Scaffold a new skill that follows this repo's conventions — interviews for purpose, triggers, and tools, writes `.claude/skills/<name>/SKILL.md` from the template with frontmatter/body that pass `check-skill-integrity.py` first try, then reminds to add the surface-table rows

Research Workflow

Feature	What It Does
Exploration folder	Structured `explorations/` sandbox with graduate/archive lifecycle
Fast-track workflow	60/100 quality threshold for rapid prototyping
Simplified orchestrator	implement → verify → score → done (no multi-round reviews)
Enhanced session logging	Structured tables for changes, decisions, verification
Merge-only reporting	Quality reports at merge time only
Math line-length exception	Long lines acceptable for documented formulas
Workflow quick reference	One-page cheat sheet at `.claude/WORKFLOW_QUICK_REF.md`

Rules (`.claude/rules/`)

Rules use path-scoped loading: always-on rules load every session (~100 lines total); path-scoped rules load only when Claude works on matching files. Claude follows ~150 instructions reliably, so less is more.
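The loading decision can be pictured with a small matcher. This is a simplified sketch: the glob patterns, the `rules_for` function, and the use of Python's `fnmatchcase` are assumptions, and Claude Code's real path matching may follow different glob semantics.

```python
from fnmatch import fnmatchcase
from typing import Dict, List

# Rule name -> path globs. An empty list stands for "no paths: frontmatter",
# i.e. an always-on rule. The patterns are illustrative, not copied from the repo.
RULES: Dict[str, List[str]] = {
    "plan-first-workflow": [],
    "r-code-conventions": ["*.R"],
    "exploration-folder-protocol": ["explorations/*"],
}


def rules_for(file_path: str, rules: Dict[str, List[str]]) -> List[str]:
    """Names of the rules that would load when working on file_path."""
    return [
        name for name, globs in rules.items()
        # fnmatchcase lets "*" match "/" too, so "*.R" covers nested paths.
        if not globs or any(fnmatchcase(file_path, g) for g in globs)
    ]
```

For example, `rules_for("README.md", RULES)` yields only the always-on rule, which is how the per-session instruction count stays small.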

Always-on (no paths: frontmatter — load every session):

Rule	What It Enforces
`plan-first-workflow`	Plan mode for non-trivial tasks + context preservation
`orchestrator-protocol`	Goal-first review runtime: fan-out → reduce → judge (+ hallucination gate) → loop-until-dry (the contractor loop, now a real runtime)
`session-logging`	Three logging triggers: post-plan, incremental, end-of-session
`meta-governance`	Template vs. working project distinctions
`prompt-shaping` (v2.0)	Ambient habit — shape informal/ambiguous requests before acting (replaces the retired `/prompt` + `/prompt-only` skills)

Path-scoped (load only when working on matching files):

Rule	Triggers On	What It Enforces
`verification-protocol`	`.tex`, `.qmd`, `docs/`	Task completion checklist
`single-source-of-truth`	`Figures/`, `.tex`, `.qmd`	No content duplication; Beamer is authoritative
`quality-gates`	`.tex`, `.qmd`, `*.R`	80/90/95 scoring + tolerance thresholds
`r-code-conventions`	`*.R`	R coding standards + math line-length exception
`tikz-visual-quality`	`.tex`	TikZ diagram visual standards
`beamer-quarto-sync`	`.tex`, `.qmd`	Auto-sync Beamer edits to Quarto
`pdf-processing`	`master_supporting_docs/`	Safe large PDF handling
`proofreading-protocol`	`.tex`, `.qmd`, `quality_reports/`	Propose-first, then apply with approval
`no-pause-beamer`	`.tex`	No overlay commands in Beamer
`replication-protocol`	`*.R`	Replicate original results before extending
`knowledge-base-template`	`.tex`, `.qmd`, `*.R`	Notation/application registry template
`orchestrator-research`	`*.R`, `explorations/`	Simple orchestrator for research (no multi-round reviews)
`exploration-folder-protocol`	`explorations/`	Structured sandbox for experimental work
`exploration-fast-track`	`explorations/`	Lightweight exploration workflow (60/100 threshold)
`tikz-prevention` (v1.4.x)	`Slides/`, `Figures/`, `Preambles/**`	TikZ pre-flight grep checks (P3/P4 collision avoidance)
`tikz-measurement` (v1.5.x)	`Slides/`, `Figures/`, `Preambles/`, `scripts/`	Bézier curve depth math + 6-pass collision protocol (from MixtapeTools)
`content-invariants` (v1.6.x)	`.tex`, `.qmd`, `Preambles/`, `scripts/R/**`	Pre-Flight Reports — proves inputs were read before work
`cross-artifact-review` (v1.7.0)	`master_supporting_docs/`, `.tex`, `.qmd`	Paper ↔ code dependency graph; auto-invokes `/review-r` + `/audit-reproducibility`
`post-flight-verification` (v1.7.0)	Skills generating factual claims	Chain-of-Verification protocol with forked verifier
`summary-parity` (v1.8.x)	`CHANGELOG.md`, `README.md`, `.qmd`, skill/rule/agent `.md`	Anti-whack-a-mole: re-verify summaries against their bodies
`model-routing` (v1.9.0)	`.claude/agents/*/.md`, `.claude/skills/**/SKILL.md`	70/20/10 architect/editor split (Haiku/Sonnet/Opus)
`stata-code-conventions` (v1.9.0)	`*/.do`, `scripts/stata/**`	Stata header scaffold, numbered pipeline, esttab, clustering discipline, AEA compliance
`simulation-conventions` (v1.10.0)	`*/simulation.R`, `/_sim.R`, `explorations/**`	Monte Carlo discipline: DGP/estimand, L'Ecuyer seeding, Monte Carlo SE, coverage-vs-truth, raw-result storage
`r-package-conventions` (v1.10.0)	`R/`, `tests/`, `DESCRIPTION`, `NAMESPACE`, `man/**`	R package-source standards: no `library()` in `R/`, roxygen NAMESPACE, Imports/Suggests, testthat 3e, CRAN policy
`confidential-data` (v2.0)	`data/`, `/.dta`, `/restricted/`, `/confidential/*`	Restricted/IRB-data protocol: never commit raw data, disclosure clearance before release, restricted-data-safe multi-author git topology
`did-conventions` (v2.0)	`*/did.R`, `/eventstudy.R`, `*/att_gt`, `/csdid.do`, `/drdid*`	DiD/event-study standards (Sant'Anna): LONG data + gname coding, doubly-robust default, control-group rule, uniform-band inference, mandatory pre-trend/HonestDiD/didFF diagnostics, replicate-and-verify-to-1e-6
`inference-robustness` (v2.0)	`scripts/*/.R`, `*/.do`, `*/.py`	Multiple-testing (FWER/Romano-Wolf vs FDR/Anderson sharpened-q, pre-register the family) + specification-curve / leave-one-out / wild-cluster-bootstrap robustness

Templates (`templates/`)

Template	What It Does
`session-log.md`	Structured session logging format
`quality-report.md`	Merge-time quality report format
`exploration-readme.md`	Exploration project README template
`archive-readme.md`	Archive documentation template
`requirements-spec.md`	MUST/SHOULD/MAY requirements framework with clarity status
`constitutional-governance.md`	Template for defining non-negotiable principles vs. preferences
`skill-template.md`	Academic skill creation template with domain-specific examples
`decision-record.md`	Architectural decision record (ADR) template
`journal-profile-template.md`	Journal profile for `/review-paper --peer` editor calibration
`preregistration-template.md` (v1.8.0)	Preregistration document scaffold (OSF / AsPredicted / AEA RCT)
`passport-template.yaml` (v1.9.0)	Per-paper YAML passport for numeric-claim provenance (used by `/audit-reproducibility`)
`response-to-referees.md`	R&R response document scaffold

Prerequisites

Tool	Required For	Install
Claude Code	Everything	claude.ai/install
git	Clone + version control	git-scm.com
Python 3 (3.9+)	Internal checkers (palette sync, TikZ prevention)	Preinstalled on macOS/Linux; python.org for Windows
XeLaTeX	LaTeX compilation (Beamer `HelloWorld`, real lectures)	TeX Live or MacTeX
Quarto	Web slides (Quarto `HelloWorld`, real lectures)	quarto.org/docs/get-started
R	Figures and analysis (`/data-analysis`, `scripts/R/` template)	r-project.org
pdf2svg	TikZ → SVG for Quarto (`/extract-tikz`)	`brew install pdf2svg` (macOS), `apt install pdf2svg` (Debian)
gh CLI	PR / issue workflow	`brew install gh` (macOS), `apt install gh` (Debian)

Minimum to fork this template: Claude Code + git + Python 3 (Python is already installed on macOS/Linux).

Minimum to run the included HelloWorld demos end-to-end: add XeLaTeX (for /compile-latex HelloWorld) and Quarto (for /deploy HelloWorld).

Your real lectures may need more — R for scripts/R/ analyses, pdf2svg if you use TikZ extraction, gh CLI if you use the PR-based commit workflow. ./scripts/validate-setup.sh reports which of these are installed and what each unlocks.
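The kind of check such a setup script performs can be approximated in Python. This sketch is not ./scripts/validate-setup.sh itself: the executable names (`xelatex`, `Rscript`, and so on) and the `report` function are assumptions, while the "unlocks" text paraphrases the table above.

```python
import shutil
from typing import Callable, Dict, Optional, Tuple

# Executable name -> what it unlocks (names are assumed, not read from the script).
TOOLS: Dict[str, str] = {
    "git": "clone + version control",
    "xelatex": "Beamer HelloWorld, /compile-latex",
    "quarto": "Quarto HelloWorld, /deploy",
    "Rscript": "/data-analysis, scripts/R/",
    "pdf2svg": "/extract-tikz",
    "gh": "PR / issue workflow",
}


def report(tools: Dict[str, str] = TOOLS,
           which: Callable[[str], Optional[str]] = shutil.which
           ) -> Dict[str, Tuple[bool, str]]:
    """Map each tool to (is_installed, what_it_unlocks)."""
    return {name: (which(name) is not None, unlocks)
            for name, unlocks in tools.items()}


def missing(status: Dict[str, Tuple[bool, str]]) -> Dict[str, str]:
    """Tools that were not found on PATH, with what each would unlock."""
    return {name: unlocks for name, (found, unlocks) in status.items() if not found}
```

Passing `which` as a parameter keeps the lookup swappable, so the report logic can be exercised without depending on what is installed locally.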

Adapting for Your Field

Fill in the knowledge base (.claude/rules/knowledge-base-template.md) with your notation, applications, and design principles
Customize the domain reviewer (.claude/agents/domain-reviewer.md) with review lenses specific to your field
Update the color palette — this is a two-surface contract: change the HEX values at the top of both Preambles/header.tex (Beamer/TikZ) and Quarto/theme-template.scss (Quarto slides) so they agree. Then run ./scripts/check-palette-sync.sh to verify. Forgetting one surface silently produces mismatched Beamer vs. Quarto renderings. See Preambles/README.md for the full contract and the TikZ style library.
Add field-specific R pitfalls to .claude/rules/r-code-conventions.md
Fill in the lecture mapping in .claude/rules/beamer-quarto-sync.md
Customize the workflow quick reference (.claude/WORKFLOW_QUICK_REF.md) with your non-negotiables and preferences
Set up the exploration folder (explorations/) for experimental work
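The two-surface palette contract from the list above lends itself to a mechanical check. The sketch below is not the repo's check-palette-sync script: it assumes colors are declared as `\definecolor{name}{HTML}{RRGGBB}` in the LaTeX preamble and as `$name: #RRGGBB;` in the SCSS file, and that both surfaces use the same color names.

```python
import re
from typing import Dict, Optional, Tuple

# Assumed declaration formats on each surface.
TEX_COLOR = re.compile(r"\\definecolor\{(\w+)\}\{HTML\}\{([0-9A-Fa-f]{6})\}")
SCSS_COLOR = re.compile(r"\$([\w-]+):\s*#([0-9A-Fa-f]{6})\s*;")


def palette_mismatches(tex_source: str, scss_source: str
                       ) -> Dict[str, Tuple[Optional[str], Optional[str]]]:
    """Return {color_name: (tex_hex, scss_hex)} for every disagreement.

    A color defined on only one surface shows up with None on the other side.
    """
    tex = {n.lower(): h.upper() for n, h in TEX_COLOR.findall(tex_source)}
    scss = {n.lower(): h.upper() for n, h in SCSS_COLOR.findall(scss_source)}
    return {
        name: (tex.get(name), scss.get(name))
        for name in sorted(tex.keys() | scss.keys())
        if tex.get(name) != scss.get(name)
    }
```

An empty result means the two files agree; anything else names the color and shows both values, which is the silent mismatch the contract is meant to catch.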

Additional Resources

Claude Code Documentation
Writing a Good CLAUDE.md — official guidance on project memory

Origin

This infrastructure was extracted from Econ 730: Causal Panel Data at Emory University, developed by Pedro Sant'Anna using Claude Code over 6+ sessions. The course produced 6 complete PhD lecture decks with 800+ slides, interactive Quarto versions with plotly charts, and full R replication packages — all managed through this multi-agent workflow. The patterns are domain-agnostic: the same agents, rules, and orchestrator work for any academic project.

Community & Extensions

As of March 2026, 15+ research groups across economics, energy, political science, and engineering have forked and adapted this workflow. The infrastructure (orchestrator, hooks, quality gates) transfers without modification.

Extended workflows:

clo-author by Hugo Sant'Anna (UAB) — Paper-centric research workflows with 17 specialized agents (6 worker-critic pairs plus referees, data-engineer, verifier), simulated blind peer review, AEA replication compliance, and full research lifecycle management. The /review-paper --peer <journal> pipeline in this template is adapted from clo-author with Hugo's permission (pipeline shape, 6-way disposition taxonomy, journal-calibration schema, paper-type branching). Thanks, Hugo.
claudeblattman by Chris Blattman (U Chicago) — Comprehensive guide for non-technical academics: executive assistant workflows, proposal writing, agent debates, and self-improving configuration
MixtapeTools by Scott Cunningham (Baylor) — The Rhetoric of Decks: philosophy and practice of beautiful, rhetorically effective academic presentations
autoresearch by Andrej Karpathy — Constraint-based autonomous research with program.md as constitutional document
ClaudeCodeTools — "The Editor" persona: seven-audit sequential paper review protocol

See the guide's ecosystem section for detailed descriptions, design principles, and more resources.

Versioning & Contributing

What's new: see CHANGELOG.md. We follow loose semver — breaking changes get major bumps so you can decide when to pull updates.
How to contribute: see .github/CONTRIBUTING.md. PRs welcome for generalizable improvements; fork-specific work stays in your fork.
Pin to a version: git checkout v2.0.0 (current as of 2026-06-09).

License

MIT License. See LICENSE.