team-shinchan

agent
SUMMARY

15 Shinchan character agents that debate, plan, execute, and learn. Stop doing everything yourself - orchestrate a team.

README.md

Team-Shinchan

Shinchan

The Agent Harness for Claude Code

Guardrails, Observability, Ontology, and Quality Gates for AI-Powered Development

15 specialist agents with structured workflows, project ontology, budget controls, analytics, and self-learning.

Version
License
Claude Code
GitHub stars

Try It | Features | Commands | The Team


Why an Agent Harness?

AI agents are powerful but unpredictable. Without constraints, they hallucinate architecture, skip reviews, blow through token budgets, and forget past decisions. An Agent Harness solves this by wrapping agents in guardrails, quality gates, and feedback loops -- the same way a test harness wraps code in assertions.

Team-Shinchan turns Claude Code into a harnessed multi-agent system where 15 specialists debate, plan, execute, and learn -- all within well-defined boundaries.

Without a Harness With Team-Shinchan
Agent starts blind, re-reads the whole codebase Project Ontology auto-builds a knowledge graph on first session
Agents skip stages, jump to code Workflow Guard enforces stage-tool matrix
No visibility into agent behavior Analytics with trace IDs track every action
Unlimited token burn Budget Guard caps spend per session
No quality signal on the harness itself Harness Lint checks plugin integrity
Past decisions forgotten Memory + Ontology + session bridging persist context
Code reviewed ad-hoc (or not at all) Action Kamen reviews every phase (mandatory)

Harness Engineering

Team-Shinchan is built on 5 Harness Engineering principles:

1. Context Engineering

Load the right knowledge at the right time.

  • load-kb hook injects project knowledge base at session start
  • Project Ontology auto-builds a knowledge graph of your codebase (entities, relations, dependencies)
  • AGENTS.md auto-generated map of all 15 agents with roles and capabilities
  • Session bridging via session-wrap and resume for cross-session continuity
  • Self-learning memory accumulates project conventions over time

2. Architectural Constraints

Hard boundaries that prevent structural drift.

  • Workflow Guard -- stage-tool matrix blocks code edits during planning (PreToolUse hook)
  • Layer Enforcement -- validator ensures agents stay in their architectural layer
  • Rules System -- 54 rules across coding (14), security (14), testing (13), git (13)
  • Stage-tool matrix -- 6 tools x 4 stages, enforced automatically

3. Guardrails and Quality Gates

Automated checks that prevent bad outcomes.

  • Security Hook -- blocks secrets, destructive git, sensitive files, large files
  • Budget Guard -- token budget enforcement with alerts at 80% and hard stop at 100%
  • Deny List -- 15 pattern rules blocking dangerous operations
  • Self-Check -- completion checklist all execution agents must pass
  • Action Kamen -- mandatory code review with Skepticism Rules (S1-S4) at every phase boundary
  • Sprint-Contract -- AC testability review before execution begins

4. Feedback Loops

Observability and continuous improvement.

  • Analytics -- trace IDs, event tracking, agent performance metrics (src/analytics.js)
  • Harness Lint -- static analysis of the harness itself (src/harness-lint.js)
  • ARCHITECTURE.md auto-generation -- src/gen-architecture-map.js generates agent hierarchy, workflow, invariant rules, and entry points; --check flag integrates into CI
  • Eval -- schema validation and regression detection (src/eval-schema.js, src/regression-detect.js)
  • Auto-Retrospective -- extracts learnings after every completed task
  • Garbage Collection -- lint detects orphaned skills, broken refs, stale configs

5. State Management

Durable state across sessions and workflows.

  • WORKFLOW_STATE.yaml -- structured state file with stage, phase, and progress
  • Trace IDs -- every agent action tagged for end-to-end traceability
  • Session Wrap -- auto-summary on session end, persisted to work tracker
  • Resume -- interrupted workflows resume from last checkpoint with handoff artifacts
  • Work Tracker -- JSONL event log with auto-rotation at 10K lines

What's New

v4.23.0 — Generator-Evaluator Harness Strengthening

Benchmarked from Anthropic's "Harness Design for Long-Running Apps":

  • Sprint-Contract pattern — Nene and Action Kamen negotiate testable AC criteria before execution. Shinnosuke mediates the review with audit logs.
  • Skepticism Rules S1-S4 — Action Kamen now runs Evidence Gate, Assumption Audit, Coverage Traceability, and Regression Guard on every review.
  • Pre-compact handoff — Context resets preserve progress, decisions, and blocking issues. /resume auto-loads handoff state.
  • Test Execution Mode — Action Kamen can run declared test commands and attach output as evidence (run_tests: true).
  • Spec Granularity Rules — 3 rules (Deliverable Anchor, Binary Verifiability, Command Evidence) enforce testable AC writing.
  • Structured rubricseval-rubrics.json with default/documentation/planning rubrics for machine-readable quality scoring.
  • Assumption Audit — Harness Lint verifies structural assumptions (Skepticism Rules, Sprint-Contract, rubrics, handoff) haven't drifted.

v4.22.0 — AK Review Gate for Requirements and Planning

  • Action Kamen auto-review gates after user approval of REQUESTS.md and PROGRESS.md
  • Hard block with max 2 auto-revise retries before user escalation

Earlier Highlights

  • v4.18.0: Domain-aware routing, collaboration scoring, wave parallel execution
  • v4.17.0: Plan Mode integration, token usage tracking, git commit deferred to Stage 4
  • v4.10.0: ARCHITECTURE.md auto-generation, 3-layer ontology, agent self-observation
  • v4.9.0: Brainstorm skill, Socratic interview, two-stage review
  • v4.2.0: 18 enforcement gaps closed — budget hard stop, stage transition gates
  • v4.0.0: Harness foundation — analytics, trace IDs, budget guard, 5-layer architecture

Try It in 2 Minutes

Install

From Marketplace (Recommended)

/plugin marketplace add seokan-jeong/team-shinchan
/plugin install team-shinchan

One-Click Install

curl -fsSL https://raw.githubusercontent.com/seokan-jeong/team-shinchan/main/install.sh | bash

Manual Install

git clone https://github.com/seokan-jeong/team-shinchan.git ~/.claude/plugins/team-shinchan

Verify Installation

/team-shinchan:help

If you see the help menu, you are ready to go.

What's Next?

  • Read the Getting Started Guide
  • Try /team-shinchan:brainstorm {problem} to explore before building
  • Try /team-shinchan:start {your task} to see the full workflow
  • Use /team-shinchan:debate for design decisions

15 Specialized Agents

Orchestration

Agent Role
Shinnosuke Orchestrator
Himawari Atlas (Large Projects)
Midori Debate Moderator

Execution

Agent Role
Bo Execution PO
Kazama Deep Worker

Specialists

Agent Role
Aichan Frontend/UI
Bunta Backend/API
Masao DevOps/Infra

Advisors

Agent Role
Hiroshi Senior Oracle
Nene Strategic Planner
Misae Requirements Analyst
Action Kamen Code Reviewer

Utility

Agent Role
Shiro Fast Explorer
Masumi Documentation
Ume Image/PDF Analysis

Commands

49 commands across workflow, specialist, and utility categories:

Workflow Commands

Command Description
/team-shinchan:start Start integrated workflow
/team-shinchan:resume Resume interrupted workflow
/team-shinchan:autopilot Autonomous execution
/team-shinchan:ultrawork Parallel execution
/team-shinchan:ralph Loop until complete
/team-shinchan:orchestrate Orchestrate multi-agent task
/team-shinchan:bigproject Large project with Himawari
/team-shinchan:status Show workflow status

Analysis and Planning

Command Description
/team-shinchan:brainstorm Explore problem space before requirements
/team-shinchan:debate Trigger expert debate (via Midori)
/team-shinchan:plan Planning session
/team-shinchan:analyze Deep analysis
/team-shinchan:deepsearch Codebase search
/team-shinchan:requirements Gather and clarify requirements
/team-shinchan:research Research a topic
/team-shinchan:design-review Compare design mockups against implementation to detect UI mismatches

Implementation

Command Description
/team-shinchan:implement Execute code changes
/team-shinchan:frontend Frontend/UI development
/team-shinchan:backend Backend/API development
/team-shinchan:devops Infrastructure and DevOps
/team-shinchan:review Code review
/team-shinchan:vision Image/PDF analysis
/team-shinchan:micro-execute Micro-task execution with per-task two-stage review
/team-shinchan:systematic-debugging 4-phase root-cause debugging (investigate, analyze, hypothesize, implement)
/team-shinchan:test-driven-development Enforce RED-GREEN-REFACTOR TDD cycle during implementation

Verification

Command Description
/team-shinchan:verify-implementation Run full verification suite
/team-shinchan:verify-agents Verify agent schema
/team-shinchan:verify-skills Verify skill files
/team-shinchan:verify-consistency Verify cross-references
/team-shinchan:verify-workflow Verify workflow state
/team-shinchan:verify-memory Verify memory system
/team-shinchan:verify-budget Verify token budget

Harness and Ontology

Command Description
/team-shinchan:ontology Query, manage, and visualize project ontology
/team-shinchan:impact-analysis Cascade dependency analysis with risk assessment
/team-shinchan:analytics View agent analytics and trace data
/team-shinchan:budget Check token budget status
/team-shinchan:lint-harness Lint the harness for structural issues
/team-shinchan:eval Run eval checks and regression detection

Memory and Utility

Command Description
/team-shinchan:memories View learned memories
/team-shinchan:learn Add to memory
/team-shinchan:forget Remove memory
/team-shinchan:work-log Query work tracker events
/team-shinchan:session-summary View session summary
/team-shinchan:manage-skills Manage plugin skills
/team-shinchan:help Show help
/team-shinchan:setup First-time setup and health check
/team-shinchan:setup-hud Configure Claude Code statusLine HUD display

Internal / Admin Commands

Command Description
/team-shinchan:release Bump version across plugin.json, marketplace.json, README, and CHANGELOG (maintainers only)
/team-shinchan:verification-before-completion Block completion claims without fresh verification evidence (auto-invoked by agents)

Quick Triggers

No commands needed -- just say:

Say This Activates
"ulw", "fast", "parallel" Ultrawork (parallel mode)
"until done", "complete it" Ralph (persistence mode)
"autopilot", "auto" Autopilot (autonomous)
"brainstorm", "explore" Brainstorm (problem exploration)
"debate", "pros and cons" Debate system
"analyze", "debug", "why" Deep analysis

How Skills Work

Skills are not just documentation -- they automatically invoke specialist agents.

/team-shinchan:brainstorm  (optional — explore problem space)
       |
       v
/team-shinchan:start
       |
       v
+--------------------------------------+
|  Task(subagent_type="shinnosuke")    |  <- Auto-invoked
+------------------+-------------------+
                   v
+--------------------------------------+
|  Shinnosuke orchestrates:            |
|  +-- Misae (Socratic requirements)   |
|  +-- Nene (planning)                 |
|  +-- Shiro (exploration)             |
|  +-- Bo(PO) → domain agent →         |
|       Action Kamen (2-stage review)  |
|  +-- Masumi (completion docs)        |
+--------------------------------------+
Skill Auto-Invokes
/start Shinnosuke -> Full workflow
/brainstorm Hiroshi -> Problem exploration
/plan Nene -> Structured planning
/analyze Hiroshi -> Deep analysis
/deepsearch Shiro -> Masumi
/debate Midori -> Expert panel
/autopilot Shinnosuke -> Autonomous mode
/ultrawork Shinnosuke -> Parallel execution
/ralph Kazama -> Persistent loop
/implement Bo -> Code execution
/frontend Aichan -> Frontend/UI
/backend Bunta -> Backend/API
/devops Masao -> Infrastructure
/review Action Kamen -> Code review
/requirements Misae -> Requirements analysis
/bigproject Himawari -> Large project orchestration
/ontology Ontology engine -> Knowledge graph
/impact-analysis Ontology engine -> Dependency cascade

You run the skill, agents do the work.


Plugin Inventory

Component Count Location
Agents 15 agents/
Skills 50 skills/
Commands 49 commands/
Hooks 30 entries hooks/
Validators 24 tests/validate/
Rules 4 categories (54 rules) rules/
Src Scripts 16 src/

Quality and Testing

Team-Shinchan is validated by 3 tiers of automated testing:

Tier Tests What It Checks
Static Validators 24 Schema, cross-refs, consistency, API contracts, token budget, layer enforcement, agents-map, ontology integrity, hook execution
Agent Behavior (promptfoo) 29 Individual agent role adherence
E2E Workflow 11 Full workflow scenarios (5 types)
# Run static tests locally (free, no API key)
./run-tests.sh static

# Run all tests (requires ANTHROPIC_API_KEY)
./run-tests.sh all

FAQ

Does this replace Claude Code?

No. Team-Shinchan is a plugin that enhances Claude Code by adding specialized agents, guardrails, and observability. You still use Claude Code as normal, but now with harnessed agents instead of one generalist.

Is it free?

Yes, Team-Shinchan is MIT licensed and free to use. Agent calls consume your Claude Code API credits (same as any other Claude conversation). The Budget Guard helps you control spend.

How many API calls does it make?

Quick fixes: ~3 calls (Bo implements -> Action Kamen reviews -> done)
Standard tasks: 10-30 calls (requirements -> planning -> execution -> completion)
Complex tasks with debates: 20-50 calls (includes expert panel discussions)

Each call is purposeful, traced, and documented in the workflow.

Can I use individual agents without the full workflow?

Yes. Each skill works standalone:

  • /team-shinchan:analyze -> Just Hiroshi (deep analysis)
  • /team-shinchan:plan -> Just Nene (planning)
  • /team-shinchan:deepsearch -> Just Shiro + Masumi (search)

Or let Shinnosuke orchestrate the full workflow with /team-shinchan:start.


Credits

Inspired by and built upon:


License

MIT License


Team-Shinchan -- The Agent Harness for Claude Code

Guardrails, Observability, and Quality Gates for AI-Powered Development

Yorumlar (0)

Sonuc bulunamadi