agentic-os

agent
Guvenlik Denetimi
Basarisiz
Health Uyari
  • License — License: MIT
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Basarisiz
  • rm -rf — Recursive force deletion command in .agentcortex/bin/deploy.sh
Permissions Gecti
  • Permissions — No dangerous permissions requested

Bu listing icin henuz AI raporu yok.

SUMMARY

Governance-first OS for AI coding agents — structured workflows, delivery gates, engineering guardrails, and 14 professional skills for Claude Code, Cursor, Copilot, Antigravity & Codex.

README.md

Agentic OS v1.4.1

Agentic OS

A governance-first layer for AI coding agents.
Portable workflows, delivery gates, engineering guardrails, and 14 professional skills
for Claude Code, OpenAI Codex, Google Antigravity, Cursor, GitHub Copilot, and other coding agents.

CI Status Security Scan MIT License

繁體中文 · Contributing · Changelog


The Problem

AI coding agents are powerful, but they need shared operating rules. Without structure, they can:

  • Skip steps — jump straight to code without planning or reviewing
  • Hallucinate completion — claim "done" without verifiable evidence
  • Drift from scope — refactor code nobody asked them to touch
  • Lose context — forget decisions across conversations, forcing re-derivation
  • Break things silently — no safety gates between "idea" and "production"

The Solution

Agentic OS is a portable governance framework that gives AI agents a repeatable engineering workflow. Install it into your project, and your AI agents gain:

flowchart LR
    U["User<br/>says"] --> G{"Gate<br/>Engine"}
    G -->|PASS| W["Workflow<br/>+ Skills"]
    W --> E{"Evidence<br/>Required"}
    E -->|PASS| S["Ship<br/>→ SSoT"]
    G -->|FAIL| X1["⛔ STOP"]
    E -->|FAIL| X2["⛔ STOP"]

    style U fill:#8b5cf6,color:#fff,stroke:none
    style G fill:#f59e0b,color:#fff,stroke:none
    style W fill:#3b82f6,color:#fff,stroke:none
    style E fill:#22c55e,color:#fff,stroke:none
    style S fill:#06b6d4,color:#fff,stroke:none
    style X1 fill:#ef4444,color:#fff,stroke:none
    style X2 fill:#ef4444,color:#fff,stroke:none

The idea behind every phase is the same: if there's no verifiable evidence, the task isn't done — and the gates enforce that, instead of trusting the agent to self-report honestly.


Features

Gate Engine & Phase System

Every task flows through mandatory phases. The AI cannot skip ahead.

flowchart LR
    B["/bootstrap"] --> P["/plan"]
    P --> I["/implement"]
    I --> R["/review"]
    R --> T["/test"]
    T --> S["/ship"]

    style B fill:#8b5cf6,color:#fff,stroke:none
    style P fill:#3b82f6,color:#fff,stroke:none
    style I fill:#22c55e,color:#fff,stroke:none
    style R fill:#f59e0b,color:#fff,stroke:none
    style T fill:#ef4444,color:#fff,stroke:none
    style S fill:#06b6d4,color:#fff,stroke:none
Classification Required Phases
tiny-fix Classify → Execute → Evidence → Done
quick-win Bootstrap → Plan → Implement → Evidence → Ship
feature Bootstrap → Spec → Plan → Implement → Review → Test → Handoff → Ship
hotfix Bootstrap → Research → Plan → Implement → Review → Test → Ship
architecture-change Bootstrap → ADR → Spec → Plan → Implement → Review → Test → Handoff → Ship

Engineering Guardrails

A constitution for AI behavior — loaded automatically, enforced at every phase:

  • No Evidence = No Completion — narrative claims are not proof
  • Scope Discipline — unauthorized refactoring is strictly prohibited
  • Destructive Command Blockingrm -rf, git reset --hard, force pushes require pre-approved rollback plans
  • OWASP Top 10 Auto-Scan — security checks run during /implement and /review
  • Confidence Gate — AI must declare confidence level; low confidence triggers escalation

14 Professional Skills

Skills auto-activate based on task classification and workflow phase:

Skill Trigger Description
Test-Driven Development feature, architecture-change Red → Green → Refactor cycles
Systematic Debugging bug encounter 4-phase root cause analysis
Red Team / Adversarial review, test Classification-based security analysis
API Design API endpoints detected Endpoint validation enforcement
Auth Security auth code detected Hashing, tokens, rate limiting
Database Design migration detected Forward-only ORM-aware migration safety
Frontend Patterns UI components Component and state management patterns
Parallel Agent Dispatching complex tasks Coordinated subagent execution
Subagent-Driven Development multi-module tasks Multi-agent coordination
Karpathy Principles all coding tasks Behavioral guardrails against common LLM coding mistakes
Production Readiness feature, architecture-change Pre-ship observability: error sinks, log strategy, rollback telemetry
Verification Before Completion /ship 5-gate check: Scope → Quality → Evidence → Risk → Communication
Git Worktrees parallel branches Worktree isolation workflows
Doc Lookup documentation needed Documentation retrieval strategy

Single Source of Truth (SSoT)

Every project has one canonical state file. AI agents read it first, write to it last.

.agentcortex/context/
├── current_state.md          # Global project state (SSoT)
└── work/
    └── <branch-name>.md      # Per-task work log (isolated)
  • Work Logs track per-task progress, evidence, and gate receipts
  • SSoT tracks global decisions, lessons, and ship history
  • Handoff enables seamless AI-to-AI continuity across conversations

Multi-Agent Collaboration

Built for teams where multiple AI sessions work on the same codebase:

  • One Branch = One Owner — prevents concurrent Work Log corruption
  • Single-Writer Locking — atomic lock files block concurrent sessions per branch (configurable back to advisory)
  • Ship Guard — checks for SSoT conflicts before merging
  • Session Identity — every AI session writes its model name and timestamp

Three Entry Paths

Pick the one that matches your starting point (full opener templates in Quick Start §2):

Starting point First command Full chain
🆕 Brand-new project, multi-feature idea /spec-intake spec-intake → pick feature → bootstrap → app-init → plan → implement
🏗️ Existing repo, adopting Agentic OS /audit (read-only, zero risk) audit → app-init → spec-intake → quick-win → feature
🎯 Single concrete task on an established project /bootstrap bootstrap → plan → implement → review → test → ship

See the Lifecycle Benchmark (繁體中文) for real token consumption data across 6 development scenarios.

Token Efficiency

Designed for cost-effective models (Gemini Flash, Haiku, etc.):

  • Conditional Loading — tiny-fix skips guardrails (~5,000 tokens saved)
  • Skill Cache Policy — metadata-first loading, full SKILL.md only on cache miss
  • Phase Summary — compact 1-liner per phase for low-token resume
  • Read-Once Discipline — governance docs persist in context, never re-read

Quick Start

1. Install

Prerequisites:

Dependency Required? Purpose
Git Required Clone and deploy the framework
Bash Required Run deploy & validate scripts (Git Bash from Git for Windows is enough on Windows)
Python 3.9+ Recommended Enables full validation (metadata, encoding, command sync checks)
SHA-256 tool Required for deploy sha256sum, shasum, or openssl (pre-installed on most systems)

No Python? The framework deploys and works without Python. Validation runs in
degraded mode — Python-dependent checks report WARN instead of FAIL.
Pass --no-python to suppress warnings: bash .agentcortex/bin/validate.sh --no-python

Install (first time):

# Clone Agentic OS
git clone https://github.com/KbWen/agentic-os.git

# Preview what will be deployed (no changes made)
./agentic-os/installers/deploy_brain.sh --dry-run /path/to/your-project

# Deploy into your project
./agentic-os/installers/deploy_brain.sh /path/to/your-project

Update (after first install): the installer lives inside your project. Run it from your project root — it reads the deploy manifest and auto-fetches the latest framework version from GitHub.

bash installers/deploy_brain.sh .

Existing files won't be overwritten. If your project already has AGENTS.md, CLAUDE.md, or other framework-managed files, they are preserved. The new framework version is saved as <filename>.acx-incoming sidecar. Review and merge manually — or ask your AI agent: "Merge each .acx-incoming into its target, preserving my project-specific content and adopting framework updates."

AI-agent install: If you're asking an AI assistant to install Agentic OS, point it to this README. The commands above are deterministic — no platform-specific heuristics required.

Optional local pre-commit validation: enable the bundled Git hook sample to run Agentic OS validation before each commit.

cp .githooks/pre-commit.guard-ssot.sample .githooks/pre-commit
chmod +x .githooks/pre-commit
git config core.hooksPath .githooks

On Windows, run the setup from PowerShell:

Copy-Item .githooks\pre-commit.guard-ssot.sample .githooks\pre-commit
git config core.hooksPath .githooks

The hook runs validate.ps1 from Git Bash on Windows when PowerShell is available, otherwise it runs validate.sh. Validator failures block the commit; guarded SSoT receipt warnings remain advisory.

Windows (PowerShell / CMD)
# Clone Agentic OS (first time)
git clone https://github.com/KbWen/agentic-os.git

# Preview what will be deployed (no changes made)
powershell -ExecutionPolicy Bypass -File .\agentic-os\installers\deploy_brain.ps1 -DryRun C:\path\to\your-project

# Deploy into your project
powershell -ExecutionPolicy Bypass -File .\agentic-os\installers\deploy_brain.ps1 C:\path\to\your-project

# CMD alternative
.\agentic-os\installers\deploy_brain.cmd C:\path\to\your-project

Use the PowerShell entrypoint when possible. It resolves Git Bash directly and does not require a WSL distro.
Git Bash is still required on Windows for shell-based deploy and validation scripts; the PowerShell entrypoint simply locates and invokes it for you.

# Already installed? Run from your project root to update:
powershell -ExecutionPolicy Bypass -File .\installers\deploy_brain.ps1 .

# Validation after install
powershell -ExecutionPolicy Bypass -File .\.agentcortex\bin\validate.ps1

# Lightweight validation when Python is not installed
powershell -ExecutionPolicy Bypass -File .\.agentcortex\bin\validate.ps1 -NoPython

# (Git Bash equivalent)
bash ./.agentcortex/bin/validate.sh --no-python
Text-only usage (no scripts)

If you only want the governance templates (Markdown files) without running any tooling:

  1. Copy the .agent/, .agents/, and AGENTS.md files into your project
  2. Optionally copy .agentcortex/context/ and .agentcortex/templates/ for state management
  3. No Python, Bash, or other tools are needed — all governance is plain Markdown

1a. Customizing Without Conflicts (Fork or Clone)

However you adopt Agentic OS — fork the repo, or clone + deploy_brain.sh into your project — the same rule keeps upgrades painless: add your own files; never edit framework-owned files in place. Put your customizations where the framework guarantees never to touch them, and they survive both git pull upstream (fork) and the next deploy (clone):

You want to… Put it here Why it survives upgrades
Add project governance (narrow/disable a directive) AGENTS.override.md (project root) or ~/.agentcortex/AGENTS.override.md (personal) Loaded present-only at session start; framework never ships these files. MAY narrow/disable directives but cannot relax delivery gates.
Add your own skills .agents/skills/custom-<name>/SKILL.md (+ .agent/skills/custom-<name> metadata) custom-* is a reserved namespace the framework never ships → zero collision, never overwritten.
Adjust skill activation (pin/exclude) .agentcortex/context/private/user-preferences.yaml Gitignored, personal, loaded by bootstrap.

What NOT to do: editing a framework file in place (AGENTS.md, .agent/rules/*, .agent/workflows/*, a shipped skill body) causes merge conflicts on git pull upstream (fork) and is force-updated on the next deploy (clone). Framework skills are the one tolerant exception — if you edit one, deploy preserves your copy as a visible <file>.acx-incoming sidecar rather than silently overwriting it — but the cleaner pattern is always to copy it to a custom-<name> skill and edit that. Governance, security, and workflow files always force-update so you keep receiving fixes.

2. Start Working — Pick Your Entry Point

The right first command depends on your starting point. Always preface with:

"Read AGENTS.md and follow it. Do not claim completion until /review and /test pass."

Then add one of the following based on your situation:

A. Brand-new project, multi-feature idea/spec-intake first

This is a brand-new project. My initial idea is:
[1–2 paragraphs describing the product and its features]

Please run /spec-intake to decompose this into a feature inventory.
After I pick the first feature, run /bootstrap → /app-init to establish
the tech stack ADR, then /plan and /implement.

B. Existing repository adopting Agentic OS/audit first

This repo already has code; Agentic OS was just deployed.
Run /audit (read-only) to map the codebase, then /app-init to record
the tech stack ADR, then /spec-intake when I add features.

C. Single concrete task on an established project/bootstrap directly

/bootstrap
[describe the single task]

Why this matters: a multi-feature raw idea routed straight to /bootstrap
is classified as a single task, skipping the Feature Inventory and
_product-backlog.md decomposition. The Intent Router auto-detects this in
most cases, but explicit beats inferred.

3. Follow the Flow

The AI will classify your task and follow the required phases automatically:

You: "Add pagination to the user list API"

AI: [/bootstrap] → Classification: feature
    → Required: Bootstrap → Spec → Plan → Implement → Review → Test → Handoff → Ship
    → Loading skills: API Design, Test-Driven Development
    → ⚡ ACX

Commands

Command Purpose
/bootstrap Initialize task, classify scope, create work log
/spec-intake Import and decompose external specs
/spec Define verifiable specifications
/plan Create implementation plan with rollback steps
/implement Execute code changes (gate-protected)
/review Logic, security, and scope audit
/test Verify with minimal necessary tests
/handoff Resumable state summary for next session
/ship Consolidate evidence, update SSoT, merge
/adr Architecture Decision Record
/brainstorm Rapid solution exploration
/research Autonomous research and recommendation
/audit Read-only system mapping
/retro Retrospective analysis
/decide Record key decisions with reasoning
/hotfix Emergency fix escalation

Architecture

your-project/
├── AGENTS.md                    # Global AI governance directives
├── CLAUDE.md                    # Claude Code integration entry
│
├── .agent/                      # Agent Intelligence Layer
│   ├── config.yaml              # Governance constants
│   ├── rules/                   # Engineering & security guardrails
│   ├── skills/                  # Skill metadata (auto-trigger defs)
│   └── workflows/               # Workflow definitions (33 workflows)
│
├── .agents/skills/              # Full skill implementations (14 skills)
│   ├── test-driven-development/
│   ├── systematic-debugging/
│   └── ...
│
├── .agentcortex/                # Runtime & State Layer
│   ├── bin/                     # Deploy & validation scripts
│   ├── context/                 # SSoT + work logs
│   ├── docs/                    # Philosophy, platform guides, examples
│   ├── metadata/                # Skill registry & cache index
│   ├── templates/               # Reusable ADR/spec/README templates
│   └── tools/                   # Runtime tools (Python)
│
├── docs/                        # Project-level documentation
│   ├── specs/                   # Feature specifications
│   ├── adr/                     # Architecture Decision Records
│   └── architecture/            # Domain architecture docs
│
└── installers/                  # Cross-platform deploy wrappers
    ├── deploy_brain.sh          # Bash installer
    ├── deploy_brain.ps1         # PowerShell installer
    └── deploy_brain.cmd         # CMD installer

Platform Compatibility

Platform Status Integration
Claude Code Native support CLAUDE.md entrypoint + Claude platform guide
OpenAI Codex Native support AGENTS.md, Codex platform guide, and CLI delegation workflow
Google Antigravity Native support Intent router + Antigravity runtime guidance
Cursor Compatible Uses AGENTS.md / project-rule style guidance
GitHub Copilot Compatible Uses repository instructions and guardrail docs
Any LLM Agent Compatible Model-agnostic Markdown workflows and evidence rules

Philosophy

Agentic OS is built on 10 non-negotiable principles:

# Principle Meaning
1 AI Drives, Human Assists AI follows phases autonomously; human confirms direction
2 Never Skip Phases Sequential execution, not skip-to-end
3 Constitution Over Task Guardrails override any request
4 No Evidence = No Completion Claims without proof are rejected
5 Correctness First Correct > performant > clever
6 Token Efficiency Minimize cold-start cost, maximize context reuse
7 Cross-Model Compliance Works with any AI model, no vendor lock-in
8 Actionable Documentation Every doc must be directly usable
9 Scope Discipline Only touch what was asked
10 Explainability Every AI decision must be traceable

Documentation

Start with the document that matches what you are trying to do:

Goal Start Here Why
Install or upgrade Agentic OS Quick Start, Migration Guide Deployment commands, update flow, and existing-project guidance
Choose the right AI model Model Selection Guide, Lifecycle Benchmark Practical model tradeoffs and measured token costs
Run disciplined agent workflows Agent Philosophy, Testing Protocol Core rules, evidence expectations, and verification standards
Use a specific platform Codex Platform Guide, Claude Platform Guide Platform-specific loading, handoff, and compatibility notes
Understand governance internals Document Governance, Nonlinear Scenarios State, handoff, recovery, and documentation lifecycle rules
Extend skills safely Skill Ecosystem, Token Governance Skill packaging, trigger policy, and context budget control
Document Description
Agent Philosophy 10 core principles
Model Selection Guide Flash vs. Pro guidance
Testing Protocol Testing standards
Project Examples Node.js & Python examples
Token Governance Token optimization strategies
Token Optimization Quickstart Reduce token costs immediately (繁體中文)
Context Budget Context window management
Migration Guide Upgrade an existing install
Codex Platform Guide Codex Web/App adaptation
Claude Platform Guide Claude Code integration
Nonlinear Scenarios Recovery from interrupted sessions
Lifecycle Benchmark 6 real scenarios with token costs
生命週期基準測試 Token 消耗量測(繁體中文)

Contributing

See CONTRIBUTING.md for guidelines on contributing as a human or AI agent.

License

MIT License. See LICENSE for details.


An open-source governance layer for AI coding agents. Contributions and feedback are welcome.

Yorumlar (0)

Sonuc bulunamadi