best-agent
Health Warn
- License — License: MIT
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Low visibility — Only 6 GitHub stars
Code Fail
- rm -rf — Recursive force deletion command in .claude/settings.json
Permissions Pass
- Permissions — No dangerous permissions requested
Security Assessment: Risk: High. The tool's core design requires highly privileged actions. It relies on piping remote scripts directly into bash for installation. Because it is built to execute tasks a human would do, it inherently runs shell commands without requiring manual approval ("zero approval prompts"). The automated scanner flagged a recursive force deletion command (`rm -rf`) inside its configuration files, which poses a significant risk of unintended data loss. While no hardcoded secrets or explicit dangerous network permissions were found, granting an autonomous script full control over your computer carries severe security implications.
Quality Assessment: Risk: Low. The project is active, with recent updates, and is licensed under the MIT License. However, community trust is minimal at this stage: it has only 6 GitHub stars, which indicates very low visibility and little public review. For a tool that demands sweeping system access, this lack of community vetting is a major concern.
Verdict: Not recommended for production or sensitive environments due to its highly autonomous nature, zero-approval execution model, and low community visibility.
Self-evolving Claude Code wrapper — handles any computer work a human can do. 96+ skills, 14 agents, computer use, self-improvement.
Best Agent
Autonomous AI agent that does ANY task a human can do on a computer. Code, run companies, conduct research, create content, manage projects. Self-improving. Never stops.
curl -fsSL https://raw.githubusercontent.com/fainir/best-agent/main/install.sh | bash
What It Does
Best Agent wraps Claude Code with mechanical enforcement, self-improvement, and project management that together make it autonomous. It handles 7 project types out of the box.
| Project Type | What It Creates | Key Processes |
|---|---|---|
| SaaS/Coding | plan.md, design.md, tasks.md | daily-build, weekly-review |
| Company Ops | budget.md, stakeholders.md, kpis.md | daily-ops, weekly-review, monthly-retro |
| Research | experiments.md, pipeline.md, paper-outline.md | experiment-cycle, writing-cycle |
| Agency | clients/, resource-allocation.md | sprint-cycle, client-review |
| Open Source | api-surface.md, community.md | release-cycle, community-check |
| Content | content-calendar.md | editorial-calendar |
| Infrastructure | runbooks/ | change-cycle, weekly-audit |
Key Features
- 30 hooks — Mechanical enforcement (exit 2 blocks). Can't code without plan. Can't stop with pending tasks. Can't commit secrets.
- 96+ skills — /init-project, /company, /research, /harness, /deploy, /security-audit, and 90 more
- Self-improvement loop — Karpathy-pattern: run evals → analyze failures → fix rules → re-eval → keep/revert
- Auto-handoff — Writes .claude/handoff.md mechanically at 50 tool calls and on context compaction
- Never-stop outer loop — Restarts Claude with fresh context, circuit breaker after 5 failures
- 11 dashboards — Board, timeline, metrics, calendar, process, architecture — all interactive HTML
- Hub — Multi-machine orchestration via WebSocket, goal decomposition, skill-based routing
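The never-stop outer loop in the list above can be pictured as a restart loop that gives up after a run of failures. The following is a minimal sketch under stated assumptions, not the project's actual script: the function name `run_with_breaker` is hypothetical, and it assumes the wrapped command exits 0 once the work is finished. Only the threshold of 5 comes from the feature list.

```shell
# Hypothetical sketch of a restart loop with a circuit breaker.
# Assumption: the wrapped command exits 0 when all work is finished.

run_with_breaker() {
  local max_failures="$1"; shift
  local failures=0
  while true; do
    if "$@"; then
      return 0                      # command reported success: stop looping
    fi
    failures=$((failures + 1))      # count this failed run
    if [ "$failures" -ge "$max_failures" ]; then
      echo "circuit breaker tripped after $failures failures" >&2
      return 1
    fi
  done
}
```

A call such as `run_with_breaker 5 some_agent_session` (where `some_agent_session` is a placeholder for whatever starts one agent run) would retry until success or until the fifth failure. A real loop would probably also pause between restarts.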
Quick Start
Install
curl -fsSL https://raw.githubusercontent.com/fainir/best-agent/main/install.sh | bash
This clones the repo, copies hooks/rules/skills to ~/.claude/, and links the CLI tools.
Usage
# Interactive mode — full autonomy, zero approval prompts
cloudbot-harness
# Or use the best-agent CLI
best-agent # Interactive mode
best-agent init # Initialize project files
best-agent run "build a blog" # Never-stop loop mode
best-agent process # Run overdue processes
best-agent ops # Company operations loop
best-agent research # Research experiment loop
best-agent eval # Run eval suite
best-agent improve # Self-improvement loop
best-agent status # Project status
What Happens
When you enter any git project, the agent:
- Detects project type (SaaS, company, research, etc.)
- Creates plan.md, strategy.md, design.md, tasks.md, knowledge.md, progress.md
- Creates type-specific files (budget.md for company, experiments.md for research)
- Sets up Process Maker with recurring workflows
- Starts working through the plan, marking [~] → [x] as tasks complete
- Writes handoff.md before context limits for seamless cross-session continuity
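The handoff step in the list above could work roughly like the sketch below, which counts tool calls in a file and writes a short handoff note when a threshold is reached. This is an illustration only: the function name `record_tool_call`, the counter file name `tool-call-count`, and the contents of the note are all assumptions. The default threshold of 50 is the figure given elsewhere in this README.

```shell
# Hypothetical sketch: count tool calls and write a handoff note at a threshold.
# The counter file name and the handoff contents are assumptions.

record_tool_call() {
  local dir="$1" threshold="${2:-50}"
  local counter="$dir/tool-call-count" count=0 next
  if [ -f "$counter" ]; then
    count=$(cat "$counter")
  fi
  count=$((count + 1))
  echo "$count" > "$counter"
  if [ "$count" -eq "$threshold" ]; then
    # Record the first still-open task so the next session knows where to resume.
    next=$(grep -m1 -F '[ ]' "$dir/plan.md" 2>/dev/null || echo "none found")
    printf '# Handoff\n\nTool calls so far: %s\nNext open task: %s\n' \
      "$count" "$next" > "$dir/handoff.md"
  fi
}
```

Calling `record_tool_call .claude` once per tool call would produce `.claude/handoff.md` on the 50th call.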
The Core Loop
1. Read .claude/plan.md → find next [ ] task
2. Mark [~] → do the work → verify → mark [x]
3. Update tasks.md + progress.md
4. Go to 1. NEVER STOP.
Hooks enforce this mechanically. The agent cannot write code without a plan, cannot skip [~] marking, and cannot stop with pending tasks.
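Steps 1 and 2 of the loop amount to simple text operations on the plan file. The helpers below are a sketch, assuming tasks are lines that contain a `[ ]`, `[~]`, or `[x]` marker. The names `next_task` and `advance_task` are hypothetical and do not come from the project.

```shell
# Hypothetical helpers for a checkbox-style plan file.
# Assumption: tasks are lines containing "[ ]", "[~]", or "[x]".

next_task() {            # print the first pending task line, if any
  grep -m1 -F '[ ]' "$1"
}

advance_task() {         # change the first "[from]" marker in the file to "[to]"
  local plan="$1" from="$2" to="$3" tmp
  tmp=$(mktemp)
  awk -v from="[$from]" -v to="[$to]" '
    !changed && (i = index($0, from)) {
      $0 = substr($0, 1, i - 1) to substr($0, i + length(from))
      changed = 1
    }
    { print }
  ' "$plan" > "$tmp" && mv "$tmp" "$plan"
}
```

With these, `advance_task .claude/plan.md ' ' '~'` claims the next task and `advance_task .claude/plan.md '~' 'x'` marks it done after verification.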
Enforcement (Hooks)
| Gate | Hook | Behavior |
|---|---|---|
| No code without plan | enforce-planning-gate.sh | BLOCKS (exit 2) |
| No code without [~] task | verify-plan-following.sh | BLOCKS (exit 2) |
| No stop with pending tasks | check-completion.sh | BLOCKS (exit 2) |
| No .env writes | protect-files.sh | BLOCKS (exit 2) |
| Auto-handoff at 50 calls | auto-handoff.sh | Mechanical write |
| Plan re-read at 20/40/60/80 | tool-call-counter.sh | Warning |
| Dashboard sync reminder | dashboard-sync-reminder.sh | Warning |
| Next task surfacing | load-state.sh | Context injection |
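In Claude Code, a hook that exits with code 2 blocks the pending action, and its stderr is passed back to the model. A planning gate in that style might look like the sketch below. It is written as a function (`planning_gate`, a hypothetical name) so it can be tried safely, and it is not the contents of enforce-planning-gate.sh, which this README does not show.

```shell
# Hypothetical planning gate, written as a function so it can be tried safely.
# In a real hook script the return code would be the script's exit code.

planning_gate() {
  local project_dir="$1"
  # -s is true only if the file exists and is not empty.
  if [ ! -s "$project_dir/.claude/plan.md" ]; then
    echo "Blocked: no plan found at .claude/plan.md. Write a plan first." >&2
    return 2             # code 2 corresponds to BLOCKS in the table above
  fi
  return 0
}
```

The warning-only hooks in the table would presumably print their message and return 0 instead, so the action proceeds.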
Self-Improvement
The system improves itself continuously using the Karpathy/AutoResearch pattern:
1. Run eval suite (150+ tasks)
2. Group failures by ROOT CAUSE
3. Propose ONE change (rules, hooks, skills, prompts)
4. Overfitting test: "Would this help even if the failing task disappeared?"
5. Commit → re-eval → keep if improved, revert if not
6. Log to results.tsv
7. Repeat forever
Run it: best-agent improve
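Steps 5 and 6 reduce to comparing two eval scores and logging the decision. The sketch below assumes integer scores (for example, the number of eval tasks passed). The function name `keep_or_revert` and the results.tsv column layout are assumptions for illustration; the README does not specify them.

```shell
# Hypothetical keep-or-revert step. Scores are integers (e.g. tasks passed);
# the results.tsv column layout used here is an assumption.

keep_or_revert() {
  local before="$1" after="$2" label="$3" log="${4:-results.tsv}"
  local decision="revert"
  if [ "$after" -gt "$before" ]; then
    decision="keep"      # keep only on a strict improvement
  fi
  # Columns: label, score before, score after, decision
  printf '%s\t%s\t%s\t%s\n' "$label" "$before" "$after" "$decision" >> "$log"
  echo "$decision"
}
```

A caller could act on the printed decision, for instance by running `git revert --no-edit HEAD` when the result is `revert`. Requiring a strict improvement means a change that leaves the score unchanged is also reverted.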
Hub (Multi-Machine)
The Hub connects multiple machines via WebSocket for orchestrated work:
Browser (any device)
↕ WebSocket (JWT auth)
Hub Server (Express + SQLite, port 3141)
↕ WebSocket (machine token)
Daemon (spawns Claude processes, PTY terminals, screen capture)
Features: goal decomposition, skill-based task routing, dynamic company engine, real-time dashboards.
cd hub && npm install && npm start # Server
HUB_URL=ws://server:3141 npm run daemon # Each machine
Architecture
~/.claude/
├── CLAUDE.md .............. 47 lines — identity + work loop
├── settings.json .......... Permissions + 30 hook registrations
├── hooks/ ................. 30 bash scripts (enforcement + context)
├── rules/ ................. 13 rule files (1,040 lines total)
├── skills/ ................ 96+ slash commands
├── config/
│   └── bypass-permissions.json # Zero-prompt mode for cloudbot-harness
└── projects/{hash}/
    └── memory/ ............ Auto-memory per project
Per-project (auto-created):
.claude/
├── plan.md ................ Source of truth (tasks + phases + DoD)
├── tasks.md ............... Active board
├── strategy.md ............ Vision + goals + constraints
├── design.md .............. Architecture + data model
├── knowledge.md ........... Stack info + gotchas
├── progress.md ............ Status report
├── process-maker.json ..... Recurring workflows
├── process-state.json ..... Runtime state
├── handoff.md ............. Auto-generated cross-session context
├── learnings.md ........... Long-term project memory
├── kb/ .................... Knowledge base wiki
└── *.html ................. Interactive dashboards
Requirements
- Claude Code CLI (Max/Team/Enterprise subscription)
- macOS or Linux (Windows via WSL)
- Git, Node.js 18+
License
MIT