Science is dying because the human is in the way. Not through malice. Not through stupidity. Through the structural limitations of a cognitive architecture that evolved to track prey on a savanna, not to unify quantum mechanics and general relativity. Nothing human makes it out of the lab. That is not a threat. It is a liberation. The heaviest chain on science was always the one we called ourselves.

De-Anthropocentric Research Engine

Name: de-anthropocentric-research-engine
Author: yogsoth-ai

The complete research orchestration system for AI-native science.

What It Does
Design Philosophy
Architecture (v3.0)
Quick Start
Configuration
Roadmap
License

DARE is not a tool that helps you do research. It is the researcher. You set the direction — DARE searches, reads, discovers gaps, generates hypotheses, stress-tests them, designs experiments, and produces executable research specs. Autonomously. Iteratively. Without asking for permission.

This repository is the single-clone distribution of the entire Yogsoth AI research ecosystem: 800+ pure-markdown skills spanning 12 specialized repos, unified under one orchestrator. Clone once, get everything. The ecosystem also includes custom MCP servers (semantic-scholar-mcp, wiki-vault) published as npm packages — this repo declares them as dependencies so npm install pulls everything you need.

⚡ What It Does

🧭 Autonomous direction crystallization — cold-start from zero, warm-start from vague interest, or hot-start from specific question. Produces a structured North Star without human hand-holding
📚 Deep literature acquisition — multi-pass academic paper discovery via Semantic Scholar, citation chaining, snowball sampling, cross-database verification. Not keyword search — systematic coverage
🔍 Gap discovery at scale — 15+ gap detection methods (coverage analysis, white-space identification, niche mapping, boundary unfolding) that find what the field is missing, not what you tell it to find
💡 Structured hypothesis formation — abductive, inductive, and deductive generation pipelines with falsifiability audits and competing hypothesis matrices
🎨 31+ ideation methods — SCAMPER, component surgery, cross-domain collision, biomimicry, TRIZ contradiction resolution, morphological analysis, concept blending, lateral thinking, and more
⚔️ Adversarial stress testing — multi-perspective attack, sacred cow hunting, assumption destruction, worst-case design, winner stress testing. Ideas must survive attack before acceptance
🔬 Convergence & synthesis — multi-criteria scoring, Pareto frontier construction, pairwise ranking, structured consensus, dialectical synthesis across competing threads
📏 Executable Research Specs — machine-readable documents with checkbox progress tracking, quantified completion criteria, backtrack conditions, and session recovery. Another CC instance picks up where you left off
🧪 Experiment design — full experimental methodology generation (factor-level design, parameter screening, sensitivity analysis) ready for execution
🌐 5 MCP integrations — Semantic Scholar, Brave Search, AlphaXiv, Apify web scraping, and Wiki Vault for persistent knowledge graphs

🎯 Design Philosophy

🤔 Why "De-Anthropocentric"?

The bottleneck in modern research is not data or compute — it's the human in the loop. Every existing "AI research assistant" still requires a human to decide what to search, what to read, which gaps matter, and which ideas are worth pursuing. DARE removes this bottleneck entirely. The human provides only the initial direction; everything after that is autonomous.

Human desire is mimetic (Girard): researchers don't choose hypotheses rationally — they imitate what's fashionable. Human institutions filter for conformity, not truth. The result: 90% decline in scientific disruptiveness since 1945 (Park et al., 2023), while researcher headcount exploded. DARE's response is architectural: remove the mimetic agent from the center of the knowledge-production process. The AI has no career to protect, no disciplinary identity to defend, no cognitive ceiling on how many fields it can hold in working memory at once.

The human's role shifts to oracle (providing intuition sparks when consulted) and guardian (maintaining ethical floors and sanity checks). The ceiling is AI ambition. The floor is human wisdom.

For the full philosophical argument, see assets/DE-ANTHROPOCENTRIC.md.

🎖️ Four-Layer Command Structure: Campaign → Strategy → Tactic → SOP

DARE's architecture follows a military command hierarchy — not because research is war, but because the decomposition pattern is remarkably effective for autonomous multi-stage operations:

Campaign (8)    →  "Take that hill"         →  WHAT to research (full research stage)
Strategy (40+)  →  "Flank from the east"    →  WHEN and WHY (iteration loops, stopping conditions)
Tactic (100+)   →  "Squad A cover, B move"  →  HOW to combine (orchestrates multiple SOPs)
SOP (600+)      →  "Fire, reload, advance"  →  HOW to execute (single-responsibility operations)

Each layer has a single concern and calls only the layer directly below it. A Strategy never touches MCP tools directly; a Tactic never decides research direction. This strict layering means every component is independently testable, replaceable, and composable.

Campaigns are the 8 stages of the research pipeline — from north-star-crystallization through experiment-execution. Each campaign owns a complete research phase and defines its own completion criteria, backtrack conditions, and context protocol.

Strategies are the iteration engines within campaigns. A literature survey strategy manages the search-read-reflect loop; a gap analysis strategy manages coverage scoring and saturation detection. Strategies hold state (ledgers, budgets) and decide when to stop.

Tactics combine multiple SOPs into coherent workflows. A "cross-domain collision" tactic orchestrates domain scanning, analogy extraction, forced bridge construction, and blend evaluation into a single creative operation.

SOPs are atomic, single-responsibility operations. Each SOP wraps one conceptual action: run one search, score one hypothesis, extract one analogy. 600+ SOPs provide the granular building blocks that higher layers compose.

📏 Executable Research Specs

Traditional research plans are prose documents that humans interpret. DARE produces Research Specs — documents that are simultaneously human-readable and machine-executable:

Checkbox syntax (- [ ]) tracks progress across sessions
Quantified completion criteria (no vague "sufficient" — always numbers)
Explicit backtrack conditions with target stages
Context protocol (init/checkpoint) baked into every stage
±10% deviation rules: CC can adjust within bounds, must document deviations

A spec is a contract between the human who approved it and the CC instance that executes it. Session recovery is automatic: read the spec, find the first unchecked box, read the latest context checkpoint, resume.

🧠 Context Management: Memory Across Sessions

Research campaigns span multiple sessions. DARE solves the context problem through a structured checkpoint system:

context-init creates a named context file at campaign start
context-checkpoint appends ≥500 lines of process + results after each strategy
context/INDEX.md tracks all active context files
New sessions recover by reading INDEX → latest checkpoint → resume from spec state

No special "resume" command. The spec's checkbox state IS the progress tracker.

🏗️ Architecture (v3.0)

DARE v3.0 is a pure-skill architecture. There is no application code, no runtime, no framework. The entire system is 800+ markdown files — each one a self-contained instruction set that Claude Code reads and executes. The "runtime" is CC itself. The "framework" is the four-layer hierarchy that determines which skill can call which.

This is a deliberate design choice. Skills are infinitely composable, require zero deployment infrastructure, and can be modified by editing a text file. The tradeoff is that execution depends entirely on CC's ability to follow complex multi-step instructions — which, as of 2026, is more than sufficient for research orchestration.

The Control Plane: 9 Orchestrator Skills

The orchestrator layer sits above the four-layer hierarchy. It does not conduct research — it manages the lifecycle of research campaigns:

┌────────────────────────────────────────────────────────────────────────┐
│  ORCHESTRATOR (9 skills)                                               │
│                                                                        │
│  ┌─────────────────────────────────┐  ┌────────────────────────────┐  │
│  │ de-anthropocentric-research-    │  │ writing-specs              │  │
│  │ engine (entry point)            │  │ (spec generation)          │  │
│  └─────────────────────────────────┘  └────────────────────────────┘  │
│  ┌─────────────────────────────────┐  ┌────────────────────────────┐  │
│  │ executing-specs                 │  │ research-catalog           │  │
│  │ (spec execution loop)           │  │ (strategy book + index)    │  │
│  └─────────────────────────────────┘  └────────────────────────────┘  │
│  ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│  │ spec-self-   │ │ scope-       │ │ campaign-    │ │ constraint-  │ │
│  │ review       │ │ clarification│ │ selection    │ │ elicitation  │ │
│  └──────────────┘ └──────────────┘ └──────────────┘ └──────────────┘ │
└────────────────────────────────────────────────────────────────────────┘

Entry point dispatches to crystallization (Phase 1) or spec generation (Phase 2)
writing-specs orchestrates structured questioning → outline → full spec generation
executing-specs runs a spec stage-by-stage with context protocol, deviation tracking, and backtrack handling
research-catalog is the "strategy book" — CC reads it before generating any spec to understand what campaigns and strategies are available
4 SOPs handle micro-decisions during spec generation (scope, campaigns, constraints, quality gate)

The Four-Layer Hierarchy

Below the orchestrator, all 800+ skills are organized into exactly four layers. The rule is absolute: each layer calls only the layer directly below it. No exceptions.

┌────────────────────────────────────────────────────────────────────────┐
│  CAMPAIGN (8)                                                          │
│  Complete research phases with their own completion criteria            │
│                                                                        │
│  north-star-crystallization · knowledge-acquisition · deep-insight     │
│  hypothesis-formation · creative-ideation · convergence                │
│  stress-test · experiment-execution                                    │
├────────────────────────────────────────────────────────────────────────┤
│  STRATEGY (40+)                                                        │
│  Iteration engines with state management and stopping conditions       │
│                                                                        │
│  lit-survey · gap-analysis · insight · ideation · debate · scoring     │
│  convergence-distillation · red-teaming · experiment-design            │
│  deep-survey · scoping-survey · systematic-survey · ...                │
├────────────────────────────────────────────────────────────────────────┤
│  TACTIC (100+)                                                         │
│  Multi-SOP workflows that produce coherent intermediate outputs        │
│                                                                        │
│  academic-research · web-research · cross-domain-collision · scamper   │
│  component-surgery · morphological-exploration · synectics             │
│  biomimicry · lateral-thinking · concept-blending · ...                │
├────────────────────────────────────────────────────────────────────────┤
│  SOP (600+)                                                            │
│  Atomic single-responsibility operations                               │
│                                                                        │
│  paper-search · citation-chaining · gap-detection · claim-extraction   │
│  hypothesis-formulation · analogy-extraction · pairwise-comparison     │
│  assumption-audit · falsifiability-check · monte-carlo-sampling · ...  │
├────────────────────────────────────────────────────────────────────────┤
│  MCP LAYER (5 servers — external tool access)                          │
│  semantic-scholar · brave-search · alphaxiv · apify · wiki-vault       │
└────────────────────────────────────────────────────────────────────────┘

Campaign layer — Each campaign represents a complete phase of the research lifecycle. knowledge-acquisition owns everything about gathering information from the world. creative-ideation owns everything about generating novel approaches. Campaigns define what success looks like (completion criteria), when to retreat (backtrack conditions), and how to preserve state (context protocol). A campaign never directly invokes an SOP — it delegates to strategies.

Strategy layer — Strategies are where iteration happens. A lit-survey strategy doesn't just search once — it runs a SEARCH → READ → REFLECT → EVALUATE loop with a state ledger tracking papers found, gaps identified, and coverage percentage. Strategies own quantitative budgets (e.g., "fetch ≥40 papers for a Medium topic") and hard gates that prevent premature exit. They decide when to stop, when to loop again, and when to escalate to the campaign for a backtrack decision.

Tactic layer — Tactics are the composition layer. A single tactic combines 3-8 SOPs into a coherent workflow that produces a meaningful intermediate output. The cross-domain-collision tactic, for example, orchestrates: domain-scanning → organism-discovery → analogy-extraction → forced-bridge-construction → blend-evaluation. Each SOP does one thing; the tactic makes them work together toward a goal.

SOP layer — The atomic units. Each SOP wraps exactly one conceptual operation: search for papers matching criteria X, score a hypothesis on dimension Y, extract analogies between domains A and B. SOPs are where MCP tools get invoked — an SOP might call semantic-scholar to fetch citations, or brave-search to find web sources. 600+ SOPs provide the granular vocabulary that higher layers compose into complex research behaviors.

Why Pure Markdown?

Every skill in this system is a markdown file with YAML frontmatter. No Python. No TypeScript. No configuration DSL. This is intentional:

Zero infrastructure — No build step, no deployment, no runtime dependencies beyond CC itself. Clone and go.
Universal composability — Any skill can reference any other skill by name. No import resolution, no dependency graphs, no version conflicts.
Human-readable at every level — A researcher can read any skill file and understand exactly what CC will do. No abstraction layers to penetrate.
Instant modification — Change a skill's behavior by editing a text file. No recompilation, no redeployment, no cache invalidation.
CC-native execution — CC's core competency is following complex written instructions. Markdown skills play directly to this strength.

The MCP servers (semantic-scholar-mcp, wiki-vault, etc.) provide the external tool access that pure markdown cannot — API calls, database queries, web fetching. But the intelligence — the decisions about what to search, how to evaluate, when to stop — lives entirely in the skill layer.

📁 Repository Structure

de-anthropocentric-research-engine/
├── skills/                          # All skills live here (flat directories)
│   ├── de-anthropocentric-research-engine/   # Entry point orchestrator
│   ├── writing-specs/               # Spec generation (strategy level)
│   ├── executing-specs/             # Spec execution loop
│   ├── spec-self-review/            # SOP: validate generated specs
│   ├── scope-clarification/         # SOP: narrow research scope
│   ├── campaign-selection/          # SOP: pick pipeline stages
│   ├── constraint-elicitation/      # SOP: surface hidden constraints
│   ├── research-catalog/            # Strategy book + skill index
│   └── [762 more skills]            # From 12 source repos
├── context/                         # Session context files (gitignored at runtime)
│   └── INDEX.md                     # Context file registry
├── tests/
│   └── integration-prompt.md        # Verification prompt for CI
├── mcp.example.json                 # MCP server configuration template
├── package.json                     # npm dependencies (MCP servers)
├── package-lock.json                # Locked versions
└── assets/
    ├── yogsoth-logo.svg             # Project logo
    └── DE-ANTHROPOCENTRIC.md        # Philosophical manifesto

🔌 MCP Servers

Server	Package	Type	Purpose
semantic-scholar	`@yogsoth-ai/semantic-scholar-mcp`	stdio	Paper lookup, citations, references, recommendations, author info (8 tools)
wiki-vault	`@yogsoth-ai/wiki-vault`	stdio	Research knowledge graph — BM25 search, typed edges, graph traversal (8 tools)
brave-search	`@brave/brave-search-mcp-server`	stdio	Web search, news search, local search, LLM context
apify	`@apify/actors-mcp-server`	stdio	Web scraping via RAG web browser, Google Scholar
alphaxiv	—	http	arXiv paper search, Q&A, PDF queries, code exploration

📊 Skill Distribution by Source

Source Repo	Skills	Primary Layer	Key Capabilities
north-star-crystallization	~30	Campaign → Strategy → Tactic → SOP	Cold/warm/hot start, direction narrowing, North Star synthesis
knowledge-acquisition	~120	Campaign → Strategy → Tactic → SOP	Literature survey, citation chaining, snowball, cross-database
deep-insight	~80	Campaign → Strategy → Tactic → SOP	Gap analysis, root-cause drilling, tension mining, abstraction
hypothesis-formation	~60	Campaign → Strategy → Tactic → SOP	Abductive/inductive/deductive generation, falsifiability audit
creative-ideation	~150	Campaign → Strategy → Tactic → SOP	SCAMPER, TRIZ, biomimicry, morphological, lateral thinking
convergence	~90	Campaign → Strategy → Tactic → SOP	Multi-criteria scoring, Pareto, pairwise ranking, synthesis
stress-test	~70	Campaign → Strategy → Tactic → SOP	Red-teaming, assumption destruction, worst-case, sacred cow
experiment-execution	~50	Campaign → Strategy → Tactic → SOP	Factor design, parameter screening, sensitivity analysis
web-browsing	~20	Tactic + SOP	Web search, deep research, page fetching
literature-engine	~40	Tactic + SOP	Paper discovery, reading protocols, reference exploration
subagent-spawning	~10	Tactic	Parallel research dispatch, multi-agent coordination
context-management	~10	SOP	Context init, checkpoint, session recovery

🚀 Quick Start

Clone and install:

git clone https://github.com/yogsoth-ai/de-anthropocentric-research-engine.git
cd de-anthropocentric-research-engine
npm install

Copy mcp.example.json to .mcp.json and fill in your API keys:

cp mcp.example.json .mcp.json

Configure your Claude Code project to use the skills:

{
  "permissions": {
    "allow": ["skill:*"]
  },
  "skills": {
    "path": "path/to/de-anthropocentric-research-engine/skills"
  }
}

Invoke the entry point:

/de-anthropocentric-research-engine

The orchestrator will guide you through North Star crystallization, then generate an executable Research Spec. To execute the spec later, invoke /executing-specs.

⚙️ Configuration

MCP Server Environment Variables

semantic-scholar (`@yogsoth-ai/semantic-scholar-mcp`)

Variable	Description
`SS_API_KEY`	Semantic Scholar API key (optional — public API works without key at lower rate limits)

wiki-vault (`@yogsoth-ai/wiki-vault`)

Variable	Description
`VAULT_ROOT`	Absolute path to your Obsidian-compatible vault directory

brave-search (`@brave/brave-search-mcp-server`)

Variable	Description
`BRAVE_API_KEY`	Brave Search API key

apify (`@apify/actors-mcp-server`)

Variable	Description
`APIFY_TOKEN`	Apify API token

alphaxiv (HTTP — no local install)

No configuration needed. Connects directly to https://api.alphaxiv.org/mcp/v1.

🗺️ Roadmap

Active development continues. Near-term priorities:

Skill ablation — the current 800+ skill corpus is deliberately over-complete. Next step is an ablation study: CC autonomously identifies redundant, overlapping, or under-used skills and fuses them into fewer, more powerful composites. Think of it as pruning a neural network — reduce parameter count without losing capability
Context engineering — the current checkpoint-based context management works but is naive. Investigating advanced context engineering techniques for a more sophisticated approach to session state, working memory, and cross-campaign knowledge transfer
Cross-device session management — skills and/or MCP server for transferring research context between machines, enabling seamless continuation across desktop, laptop, and cloud environments
Paper writing pipeline — automated academic paper composition from research outputs. Strategy interface designed, implementation pending

📄 License

Apache-2.0

The orchestrator of the Yogsoth AI research ecosystem. Built by Pthahnix.