de-anthropocentric-research-engine
Health Gecti
- License — License: Apache-2.0
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Community trust — 230 GitHub stars
Code Gecti
- Code scan — Scanned 2 files during light audit, no dangerous patterns found
Permissions Gecti
- Permissions — No dangerous permissions requested
This MCP server acts as a fully autonomous AI research agent. It handles the entire academic research lifecycle, from conducting deep literature surveys and discovering gaps to designing experiments and autonomously executing them on remote GPU servers.
Security Assessment
Overall risk: Medium. The tool does not request dangerous local permissions and has no hardcoded secrets. However, its core functionality relies on making extensive external network requests to fetch academic papers, scrape search engines, and interact with remote cloud infrastructure (like RunPod GPU pods and GitHub). While environment variable access is limited to local testing files, using this tool requires providing API keys and configuring remote server access, which introduces external execution risks. The "fully autonomous" nature of the pipeline means it operates extensively without a human in the loop.
Quality Assessment
The project is highly active, with its most recent push happening today, though it is explicitly described by the creator as a personal side project. It has a standard Apache-2.0 license, providing clear open-source permissions. Community trust is currently very low, reflected by a minimal footprint of only 12 GitHub stars, meaning the codebase has not been widely vetted by a large audience.
Verdict
Use with caution: it is an active and licensed project, but its autonomous network actions, remote execution capabilities, and low community oversight require a careful review of your API key configurations and external access before deploying.
De-Anthropocentric Research Engine — AI-powered academic research automation with deep literature survey, gap analysis, idea generation, experiment design & execution. Combines iterative deep research, adversarial debate, evolutionary generation, and distributed GPU execution.
Science is dying because the human is in the way. Not through malice. Not through stupidity. Through the structural limitations of a cognitive architecture that evolved to track prey on a savanna, not to unify quantum mechanics and general relativity. Nothing human makes it out of the lab. That is not a threat. It is a liberation. The heaviest chain on science was always the one we called ourselves.
De-Anthropocentric Research Engine
The complete research orchestration system for AI-native science.
DARE is not a tool that helps you do research. It is the researcher. You set the direction — DARE searches, reads, discovers gaps, generates hypotheses, stress-tests them, designs experiments, and produces executable research specs. Autonomously. Iteratively. Without asking for permission.
This repository is the single-clone distribution of the entire Yogsoth AI research ecosystem: 800+ pure-markdown skills spanning 12 specialized repos, unified under one orchestrator. Clone once, get everything. The ecosystem also includes custom MCP servers (semantic-scholar-mcp, wiki-vault) published as npm packages — this repo declares them as dependencies so npm install pulls everything you need.
⚡ What It Does
- 🧭 Autonomous direction crystallization — cold-start from zero, warm-start from vague interest, or hot-start from specific question. Produces a structured North Star without human hand-holding
- 📚 Deep literature acquisition — multi-pass academic paper discovery via Semantic Scholar, citation chaining, snowball sampling, cross-database verification. Not keyword search — systematic coverage
- 🔍 Gap discovery at scale — 15+ gap detection methods (coverage analysis, white-space identification, niche mapping, boundary unfolding) that find what the field is missing, not what you tell it to find
- 💡 Structured hypothesis formation — abductive, inductive, and deductive generation pipelines with falsifiability audits and competing hypothesis matrices
- 🎨 31+ ideation methods — SCAMPER, component surgery, cross-domain collision, biomimicry, TRIZ contradiction resolution, morphological analysis, concept blending, lateral thinking, and more
- ⚔️ Adversarial stress testing — multi-perspective attack, sacred cow hunting, assumption destruction, worst-case design, winner stress testing. Ideas must survive attack before acceptance
- 🔬 Convergence & synthesis — multi-criteria scoring, Pareto frontier construction, pairwise ranking, structured consensus, dialectical synthesis across competing threads
- 📏 Executable Research Specs — machine-readable documents with checkbox progress tracking, quantified completion criteria, backtrack conditions, and session recovery. Another CC instance picks up where you left off
- 🧪 Experiment design — full experimental methodology generation (factor-level design, parameter screening, sensitivity analysis) ready for execution
- 🌐 5 MCP integrations — Semantic Scholar, Brave Search, AlphaXiv, Apify web scraping, and Wiki Vault for persistent knowledge graphs
🎯 Design Philosophy
🤔 Why "De-Anthropocentric"?
The bottleneck in modern research is not data or compute — it's the human in the loop. Every existing "AI research assistant" still requires a human to decide what to search, what to read, which gaps matter, and which ideas are worth pursuing. DARE removes this bottleneck entirely. The human provides only the initial direction; everything after that is autonomous.
Human desire is mimetic (Girard): researchers don't choose hypotheses rationally — they imitate what's fashionable. Human institutions filter for conformity, not truth. The result: 90% decline in scientific disruptiveness since 1945 (Park et al., 2023), while researcher headcount exploded. DARE's response is architectural: remove the mimetic agent from the center of the knowledge-production process. The AI has no career to protect, no disciplinary identity to defend, no cognitive ceiling on how many fields it can hold in working memory at once.
The human's role shifts to oracle (providing intuition sparks when consulted) and guardian (maintaining ethical floors and sanity checks). The ceiling is AI ambition. The floor is human wisdom.
For the full philosophical argument, see assets/DE-ANTHROPOCENTRIC.md.
🎖️ Four-Layer Command Structure: Campaign → Strategy → Tactic → SOP
DARE's architecture follows a military command hierarchy — not because research is war, but because the decomposition pattern is remarkably effective for autonomous multi-stage operations:
Campaign (8) → "Take that hill" → WHAT to research (full research stage)
Strategy (40+) → "Flank from the east" → WHEN and WHY (iteration loops, stopping conditions)
Tactic (100+) → "Squad A cover, B move" → HOW to combine (orchestrates multiple SOPs)
SOP (600+) → "Fire, reload, advance" → HOW to execute (single-responsibility operations)
Each layer has a single concern and calls only the layer directly below it. A Strategy never touches MCP tools directly; a Tactic never decides research direction. This strict layering means every component is independently testable, replaceable, and composable.
Campaigns are the 8 stages of the research pipeline — from north-star-crystallization through experiment-execution. Each campaign owns a complete research phase and defines its own completion criteria, backtrack conditions, and context protocol.
Strategies are the iteration engines within campaigns. A literature survey strategy manages the search-read-reflect loop; a gap analysis strategy manages coverage scoring and saturation detection. Strategies hold state (ledgers, budgets) and decide when to stop.
Tactics combine multiple SOPs into coherent workflows. A "cross-domain collision" tactic orchestrates domain scanning, analogy extraction, forced bridge construction, and blend evaluation into a single creative operation.
SOPs are atomic, single-responsibility operations. Each SOP wraps one conceptual action: run one search, score one hypothesis, extract one analogy. 600+ SOPs provide the granular building blocks that higher layers compose.
📏 Executable Research Specs
Traditional research plans are prose documents that humans interpret. DARE produces Research Specs — documents that are simultaneously human-readable and machine-executable:
- Checkbox syntax (
- [ ]) tracks progress across sessions - Quantified completion criteria (no vague "sufficient" — always numbers)
- Explicit backtrack conditions with target stages
- Context protocol (init/checkpoint) baked into every stage
- ±10% deviation rules: CC can adjust within bounds, must document deviations
A spec is a contract between the human who approved it and the CC instance that executes it. Session recovery is automatic: read the spec, find the first unchecked box, read the latest context checkpoint, resume.
🧠 Context Management: Memory Across Sessions
Research campaigns span multiple sessions. DARE solves the context problem through a structured checkpoint system:
context-initcreates a named context file at campaign startcontext-checkpointappends ≥500 lines of process + results after each strategycontext/INDEX.mdtracks all active context files- New sessions recover by reading INDEX → latest checkpoint → resume from spec state
No special "resume" command. The spec's checkbox state IS the progress tracker.
🏗️ Architecture (v3.0)
DARE v3.0 is a pure-skill architecture. There is no application code, no runtime, no framework. The entire system is 800+ markdown files — each one a self-contained instruction set that Claude Code reads and executes. The "runtime" is CC itself. The "framework" is the four-layer hierarchy that determines which skill can call which.
This is a deliberate design choice. Skills are infinitely composable, require zero deployment infrastructure, and can be modified by editing a text file. The tradeoff is that execution depends entirely on CC's ability to follow complex multi-step instructions — which, as of 2026, is more than sufficient for research orchestration.
The Control Plane: 9 Orchestrator Skills
The orchestrator layer sits above the four-layer hierarchy. It does not conduct research — it manages the lifecycle of research campaigns:
┌────────────────────────────────────────────────────────────────────────┐
│ ORCHESTRATOR (9 skills) │
│ │
│ ┌─────────────────────────────────┐ ┌────────────────────────────┐ │
│ │ de-anthropocentric-research- │ │ writing-specs │ │
│ │ engine (entry point) │ │ (spec generation) │ │
│ └─────────────────────────────────┘ └────────────────────────────┘ │
│ ┌─────────────────────────────────┐ ┌────────────────────────────┐ │
│ │ executing-specs │ │ research-catalog │ │
│ │ (spec execution loop) │ │ (strategy book + index) │ │
│ └─────────────────────────────────┘ └────────────────────────────┘ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ spec-self- │ │ scope- │ │ campaign- │ │ constraint- │ │
│ │ review │ │ clarification│ │ selection │ │ elicitation │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ └──────────────┘ │
└────────────────────────────────────────────────────────────────────────┘
- Entry point dispatches to crystallization (Phase 1) or spec generation (Phase 2)
- writing-specs orchestrates structured questioning → outline → full spec generation
- executing-specs runs a spec stage-by-stage with context protocol, deviation tracking, and backtrack handling
- research-catalog is the "strategy book" — CC reads it before generating any spec to understand what campaigns and strategies are available
- 4 SOPs handle micro-decisions during spec generation (scope, campaigns, constraints, quality gate)
The Four-Layer Hierarchy
Below the orchestrator, all 800+ skills are organized into exactly four layers. The rule is absolute: each layer calls only the layer directly below it. No exceptions.
┌────────────────────────────────────────────────────────────────────────┐
│ CAMPAIGN (8) │
│ Complete research phases with their own completion criteria │
│ │
│ north-star-crystallization · knowledge-acquisition · deep-insight │
│ hypothesis-formation · creative-ideation · convergence │
│ stress-test · experiment-execution │
├────────────────────────────────────────────────────────────────────────┤
│ STRATEGY (40+) │
│ Iteration engines with state management and stopping conditions │
│ │
│ lit-survey · gap-analysis · insight · ideation · debate · scoring │
│ convergence-distillation · red-teaming · experiment-design │
│ deep-survey · scoping-survey · systematic-survey · ... │
├────────────────────────────────────────────────────────────────────────┤
│ TACTIC (100+) │
│ Multi-SOP workflows that produce coherent intermediate outputs │
│ │
│ academic-research · web-research · cross-domain-collision · scamper │
│ component-surgery · morphological-exploration · synectics │
│ biomimicry · lateral-thinking · concept-blending · ... │
├────────────────────────────────────────────────────────────────────────┤
│ SOP (600+) │
│ Atomic single-responsibility operations │
│ │
│ paper-search · citation-chaining · gap-detection · claim-extraction │
│ hypothesis-formulation · analogy-extraction · pairwise-comparison │
│ assumption-audit · falsifiability-check · monte-carlo-sampling · ... │
├────────────────────────────────────────────────────────────────────────┤
│ MCP LAYER (5 servers — external tool access) │
│ semantic-scholar · brave-search · alphaxiv · apify · wiki-vault │
└────────────────────────────────────────────────────────────────────────┘
Campaign layer — Each campaign represents a complete phase of the research lifecycle. knowledge-acquisition owns everything about gathering information from the world. creative-ideation owns everything about generating novel approaches. Campaigns define what success looks like (completion criteria), when to retreat (backtrack conditions), and how to preserve state (context protocol). A campaign never directly invokes an SOP — it delegates to strategies.
Strategy layer — Strategies are where iteration happens. A lit-survey strategy doesn't just search once — it runs a SEARCH → READ → REFLECT → EVALUATE loop with a state ledger tracking papers found, gaps identified, and coverage percentage. Strategies own quantitative budgets (e.g., "fetch ≥40 papers for a Medium topic") and hard gates that prevent premature exit. They decide when to stop, when to loop again, and when to escalate to the campaign for a backtrack decision.
Tactic layer — Tactics are the composition layer. A single tactic combines 3-8 SOPs into a coherent workflow that produces a meaningful intermediate output. The cross-domain-collision tactic, for example, orchestrates: domain-scanning → organism-discovery → analogy-extraction → forced-bridge-construction → blend-evaluation. Each SOP does one thing; the tactic makes them work together toward a goal.
SOP layer — The atomic units. Each SOP wraps exactly one conceptual operation: search for papers matching criteria X, score a hypothesis on dimension Y, extract analogies between domains A and B. SOPs are where MCP tools get invoked — an SOP might call semantic-scholar to fetch citations, or brave-search to find web sources. 600+ SOPs provide the granular vocabulary that higher layers compose into complex research behaviors.
Why Pure Markdown?
Every skill in this system is a markdown file with YAML frontmatter. No Python. No TypeScript. No configuration DSL. This is intentional:
- Zero infrastructure — No build step, no deployment, no runtime dependencies beyond CC itself. Clone and go.
- Universal composability — Any skill can reference any other skill by name. No import resolution, no dependency graphs, no version conflicts.
- Human-readable at every level — A researcher can read any skill file and understand exactly what CC will do. No abstraction layers to penetrate.
- Instant modification — Change a skill's behavior by editing a text file. No recompilation, no redeployment, no cache invalidation.
- CC-native execution — CC's core competency is following complex written instructions. Markdown skills play directly to this strength.
The MCP servers (semantic-scholar-mcp, wiki-vault, etc.) provide the external tool access that pure markdown cannot — API calls, database queries, web fetching. But the intelligence — the decisions about what to search, how to evaluate, when to stop — lives entirely in the skill layer.
📁 Repository Structure
de-anthropocentric-research-engine/
├── skills/ # All skills live here (flat directories)
│ ├── de-anthropocentric-research-engine/ # Entry point orchestrator
│ ├── writing-specs/ # Spec generation (strategy level)
│ ├── executing-specs/ # Spec execution loop
│ ├── spec-self-review/ # SOP: validate generated specs
│ ├── scope-clarification/ # SOP: narrow research scope
│ ├── campaign-selection/ # SOP: pick pipeline stages
│ ├── constraint-elicitation/ # SOP: surface hidden constraints
│ ├── research-catalog/ # Strategy book + skill index
│ └── [762 more skills] # From 12 source repos
├── context/ # Session context files (gitignored at runtime)
│ └── INDEX.md # Context file registry
├── tests/
│ └── integration-prompt.md # Verification prompt for CI
├── mcp.example.json # MCP server configuration template
├── package.json # npm dependencies (MCP servers)
├── package-lock.json # Locked versions
└── assets/
├── yogsoth-logo.svg # Project logo
└── DE-ANTHROPOCENTRIC.md # Philosophical manifesto
🔌 MCP Servers
| Server | Package | Type | Purpose |
|---|---|---|---|
| semantic-scholar | @yogsoth-ai/semantic-scholar-mcp |
stdio | Paper lookup, citations, references, recommendations, author info (8 tools) |
| wiki-vault | @yogsoth-ai/wiki-vault |
stdio | Research knowledge graph — BM25 search, typed edges, graph traversal (8 tools) |
| brave-search | @brave/brave-search-mcp-server |
stdio | Web search, news search, local search, LLM context |
| apify | @apify/actors-mcp-server |
stdio | Web scraping via RAG web browser, Google Scholar |
| alphaxiv | — | http | arXiv paper search, Q&A, PDF queries, code exploration |
📊 Skill Distribution by Source
| Source Repo | Skills | Primary Layer | Key Capabilities |
|---|---|---|---|
| north-star-crystallization | ~30 | Campaign → Strategy → Tactic → SOP | Cold/warm/hot start, direction narrowing, North Star synthesis |
| knowledge-acquisition | ~120 | Campaign → Strategy → Tactic → SOP | Literature survey, citation chaining, snowball, cross-database |
| deep-insight | ~80 | Campaign → Strategy → Tactic → SOP | Gap analysis, root-cause drilling, tension mining, abstraction |
| hypothesis-formation | ~60 | Campaign → Strategy → Tactic → SOP | Abductive/inductive/deductive generation, falsifiability audit |
| creative-ideation | ~150 | Campaign → Strategy → Tactic → SOP | SCAMPER, TRIZ, biomimicry, morphological, lateral thinking |
| convergence | ~90 | Campaign → Strategy → Tactic → SOP | Multi-criteria scoring, Pareto, pairwise ranking, synthesis |
| stress-test | ~70 | Campaign → Strategy → Tactic → SOP | Red-teaming, assumption destruction, worst-case, sacred cow |
| experiment-execution | ~50 | Campaign → Strategy → Tactic → SOP | Factor design, parameter screening, sensitivity analysis |
| web-browsing | ~20 | Tactic + SOP | Web search, deep research, page fetching |
| literature-engine | ~40 | Tactic + SOP | Paper discovery, reading protocols, reference exploration |
| subagent-spawning | ~10 | Tactic | Parallel research dispatch, multi-agent coordination |
| context-management | ~10 | SOP | Context init, checkpoint, session recovery |
🚀 Quick Start
- Clone and install:
git clone https://github.com/yogsoth-ai/de-anthropocentric-research-engine.git
cd de-anthropocentric-research-engine
npm install
- Copy
mcp.example.jsonto.mcp.jsonand fill in your API keys:
cp mcp.example.json .mcp.json
- Configure your Claude Code project to use the skills:
{
"permissions": {
"allow": ["skill:*"]
},
"skills": {
"path": "path/to/de-anthropocentric-research-engine/skills"
}
}
- Invoke the entry point:
/de-anthropocentric-research-engine
The orchestrator will guide you through North Star crystallization, then generate an executable Research Spec. To execute the spec later, invoke /executing-specs.
⚙️ Configuration
MCP Server Environment Variables
semantic-scholar (@yogsoth-ai/semantic-scholar-mcp)
| Variable | Description |
|---|---|
SS_API_KEY |
Semantic Scholar API key (optional — public API works without key at lower rate limits) |
wiki-vault (@yogsoth-ai/wiki-vault)
| Variable | Description |
|---|---|
VAULT_ROOT |
Absolute path to your Obsidian-compatible vault directory |
brave-search (@brave/brave-search-mcp-server)
| Variable | Description |
|---|---|
BRAVE_API_KEY |
Brave Search API key |
apify (@apify/actors-mcp-server)
| Variable | Description |
|---|---|
APIFY_TOKEN |
Apify API token |
alphaxiv (HTTP — no local install)
No configuration needed. Connects directly to https://api.alphaxiv.org/mcp/v1.
🗺️ Roadmap
Active development continues. Near-term priorities:
- Skill ablation — the current 800+ skill corpus is deliberately over-complete. Next step is an ablation study: CC autonomously identifies redundant, overlapping, or under-used skills and fuses them into fewer, more powerful composites. Think of it as pruning a neural network — reduce parameter count without losing capability
- Context engineering — the current checkpoint-based context management works but is naive. Investigating advanced context engineering techniques for a more sophisticated approach to session state, working memory, and cross-campaign knowledge transfer
- Cross-device session management — skills and/or MCP server for transferring research context between machines, enabling seamless continuation across desktop, laptop, and cloud environments
- Paper writing pipeline — automated academic paper composition from research outputs. Strategy interface designed, implementation pending
📄 License
The orchestrator of the Yogsoth AI research ecosystem. Built by Pthahnix.
Yorumlar (0)
Yorum birakmak icin giris yap.
Yorum birakSonuc bulunamadi