EvoSkills

agent
SUMMARY

๐Ÿงฌ Extend EvoScientist with Installable Skill & Knowledge Packs

README.md

๐Ÿงฌ EvoSkills

The official skill repository for EvoScientist. Each skill is an installable knowledge pack that extends EvoScientist with domain-specific expertise.

๐Ÿ“ฆ Installation

[!IMPORTANT]
These skills are purpose-built for EvoScientist โ€” together they amplify each other, unlocking the full potential of both the agent and the skills. Under EvoScientist, skills evolve across research cycles through persistent memory (evo-memory).

In-session commands

Install all skills at once:

/install-skill EvoScientist/EvoSkills@skills

Or install a single skill:

/install-skill EvoScientist/EvoSkills@skills/paper-planning

Ask EvoScientist directly

Simply ask the agent in conversation:

"Install all skills from EvoScientist/EvoSkills@skills."

[!TIP]
Not using EvoScientist? These skills are compatible with any coding agent.
One command via skills.sh to install on Claude Code, OpenCode, Cursor, Codex, Gemini CLI, DeepAgents, and more:

npx skills add EvoScientist/EvoSkills

โœจ Available Skills

Skill Description
research-ideation ๐Ÿ’ก Research ideation, literature tree & problem finding
idea-tournament ๐Ÿ† Competitive idea ranking & proposal generation
paper-planning ๐Ÿ“ Research paper planning & outline generation
experiment-pipeline ๐Ÿงช Structured 4-stage experiment execution
experiment-craft ๐Ÿ”ง Experiment debugging, logging & iteration
experiment-iterative-coder ๐Ÿ”„ Iterative code refinement (plan โ†’ code โ†’ evaluate โ†’ refine)
paper-writing โœ๏ธ End-to-end paper writing assistance
paper-review ๐Ÿ” Automated paper review & feedback
paper-rebuttal ๐Ÿ’ฌ Rebuttal writing after peer review
academic-slides ๐ŸŽค Academic presentation & research talk creation
evo-memory ๐Ÿง  Persistent research memory & self-evolution
paper-navigator ๐Ÿ“š End-to-end academic paper discovery, reading & organization
nano-banana ๐ŸŒ AI-generated presentation slides & illustrations via Gemini

Paper Suite + Self-Evolution Suite: Each skill is self-contained โ€” use them individually or combine freely. The self-evolution skills (idea-tournament, experiment-pipeline, evo-memory) form a learning loop that improves across research cycles.

๐Ÿ”Œ MCP Server Marketplace

The mcp/ directory contains a curated collection of MCP servers that extend agents with external tools โ€” web search, academic paper retrieval, documentation lookup, and more. Browse the full list or install directly:

/install-mcp              # interactive browser
EvoSci mcp install arxiv  # install by name

โ›ณ๏ธ Framework Overview

EvoScientist Framework

The diagram above shows the full EvoScientist pipeline. The Researcher Agent (top, blue) runs idea tree search and Elo tournament ranking to produce a research proposal. The Engineer Agent (bottom, green) executes the 4-stage experiment pipeline. The Evolution Manager Agent (right) manages three memory evolution mechanisms โ€” IDE, IVE, and ESE โ€” that feed learned knowledge back into Ideation Memory (M_I) and Experimentation Memory (M_E) for future cycles.

๐ŸŽข Skill Pipeline

flowchart LR
    A["<b>๐Ÿ”ฌ Research Phase</b><br/>๐Ÿ’ก research-ideation<br/>๐Ÿ† idea-tournament"]
    --> B["<b>โš™๏ธ Experiment Phase</b><br/>๐Ÿ“ paper-planning<br/>๐Ÿงช experiment-pipeline<br/>๐Ÿ”ง experiment-craft<br/>๐Ÿ”„ experiment-iterative-coder"]
    --> C["<b>๐Ÿ“ Writing Phase</b><br/>โœ๏ธ paper-writing<br/>๐Ÿ” paper-review<br/>๐Ÿ’ฌ paper-rebuttal<br/>๐ŸŽค academic-slides"]

    D[("๐Ÿง  evo-memory<br/>(IDE ยท IVE ยท ESE)")] <--> A
    D <--> B
    E["๐Ÿ“š paper-navigator<br/>(standalone)"] -.-> A
    E -.-> B
    F["๐ŸŒ nano-banana<br/>(standalone)"] -.-> C

    style A fill:#7C3AED,stroke:#5B21B6,stroke-width:2px,color:#fff
    style B fill:#D97706,stroke:#B45309,stroke-width:2px,color:#fff
    style C fill:#16A34A,stroke:#15803D,stroke-width:2px,color:#fff
    style D fill:#475569,stroke:#334155,stroke-width:2px,color:#fff
    style E fill:#0369A1,stroke:#075985,stroke-width:2px,color:#fff
    style F fill:#D97706,stroke:#B45309,stroke-width:2px,color:#fff

๐Ÿ’ก research-ideation โ€” Literature Tree & Problem Finding

The starting point of the research pipeline. Guides ideation from literature analysis to solution design:

  • Literature Tree โ€” Build a novelty tree and challenge-insight tree to map the research landscape
  • Problem Selection โ€” 4-level well-established solution check to identify open problems worth pursuing
  • Solution Design โ€” Cross-domain transfer and problem decomposition strategies
  • Paper Reading โ€” 3-level structured Q&A methodology for deep comprehension
  • Counterintuitive Rules โ€” Problem selection matters more than solution design; pursue new failure cases rather than incremental improvements

๐Ÿ† idea-tournament โ€” Competitive Idea Ranking & Proposal Generation

Bridges the gap between having a research direction and having a concrete, validated proposal:

  • Tree-Structured Generation โ€” Expand a seed idea into up to N_I=21 candidates by varying technique, domain, and formulation axes
  • Elo Tournament โ€” Pairwise comparisons on 4 dimensions (novelty, feasibility, relevance, clarity) with Swiss-system pairing
  • Direction Summarization โ€” Synthesize top-3 ideas into promising directions for evo-memory
  • Proposal Extension โ€” Extend the winning idea into a full research proposal (5 sections from paper + 1 practical extension)
  • Counterintuitive Rules โ€” Quantity before quality; the tournament finds surprises; top-3 not top-1

๐Ÿ“ paper-planning โ€” Research Paper Planning & Outline Generation

Guides pre-writing planning before a single word is drafted. Covers four key activities:

  • Story Design โ€” Reverse-engineer the narrative: task โ†’ challenge โ†’ insight โ†’ contribution โ†’ advantage
  • Experiment Planning โ€” Plan comparisons, ablations, and demo scenarios with structured checklists
  • Figure Design โ€” Pipeline figures that highlight novelty; teaser figures that hook reviewers
  • Timeline Management โ€” 4-week countdown schedule from outline to submission

Includes counterintuitive tactics: write your rejection letter first, narrow claims before broadening, and plan fallback narratives.

๐Ÿงช experiment-pipeline โ€” 4-Stage Experiment Execution

A structured framework for executing research experiments with attempt budgets and gate conditions:

  • Stage 1: Initial Implementation โ€” Get baseline code running and reproduce known results (โ‰ค20 attempts)
  • Stage 2: Hyperparameter Tuning โ€” Optimize configuration for your setup (โ‰ค12 attempts)
  • Stage 3: Proposed Method โ€” Implement and validate the novel method (โ‰ค12 attempts)
  • Stage 4: Ablation Study โ€” Prove each component's contribution (โ‰ค18 attempts)
  • Code Trajectory Logging โ€” Structured attempt logging that feeds into evo-memory
  • Counterintuitive Rules โ€” Initial implementation is not wasted time; budget limits prevent rabbit holes; failed attempts are data

Integrates with experiment-craft for failure diagnosis within stages and evo-memory for cross-cycle learning.

๐Ÿ”ง experiment-craft โ€” Experiment Debugging & Iteration

A systematic approach to experiment debugging, logging, and iterative improvement:

  • 5-Step Diagnostic Flow โ€” Collect failures โ†’ find a working version โ†’ bridge the gap โ†’ hypothesize โ†’ fix
  • Counterintuitive Rules โ€” Change one variable at a time; effective experiments beat more experiments
  • Experiment Logging โ€” 5-section structured log template for reproducible records
  • Handoff to Paper-Writing โ€” Feed validated results and logs into paper-writing for drafting

๐Ÿ”„ experiment-iterative-coder โ€” Iterative Code Refinement

Structured plan โ†’ code โ†’ evaluate โ†’ refine cycles for higher code quality:

  • Phase Decomposition โ€” Break complex tasks into 1-5 sequential phases
  • Iteration Loop โ€” Up to 3 iterations per phase (10 total): plan, code, run lint/tests, score, decide
  • Objective Evaluation โ€” ruff lint + pytest with dynamic score weighting and hard caps
  • Failure Mode Guidance โ€” Targeted responses for timeout, syntax, import, test, and lint failures

Integrates with experiment-craft for stuck diagnoses and evo-memory for loading prior strategies.

โœ๏ธ paper-writing โ€” Section-by-Section Paper Drafting

A proven 11-step workflow for writing academic papers with LaTeX templates:

  • Structured Process โ€” From pipeline sketch โ†’ story design โ†’ Method โ†’ Experiments โ†’ Related Work โ†’ Abstract โ†’ Title
  • Section Templates โ€” Three Abstract templates, four Introduction openers, Method module structure, Experiments organization
  • LaTeX Assets โ€” Annotated paper skeleton (paper-skeleton.tex) and booktabs table macros (table-style.tex)
  • Writing Principles โ€” One message per paragraph, topic sentence first, terminology consistency, reverse-outlining
  • Counterintuitive Tactics โ€” Underclaim in prose / overdeliver in evidence; lead with mechanism, not just metrics

๐Ÿ” paper-review โ€” Self-Review & Quality Assurance

Systematic self-review before submission using adversarial and counterintuitive review strategies:

  • 5-Aspect Checklist โ€” Contribution sufficiency, writing clarity, results quality, testing completeness, method design
  • Reverse-Outlining โ€” Extract the outline from finished paragraphs to verify logical flow
  • Figure & Table Quality Checks โ€” Captions, resolution, booktabs, color-blind friendliness
  • Rejection Simulation โ€” Force a reject summary first; attack your own novelty claim
  • Handoff to Rebuttal โ€” After review, feed identified weaknesses into paper-rebuttal for response preparation

๐Ÿ’ฌ paper-rebuttal โ€” Rebuttal Writing After Peer Review

Dedicated rebuttal skill for responding to reviewer feedback after peer review:

  • Score Diagnosis โ€” Color-code every reviewer comment: red (critical), orange (important), gray (minor), green (positive)
  • Champion Strategy โ€” Arm your most positive reviewer with evidence for the Area Chair discussion
  • Tactical Writing โ€” 18 rules for structure, content, and tone in rebuttal responses
  • Counterintuitive Principles โ€” Submit even with extreme scores; concede small points to win the big argument
  • Common Concerns โ€” Response strategies for 12 frequently raised reviewer complaints

๐ŸŽค academic-slides โ€” Presentation & Research Talk Creation

A structured approach to creating academic presentations and preparing research talks:

  • Narrative Arc โ€” Define scope, audience, and key takeaway before touching slides
  • Slide Design โ€” 10 design rules, visual hierarchy, one idea per slide, claim-style titles
  • Practical Creation โ€” .pptx file generation with color palettes, layout code, charts, and figures
  • Delivery & Q&A โ€” Rehearsal protocol, timing, and backup slide preparation
  • Counterintuitive Rules โ€” Slides are not your paper; enthusiasm beats polish; related work builds motivation, not citation counts

๐Ÿง  evo-memory โ€” Persistent Research Memory & Self-Evolution

The learning layer that accumulates knowledge across research cycles. Maintains two memory stores and implements three evolution mechanisms:

  • Ideation Memory (M_I) โ€” Tracks feasible and unsuccessful research directions across idea tournaments
  • Experimentation Memory (M_E) โ€” Stores reusable data processing and model training strategies (paper core), plus architecture and debugging (extensions)
  • IDE (Idea Direction Evolution) โ€” Extracts promising directions after idea tournaments
  • IVE (Idea Validation Evolution) โ€” Classifies experiment failures as implementation vs fundamental direction failures
  • ESE (Experiment Strategy Evolution) โ€” Distills reusable patterns from successful experiment pipelines

Read by idea-tournament and experiment-pipeline at cycle start; updated after each cycle completes.

๐Ÿ“š paper-navigator โ€” Academic Paper Discovery & Reading

End-to-end paper workflow in five stages โ€” from query to organized literature map:

  • Disambiguate โ€” Analyze user intent, resolve ambiguous terms (project names, module names) to actual paper titles
  • Discover โ€” 7 discovery paths: keyword search, citation traversal, recommendations, author tracking, arXiv monitoring, trending detection, GitHub search
  • Evaluate โ€” Quick assessment via TLDR, citations, code availability (HuggingFace + GitHub), and top models by task
  • Read โ€” Full-text retrieval via Jina Reader with 3-level reading strategy (Technical, Analytical, Contextual)
  • Organize โ€” Build novelty trees, challenge-insight trees, and generate structured literature reports (survey, quick scan, deep dive, baseline hunt)

Includes 12 Python scripts powered by Semantic Scholar, HuggingFace, GitHub, arXiv, and Jina Reader APIs.

๐ŸŒ nano-banana โ€” AI-Generated Slides & Illustrations

Generate professional presentation slides and high-quality illustrations using Gemini's image generation API, with an interactive browser-based review loop:

  • 7-Phase Workflow โ€” Content planning conversation โ†’ slides_plan.json โ†’ style selection & batch generation โ†’ browser review โ†’ feedback editing โ†’ PPTX packaging โ†’ cleanup
  • 3 Visual Styles โ€” Lineal Color (flat icons, educational), Gradient Glass (glassmorphism, premium), Vector Illustration (retro, approachable)
  • Interactive Review โ€” Local HTTP server with per-slide feedback; edits are applied without regenerating the entire deck
  • Multi-Model Support โ€” gemini-3-pro-image-preview (best quality), gemini-3.1-flash-image-preview (fast iteration), gemini-2.5-flash-image (rapid prototyping)
  • Counterintuitive Rules โ€” More planning = better slides; edit don't regenerate; never read generated images yourself (use the review server)

๐Ÿ”Back to top

๐ŸŽฏ แฏ“โžค Roadmap

Completed:

  • ๐Ÿง  Self-Evolution Suite โ€” idea-tournament, experiment-pipeline, evo-memory
  • ๐Ÿ“š Literature Survey โ€” Systematic literature search, filtering, and survey generation
  • ๐Ÿ”„ Iterative Coder โ€” Iterative code refinement with plan โ†’ code โ†’ evaluate โ†’ refine cycles
  • ๐ŸŽจ Visual Generation โ€” AI-generated slides & illustrations (nano-banana)

Coming soon:

  • ๐Ÿ… Math Olympiad โ€” Advanced mathematical reasoning & problem solving
  • ๐Ÿ”ฌ Paper Reproduction โ€” Read a paper, reproduce its core results, and verify claims
  • ๐Ÿ’ก Grant & Proposal Writing โ€” Research proposal drafting with funding agency conventions
  • ๐Ÿค– Peer Debate โ€” Multi-agent adversarial discussion to stress-test research ideas
  • ๐Ÿ“ˆ Trend Radar โ€” Analyze publication trends, identify emerging topics & research gaps
  • ๐Ÿ—ฃ๏ธ Paper QA โ€” Interactive question-answering over paper collections, extracting key findings & cross-referencing claims

Stay tuned โ€” more skills are on the way!

๐Ÿ”Back to top

๐ŸŒ Project Roles

Xi Zhang
EvoScientist
Xi Zhang
Xi Zhang
Yougang Lyu
Yougang Lyu
Dinos Papakostas
Dinos Papakostas
Yuyue Zhao
Yuyue Zhao

Xiaoyi DeepResearch Xiaoyi DeepResearch Team and the wider open-source community contribute to this project.

For any enquiries or collaboration opportunities, please contact: [email protected]

๐Ÿ”Back to top

๐Ÿค Contributing

We welcome contributions! See the guides for skills and MCP servers, or start with the Contributing Guidelines.

Every contribution brings us one step closer to a future where AI accelerates scientific breakthroughs for all of humanity.

๐Ÿ“ˆ Star History

Star History Chart

๐Ÿ”Back to top

๐Ÿ“ Citation

If you find our paper and code useful in your research and applications, please cite using this BibTeX:

@article{evoscientist2026, 
  title={EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery}, 
  author={Yougang Lyu and Xi Zhang and Xinhao Yi and Yuyue Zhao and Shuyu Guo and Wenxiang Hu and Jan Piotrowski and Jakub Kaliski and Jacopo Urbani and Zaiqiao Meng and Lun Zhou and Xiaohui Yan}, 
  journal={arXiv preprint arXiv:2603.08127}, 
  year={2026} 
}

๐Ÿ”Back to top

๐Ÿ“œ License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

๐Ÿ”Back to top

Reviews (0)

No results found