Sheppard Agency V3

AI research agent for Ollama
Search, scrape, extract, and synthesize knowledge — all on local hardware.

What It Is

Sheppard is an async Python chat agent backed by Ollama that can do two things:

Chat — normal conversation with persistent memory across sessions
Research — give it /learn <topic> and it will search, scrape, extract, and synthesize knowledge on that topic, storing everything in a structured memory system

The research pipeline works like this:

Searches via SearXNG to find relevant sources
Scrapes them via Firecrawl (local) with Playwright
Distills content into structured knowledge atoms — facts, claims, contradictions
Stores everything in Postgres + ChromaDB + Redis
Generates citable reports on demand

Benchmarked at 82.0/100 on research and memory tasks — i9-12900K, 32GB DDR5, RTX A4000.

Tech Stack

Component	Role
Ollama	Local LLM inference (rnj-1:8b default, configurable)
PostgreSQL	Structured storage — topics, sources, atoms, lineage
ChromaDB	Semantic vector search for RAG retrieval
Redis	Scraping queue, distributed locks, volatile state
Firecrawl (local)	Web scraper with built-in Playwright
SearXNG	Self-hosted search engine for discovery

Memory Design

Postgres  →  Canonical truth. Topics, sources, atoms, lineage.
ChromaDB  →  Semantic projections. Fast similarity search for RAG.
Redis     →  Operational state. Scraping queue, locks, caching.

ChromaDB is treated as a projection — it can be wiped and rebuilt from Postgres at any time.

Quick Start

Requirements

Python 3.10+
PostgreSQL 14+ (with btree_gin extension)
Redis 6.2+
Ollama (local or remote)
Firecrawl-Local
SearXNG

Setup

# 1. Clone and install
git clone https://github.com/B-A-M-N/Sheppard.git
cd Sheppard
pip install -r requirements.txt

# 2. Start supporting services
./start_research_stack.sh

# 3. Initialize database schema
python3 src/memory/setup_v3.py

# 4. Run the agent
python3 main.py

Commands

Command	Description
`/learn <topic>`	Start a background research mission
`/stop`	Stop an active mission
`/missions`	View mission status
`/nudge <instruction>`	Adjust research direction mid-mission
`/query <topic>`	Ask questions about previously learned topics
`/report <id>`	Generate a synthesis report from stored knowledge
`/status`	View system health and backlog

Architecture

Research Pipeline

  Discovery          Acquisition          Condensation         Storage            Synthesis
┌─────────────┐    ┌───────────────┐    ┌───────────────┐    ┌──────────┐    ┌───────────────┐
│  SearXNG    │───▶│  Firecrawl    │───▶│  LLM Extract  │───▶│  Postgres│───▶│  Report Gen   │
│  search     │    │  + Playwright │    │  atoms        │    │  Chroma  │    │  w/ citations │
└─────────────┘    └───────────────┘    └───────────────┘    └──────────┘    └───────────────┘
                                                                  ▲
                                                                  │
                                                              Redis (queue)

Discovery — SearXNG returns results across multiple pages
Acquisition — Firecrawl scrapes pages via Playwright; PDFs offloaded to slower workers
Condensation — LLM extracts structured atoms (facts, claims, tradeoffs) from raw content
Storage — Atoms stored in Postgres with full source lineage; embeddings indexed in ChromaDB
Synthesis — Evidence assembler pulls relevant atoms and generates citable reports

Project Structure

Sheppard/
├── main.py                    # Entry point — interactive chat loop
├── scout_worker.py            # Auxiliary scraping worker
├── start_research_stack.sh    # Service bootstrap
├── requirements.txt
├── src/
│   ├── core/
│   │   ├── sheppard/          # Chat agent, response generation, tool usage
│   │   ├── commands.py        # Slash command handler
│   │   ├── memory/            # Storage adapters (Postgres, Chroma, Redis)
│   │   └── system.py          # System initialization and lifecycle
│   ├── research/
│   │   ├── acquisition/       # Firecrawl client, search, crawling
│   │   ├── condensation/      # Knowledge distillation pipeline
│   │   ├── reasoning/         # Evidence assembly and report synthesis
│   │   ├── archivist/         # Research loop, chunking, indexing
│   │   └── config.py          # Research system configuration
│   ├── memory/
│   │   ├── manager.py         # Memory coordination (PG + Chroma)
│   │   ├── stores/            # Individual store adapters
│   │   └── schema_v3.sql      # Database schema
│   ├── llm/                   # Ollama client, model routing
│   └── config/                # Application settings
└── tests/

Design Principle

Postgres is Truth. Chroma is a projection. Redis is motion. Lineage is permanent.

Acknowledgments

Dallan Loomis — for the interactions and guidance that kept this project on track
My parents — for the support that made all of this possible
My son — the reason I build

License

MPL-2.0