A-Modular-Kingdom

mcp
Guvenlik Denetimi
Uyari
Health Uyari
  • No license — Repository has no license file
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Community trust — 28 GitHub stars
Code Gecti
  • Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Gecti
  • Permissions — No dangerous permissions requested

Bu listing icin henuz AI raporu yok.

SUMMARY

Production-ready AI infrastructure: RAG with smart reindexing, persistent memory, browser automation, and MCP integration. Stop rebuilding tools for every AI project.

README.md

🏰 A-Modular-Kingdom

High-Performance MCP Foundation for RAG, Scoped Memory, and Agentic Tools

✨ New: Check out the Next-Gen Agent Harness →


🏛️ Project Status: Stable Foundation

A-Modular-Kingdom (AMK) is now maintained as a Stable Infrastructure Layer. While it remains a powerful standalone MCP server, it is optimized to serve as the "Engine" for modern agent harnesses.

If you are looking for a natural-language way to orchestrate multiple agents and generate complex teams, please visit:
👉 Harness: The Team-Architecture Factory


The Solution

A-Modular-Kingdom is the infrastructure layer you're missing:

# Start the MCP server
python src/agent/host.py

Now any agent (Claude Desktop, Harness, custom chatbots) gets instant access to:

  • Hierarchical Scoped Memory (Global Rules, Project Context, Persona)
  • Advanced V3 RAG (Hybrid Fusion + Cross-Encoder Reranking)
  • 27+ Production Tools (Vision, Code Exec, Web Search, TTS/STT)

🏗️ Next-Gen Integration: Harness + Kingdom

A-Modular-Kingdom provides the "Batteries," and Harness provides the "Body."

By connecting AMK as an MCP server to a Harness-generated team, you get:

  1. Precision Retrieval: Use AMK's V3 RAG to feed huge codebases into Harness agents without context pollution.
  2. Durable Rules: Save global engineering standards in AMK's global_rules scope so every Harness team follows them.
  3. Tool Isolation: Use AMK's local Python sandbox for code execution triggered by your Harness team.

Build your first Harness team now →


🏗️ Architecture

architecture

📑 Table of Contents


✨ Core Features

  • MCP Protocol - Standard interface for AI tool access
  • 3 RAG Versions - Choose your retrieval strategy (FAISS, Qdrant, custom)
  • Scoped Memory - Global rules, preferences, project-specific context
  • 8+ Tools - Vision, code exec, browser, web search, TTS/STT, and more
  • No Vendor Lock-in - Local Ollama models, open-source stack
  • Production Ready - Smart reindexing, Unicode support, error handling

🚀 Quick Start

Prerequisites

# Required
Python 3.10+
Ollama (for embeddings: ollama pull embeddinggemma)

# Optional
UV package manager (faster than pip)

Installation

# Clone the repository
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom

# Install dependencies
uv sync
# or: pip install -e .

# Pull required Ollama model
ollama pull embeddinggemma

Start the MCP Server

# Start host.py MCP server
python src/agent/host.py

Connect Your Agent

Option 1: Claude Desktop

// Add to claude_desktop_config.json
{
  "mcpServers": {
    "a-modular-kingdom": {
      "command": "python",
      "args": ["/full/path/to/A-Modular-Kingdom/src/agent/host.py"]
    }
  }
}

Option 2: Interactive Client

# Use the included chat interface
python src/agent/main.py

Option 3: Custom Integration

# Connect via MCP in your own agent
from mcp import StdioServerParameters

server_params = StdioServerParameters(
    command="python",
    args=["/path/to/host.py"]
)
# Use with ToolCollection.from_mcp(server_params)

🛠️ Available Tools

The MCP server exposes these tools:

Tool Description Use Case
query_knowledge_base RAG search (v1/v2/v3) "How does auth work in this codebase?"
save_memory Scoped memory storage Save global rules or project context
search_memories Semantic memory search Retrieve past decisions/preferences
web_search DuckDuckGo search Current events, latest docs
code_execute Safe Python sandbox Run code in isolated environment
text_to_speech TTS (pyttsx3/kokoro) Generate audio from text
speech_to_text Whisper STT Transcribe audio files

📚 RAG System

Three implementations with different trade-offs:

V1 - Simple & Fast

  • Stack: FAISS + BM25
  • Speed: <1s
  • Use Case: Small projects, quick prototypes

V2 - Production (Recommended)

  • Stack: Qdrant + BM25 + CrossEncoder reranking
  • Speed: <1s with smart caching
  • Use Case: Production apps, large codebases
  • Features: Smart reindexing, cloud-ready

V3 - Advanced (Highest Accuracy)

  • Stack: Qdrant + BM25 + RRF fusion + CrossEncoder reranking
  • Speed: <1s (cached), 6.7s (cold start)
  • Use Case: Maximum accuracy, complex queries
  • Features: Contextual retrieval, hybrid fusion

Benchmark Results (LLM-as-Judge)

Metric V2 V3
Groundedness 100% 100%
Relevance 80-98% 78-88%
Completeness 75-95% 75-98%
Average 84-98% 84-88%

Evaluated with curated queries on Napoleon.pdf and RAG documentation. Judge: Gemini 2.5 Flash. Results vary based on indexed content.

Usage:

# Via MCP tool
query_knowledge_base(
    query="How does authentication work?",
    version="v2",  # or "v1", "v3"
    doc_path="./src"  # optional
)

Supported Files: .py, .md, .txt, .pdf, .ipynb, .js, .ts


🧠 Memory System

Hierarchical scoped memory with automatic categorization:

Memory Scopes

Scope Persistence Use Case
Global Rules Forever, all projects "Always use type hints"
Global Preferences Forever, all projects "Prefer dark mode"
Global Personas Forever, all projects Reusable agent personalities
Project Context Current project Architecture decisions, tech stack
Project Sessions Temporary Current task, recent changes

Usage

# Save with explicit scope
save_memory(content="Always validate user input", scope="global_rules")

# Or use prefix shortcuts
save_memory(content="#global:rule:Never use eval()")
save_memory(content="#project:context:Uses FastAPI backend")

# Auto-inference from keywords
save_memory(content="User prefers Python 3.12")  # → global_preferences

# Search with priority (global → project)
search_memories(query="coding standards", top_k=5)

Storage: ~/.modular_kingdom/memories/ (global) + project-specific folders


📦 Package Installation

Coming soon: pip install rag-mem (PyPI release in progress)

Install from source in the meantime:

cd packages/memory-mcp
pip install -e .

🎯 Integration Examples

Claude Desktop

Already using Claude Code? Add A-Modular-Kingdom tools:

{
  "mcpServers": {
    "a-modular-kingdom": {
      "command": "python",
      "args": ["/path/to/src/agent/host.py"]
    }
  }
}

Now Claude has access to your codebase RAG, persistent memory, and all tools.

Gemini CLI

// gemini-extension.json
{
  "mcpServers": {
    "unified_knowledge_agent": {
      "command": "python",
      "args": ["/path/to/src/agent/host.py"]
    }
  }
}

Custom Agent

from smolagents import ToolCallingAgent, ToolCollection
from mcp import StdioServerParameters

# Connect to MCP server
params = StdioServerParameters(
    command="python",
    args=["/path/to/host.py"]
)

with ToolCollection.from_mcp(params) as tools:
    agent = ToolCallingAgent(tools=list(tools.tools))
    result = agent.run("Search the codebase for auth logic")

🤖 Example Applications

This repository includes example multi-agent systems built on the foundation:

Council Chamber (Hierarchical)

  • 3-tier agent hierarchy (Queen → Teacher → Code Agent)
  • Validation loops and task delegation
  • Uses ACP SDK + smolagents
  • Location: multiagents/council_chamber/

Gym (Sequential)

  • Fitness planning workflow (Interview → Plan → Nutrition)
  • CrewAI-powered coordination
  • Web interface included
  • Location: multiagents/gym/

Note: These are demonstration applications, not the core product. The foundation (host.py) is the main offering.


🏗️ Architecture

┌─────────────────────────────────────┐
│     Your AI Application             │
│  (Agents, Chatbots, Workflows)      │
└────────────┬────────────────────────┘
             │ MCP Protocol
┌────────────▼────────────────────────┐
│      A-Modular-Kingdom              │
│  ┌─────────┐ ┌─────────┐ ┌────────┐│
│  │   RAG   │ │ Memory  │ │ Tools  ││
│  │ V1/V2/V3│ │ Scoped  │ │ 8+     ││
│  └─────────┘ └─────────┘ └────────┘│
│           host.py (MCP Server)      │
└─────────────────────────────────────┘

🧪 Testing & Performance

Run Tests

# Run all tests
pytest tests/ -v

# Run specific test suites
pytest tests/test_rag_v2.py -v
pytest tests/test_rag_v3.py -v
pytest tests/test_memory_global.py -v

# Run benchmarks
python tests/benchmark_rag.py

Performance

Benchmark Results (GPU/CUDA):

Version Docs Cold Start Warm Query
V2 100 26.8s 0.31s
V3 100 13.9s 0.02s (15x faster!)

Key Features:

  • ✅ GPU acceleration (CUDA) for embeddings and reranking
  • ✅ Smart caching (warm queries <0.5s)
  • ✅ Tested with .py, .md, .txt, .ipynb files
  • ✅ Global memory access from any directory

See detailed benchmarks: docs/RAG_PERFORMANCE.md

Docker Testing

Package verified to work in isolation:

docker build -f Dockerfile.test -t rag-mem-test .
docker run --rm rag-mem-test

🤝 Contributing

Contributions welcome! Focus areas:

  1. Additional RAG strategies - New retrieval techniques
  2. New tool integrations - Expand MCP tool offerings
  3. Performance optimizations - Speed improvements
  4. Documentation improvements - Tutorials, examples

Development Setup

# Fork and clone
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom

# Create branch
git checkout -b feature/your-feature

# Install dev dependencies
uv sync

# Make changes and test
pytest tests/

# Commit with descriptive message
git commit -m "feat: add new tool"

# Push and create PR
git push origin feature/your-feature

📜 License

MIT License - See LICENSE for details


Links


A-Modular-Kingdom: The infrastructure layer AI agents deserve 🏰

Yorumlar (0)

Sonuc bulunamadi