A-Modular-Kingdom
Health Warn
- No license — Repository has no license file
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Community trust — 28 GitHub stars
Code Pass
- Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Pass
- Permissions — No dangerous permissions requested
No AI report is available for this listing yet.
Production-ready AI infrastructure: RAG with smart reindexing, persistent memory, browser automation, and MCP integration. Stop rebuilding tools for every AI project.
🏰 A-Modular-Kingdom
High-Performance MCP Foundation for RAG, Scoped Memory, and Agentic Tools
✨ New: Check out the Next-Gen Agent Harness →
🏛️ Project Status: Stable Foundation
A-Modular-Kingdom (AMK) is now maintained as a Stable Infrastructure Layer. While it remains a powerful standalone MCP server, it is optimized to serve as the "Engine" for modern agent harnesses.
If you are looking for a natural-language way to orchestrate multiple agents and generate complex teams, please visit:
👉 Harness: The Team-Architecture Factory
The Solution
A-Modular-Kingdom is the infrastructure layer you're missing:
# Start the MCP server
python src/agent/host.py
Now any agent (Claude Desktop, Harness, custom chatbots) gets instant access to:
- ✅ Hierarchical Scoped Memory (Global Rules, Project Context, Persona)
- ✅ Advanced V3 RAG (Hybrid Fusion + Cross-Encoder Reranking)
- ✅ 27+ Production Tools (Vision, Code Exec, Web Search, TTS/STT)
🏗️ Next-Gen Integration: Harness + Kingdom
A-Modular-Kingdom provides the "Batteries," and Harness provides the "Body."
By connecting AMK as an MCP server to a Harness-generated team, you get:
- Precision Retrieval: Use AMK's V3 RAG to feed huge codebases into Harness agents without context pollution.
- Durable Rules: Save global engineering standards in AMK's
global_rulesscope so every Harness team follows them. - Tool Isolation: Use AMK's local Python sandbox for code execution triggered by your Harness team.
Build your first Harness team now →
🏗️ Architecture
📑 Table of Contents
- ✨ Core Features
- 🚀 Quick Start
- 🛠️ Available Tools
- 📚 RAG System
- 🧠 Memory System
- 📦 Package Installation
- 🎯 Integration Examples
- 🤖 Example Applications
- 🤝 Contributing
✨ Core Features
- MCP Protocol - Standard interface for AI tool access
- 3 RAG Versions - Choose your retrieval strategy (FAISS, Qdrant, custom)
- Scoped Memory - Global rules, preferences, project-specific context
- 8+ Tools - Vision, code exec, browser, web search, TTS/STT, and more
- No Vendor Lock-in - Local Ollama models, open-source stack
- Production Ready - Smart reindexing, Unicode support, error handling
🚀 Quick Start
Prerequisites
# Required
Python 3.10+
Ollama (for embeddings: ollama pull embeddinggemma)
# Optional
UV package manager (faster than pip)
Installation
# Clone the repository
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom
# Install dependencies
uv sync
# or: pip install -e .
# Pull required Ollama model
ollama pull embeddinggemma
Start the MCP Server
# Start host.py MCP server
python src/agent/host.py
Connect Your Agent
Option 1: Claude Desktop
// Add to claude_desktop_config.json
{
"mcpServers": {
"a-modular-kingdom": {
"command": "python",
"args": ["/full/path/to/A-Modular-Kingdom/src/agent/host.py"]
}
}
}
Option 2: Interactive Client
# Use the included chat interface
python src/agent/main.py
Option 3: Custom Integration
# Connect via MCP in your own agent
from mcp import StdioServerParameters
server_params = StdioServerParameters(
command="python",
args=["/path/to/host.py"]
)
# Use with ToolCollection.from_mcp(server_params)
🛠️ Available Tools
The MCP server exposes these tools:
| Tool | Description | Use Case |
|---|---|---|
query_knowledge_base |
RAG search (v1/v2/v3) | "How does auth work in this codebase?" |
save_memory |
Scoped memory storage | Save global rules or project context |
search_memories |
Semantic memory search | Retrieve past decisions/preferences |
web_search |
DuckDuckGo search | Current events, latest docs |
code_execute |
Safe Python sandbox | Run code in isolated environment |
text_to_speech |
TTS (pyttsx3/kokoro) | Generate audio from text |
speech_to_text |
Whisper STT | Transcribe audio files |
📚 RAG System
Three implementations with different trade-offs:
V1 - Simple & Fast
- Stack: FAISS + BM25
- Speed: <1s
- Use Case: Small projects, quick prototypes
V2 - Production (Recommended)
- Stack: Qdrant + BM25 + CrossEncoder reranking
- Speed: <1s with smart caching
- Use Case: Production apps, large codebases
- Features: Smart reindexing, cloud-ready
V3 - Advanced (Highest Accuracy)
- Stack: Qdrant + BM25 + RRF fusion + CrossEncoder reranking
- Speed: <1s (cached), 6.7s (cold start)
- Use Case: Maximum accuracy, complex queries
- Features: Contextual retrieval, hybrid fusion
Benchmark Results (LLM-as-Judge)
| Metric | V2 | V3 |
|---|---|---|
| Groundedness | 100% | 100% |
| Relevance | 80-98% | 78-88% |
| Completeness | 75-95% | 75-98% |
| Average | 84-98% | 84-88% |
Evaluated with curated queries on Napoleon.pdf and RAG documentation. Judge: Gemini 2.5 Flash. Results vary based on indexed content.
Usage:
# Via MCP tool
query_knowledge_base(
query="How does authentication work?",
version="v2", # or "v1", "v3"
doc_path="./src" # optional
)
Supported Files: .py, .md, .txt, .pdf, .ipynb, .js, .ts
🧠 Memory System
Hierarchical scoped memory with automatic categorization:
Memory Scopes
| Scope | Persistence | Use Case |
|---|---|---|
| Global Rules | Forever, all projects | "Always use type hints" |
| Global Preferences | Forever, all projects | "Prefer dark mode" |
| Global Personas | Forever, all projects | Reusable agent personalities |
| Project Context | Current project | Architecture decisions, tech stack |
| Project Sessions | Temporary | Current task, recent changes |
Usage
# Save with explicit scope
save_memory(content="Always validate user input", scope="global_rules")
# Or use prefix shortcuts
save_memory(content="#global:rule:Never use eval()")
save_memory(content="#project:context:Uses FastAPI backend")
# Auto-inference from keywords
save_memory(content="User prefers Python 3.12") # → global_preferences
# Search with priority (global → project)
search_memories(query="coding standards", top_k=5)
Storage: ~/.modular_kingdom/memories/ (global) + project-specific folders
📦 Package Installation
Coming soon:
pip install rag-mem(PyPI release in progress)
Install from source in the meantime:
cd packages/memory-mcp
pip install -e .
🎯 Integration Examples
Claude Desktop
Already using Claude Code? Add A-Modular-Kingdom tools:
{
"mcpServers": {
"a-modular-kingdom": {
"command": "python",
"args": ["/path/to/src/agent/host.py"]
}
}
}
Now Claude has access to your codebase RAG, persistent memory, and all tools.
Gemini CLI
// gemini-extension.json
{
"mcpServers": {
"unified_knowledge_agent": {
"command": "python",
"args": ["/path/to/src/agent/host.py"]
}
}
}
Custom Agent
from smolagents import ToolCallingAgent, ToolCollection
from mcp import StdioServerParameters
# Connect to MCP server
params = StdioServerParameters(
command="python",
args=["/path/to/host.py"]
)
with ToolCollection.from_mcp(params) as tools:
agent = ToolCallingAgent(tools=list(tools.tools))
result = agent.run("Search the codebase for auth logic")
🤖 Example Applications
This repository includes example multi-agent systems built on the foundation:
Council Chamber (Hierarchical)
- 3-tier agent hierarchy (Queen → Teacher → Code Agent)
- Validation loops and task delegation
- Uses ACP SDK + smolagents
- Location:
multiagents/council_chamber/
Gym (Sequential)
- Fitness planning workflow (Interview → Plan → Nutrition)
- CrewAI-powered coordination
- Web interface included
- Location:
multiagents/gym/
Note: These are demonstration applications, not the core product. The foundation (host.py) is the main offering.
🏗️ Architecture
┌─────────────────────────────────────┐
│ Your AI Application │
│ (Agents, Chatbots, Workflows) │
└────────────┬────────────────────────┘
│ MCP Protocol
┌────────────▼────────────────────────┐
│ A-Modular-Kingdom │
│ ┌─────────┐ ┌─────────┐ ┌────────┐│
│ │ RAG │ │ Memory │ │ Tools ││
│ │ V1/V2/V3│ │ Scoped │ │ 8+ ││
│ └─────────┘ └─────────┘ └────────┘│
│ host.py (MCP Server) │
└─────────────────────────────────────┘
🧪 Testing & Performance
Run Tests
# Run all tests
pytest tests/ -v
# Run specific test suites
pytest tests/test_rag_v2.py -v
pytest tests/test_rag_v3.py -v
pytest tests/test_memory_global.py -v
# Run benchmarks
python tests/benchmark_rag.py
Performance
Benchmark Results (GPU/CUDA):
| Version | Docs | Cold Start | Warm Query |
|---|---|---|---|
| V2 | 100 | 26.8s | 0.31s |
| V3 | 100 | 13.9s | 0.02s (15x faster!) |
Key Features:
- ✅ GPU acceleration (CUDA) for embeddings and reranking
- ✅ Smart caching (warm queries <0.5s)
- ✅ Tested with .py, .md, .txt, .ipynb files
- ✅ Global memory access from any directory
See detailed benchmarks: docs/RAG_PERFORMANCE.md
Docker Testing
Package verified to work in isolation:
docker build -f Dockerfile.test -t rag-mem-test .
docker run --rm rag-mem-test
🤝 Contributing
Contributions welcome! Focus areas:
- Additional RAG strategies - New retrieval techniques
- New tool integrations - Expand MCP tool offerings
- Performance optimizations - Speed improvements
- Documentation improvements - Tutorials, examples
Development Setup
# Fork and clone
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom
# Create branch
git checkout -b feature/your-feature
# Install dev dependencies
uv sync
# Make changes and test
pytest tests/
# Commit with descriptive message
git commit -m "feat: add new tool"
# Push and create PR
git push origin feature/your-feature
📜 License
MIT License - See LICENSE for details
Links
- Medium Article: https://medium.com/@masihmoafi12/a-modular-kingdom-fcaa69a6c1f0
- Demo Video: https://www.youtube.com/watch?v=hWoQnAr6R_E
- PyPI Package: rag-mem
A-Modular-Kingdom: The infrastructure layer AI agents deserve 🏰
Reviews (0)
Sign in to leave a review.
Leave a reviewNo results found