behest
Health Pass
- License — License: Apache-2.0
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Community trust — 11 GitHub stars
Code Pass
- Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Pass
- Permissions — No dangerous permissions requested
No AI report is available for this listing yet.
Build production AI agent runtimes in Rust with typed tools, provider-neutral LLM adapters, streaming, storage, and observability.
What this is
behest provides provider-neutral contracts for chat, streaming, tool calling, embeddings, runtime execution, storage, queues, RAG, and observability.
It is designed for systems that need explicit control over model providers, tool execution, persistence, and operational boundaries — instead of opaque "agent framework" magic.
Status: early foundation crate. Public APIs are intentionally compact, strongly typed, and documented.
Why behest
behest /bɪˈhest/ — n. a person's orders or command.
At the behest of the user, the agent acts.
The core of an agent runtime is not "autonomous consciousness" but controlled delegation: the user issues an intent, and the system composes context, invokes models, executes tools, persists state, publishes events within explicit boundaries — auditable, recoverable, constrainable, and replaceable.
The name behest deliberately avoids inflated metaphors like "brain / cognition / intelligence". It only states an engineering fact:
tool-calling, streaming, memory, queue, RAG, snapshot — all mechanisms exist because someone gave an order.
Design goals
- Rust-native first: typed APIs, explicit errors, no hidden runtime assumptions.
- Provider-neutral core: OpenAI, Anthropic, local models, proxies, or internal providers can implement the same contracts.
- Streaming-first runtime: the agent loop is designed around streamed model events, with non-streaming fallback where appropriate.
- Typed tool boundary: tools are described by JSON Schema and executed through explicit registries.
- Pluggable persistence: memory by default, external stores behind feature flags.
- Operational surface: event publishing, snapshots, session gates, compaction, retry policy, and observability hooks.
- Small public API: foundation primitives over framework sprawl.
What's inside
| Area | Capability |
|---|---|
| Provider contracts | ChatProvider, EmbeddingProvider, request / response models, stream events, provider capabilities |
| Provider registry | In-memory routing for chat and embedding providers |
| Chat model types | messages, content parts, tool calls, response formats, token usage, finish reasons |
| Tool runtime | Tool, FunctionTool, ExternalTool, ToolRegistry, schema generation, execution dispatch |
| Agent runtime | context building, model calls, tool loop, session persistence, event emission |
| Managed runtime | ManagedRuntime unified container, coordinated lifecycle, typed component access, hot-reload |
| Hot-swap reload | drain-aware component replacement with pre/post hooks |
| Drain helper | DrainGuard<T> reference-counted guard for tracking outstanding Arc references |
| Health aggregation | HealthStatus::aggregate, healthz_response, readiness gates |
| Runtime invocation | RuntimeInvocation, EmitRequest, EventKind, Control, transport-neutral emit/on facade |
| Runtime stream | RuntimeEventStore, RuntimeStreamAdapter, RuntimeSubscriptionHub, replay + live fanout |
| Runtime safety | session gate, runtime policy, input admission, doom-loop detection, tool output truncation |
| Storage | memory stores, Redis, SQLx, MongoDB, object storage, Qdrant embeddings |
| Context and RAG | context adapters, static/function adapters, optional RAG adapter |
| Queues | optional event publishing through NATS or Redis Streams |
| Configuration | builder, file-based config, environment variable loading, secret indirection |
| Observability | tracing and optional OpenTelemetry integration |
Quick start
[dependencies]
behest = "0.4"
Create a provider-neutral chat request:
use behest::prelude::*;
let request = ChatRequest::new(ModelName::new("example-model"))
.with_message(Message::system_text("You are concise."))
.with_user_text("Summarize this project in one sentence.");
Register providers in a registry and route requests:
use behest::prelude::*;
let registry = ProviderRegistry::new();
let provider_id = ProviderId::new("my-provider");
// Register a ChatProvider implementation first.
// registry.register_chat(my_provider);
// Then route through the neutral registry.
// let response = registry.complete(&provider_id, request).await?;
More examples in examples/.
Implement a custom provider
behest does not force one vendor SDK into the core. Implement ChatProvider for any model backend, gateway, local inference service, or internal provider.
use async_trait::async_trait;
use behest::prelude::*;
struct EchoProvider {
id: ProviderId,
}
#[async_trait]
impl ChatProvider for EchoProvider {
fn id(&self) -> ProviderId {
self.id.clone()
}
fn capabilities(&self) -> ProviderCapabilities {
ProviderCapabilities::chat()
}
async fn complete(&self, request: ChatRequest) -> ProviderResult<ChatResponse> {
Ok(ChatResponse {
provider: self.id.clone(),
model: request.model,
message: Message::assistant_text("echo"),
finish_reason: FinishReason::Stop,
usage: None,
raw: None,
})
}
}
Streaming providers can override stream.
Define and execute tools
Tools are explicit runtime objects. Each tool exposes a stable name, a human-readable description, and a JSON Schema argument contract.
use behest::prelude::*;
use serde_json::{json, Value};
let tool = FunctionTool::new(
"echo",
"Echoes the input message.",
json!({
"type": "object",
"properties": {
"message": { "type": "string" }
},
"required": ["message"]
}),
|args: Value| async move {
Ok(args.get("message").cloned().unwrap_or(Value::Null))
},
)
.read_only()
.concurrency_safe();
let registry = ToolRegistry::new();
registry.register(tool);
Tool calls returned by a provider can be executed through the registry:
use behest::prelude::*;
use serde_json::json;
let call = ToolCall::new("call_1", "echo", json!({ "message": "hello" }));
let output = registry.execute(&call).await?;
Runtime model
At the runtime layer, AgentRuntime orchestrates the full agent loop, while ManagedRuntime provides a unified container for production deployments:
use behest::prelude::*;
let config = AgentConfig::builder()
.with_file("behest.toml")?
.with_env("BEHEST")?
.build()?;
// One-call construction of a fully configured ManagedRuntime.
let managed = config.build_managed().await?;
// Lifecycle: init → start → serve → stop
managed.init_all().await?;
managed.start_all().await?;
managed.serve().await?; // blocks until shutdown signal
managed.stop_all().await?;
The runtime loop:
RunRequest
-> load or create session
-> admit input
-> build context
-> call model provider
-> stream / persist assistant output
-> execute tool calls
-> append tool results
-> repeat until completion, limit, or error
-> emit AgentEvent values
The runtime brings together:
ProviderRegistryContextPipelineToolRuntimeRuntimeStoreRuntimePolicyCompactionServiceSessionGate- optional event publisher
- optional snapshot store
- optional background job pool
Configuration
AgentConfig supports layered configuration:
- defaults
- file sources
- environment variables
- manual builder setters
use behest::prelude::*;
let config = AgentConfig::builder()
.with_file("behest.toml")?
.with_env("BEHEST")?
.build()?;
let runtime = config.into_runtime().await?;
Secrets can be loaded through env:VAR_NAME indirection:
[providers.openai]
api_key = "env:OPENAI_API_KEY"
See behest.toml example for full configuration structure.
Provider adapters
Concrete provider adapters are feature-gated.
| Feature | Adapter | Chat | Stream | Embeddings | Tools |
|---|---|---|---|---|---|
openai |
OpenAiChatAdapter, OpenAiEmbeddingAdapter |
yes | yes | yes | yes |
anthropic |
AnthropicChatAdapter |
yes | yes | no | yes |
Enable adapters:
[dependencies]
behest = { version = "0.4", features = ["openai", "anthropic"] }
Feature flags
Click to expand full feature listDefault:
| Feature | Description |
|---|---|
tls-rustls |
Default TLS stack using rustls |
Provider adapters:
| Feature | Description |
|---|---|
openai |
OpenAI-compatible chat and embedding adapters |
anthropic |
Anthropic-compatible chat adapter |
TLS:
| Feature | Description |
|---|---|
tls-rustls |
Enable rustls TLS integration for HTTP / enabled backends |
tls-native |
Enable native TLS integration for HTTP / enabled backends |
Storage:
| Feature | Description |
|---|---|
redis |
Redis-backed store support and Redis Streams primitives |
redis-cluster |
Redis Cluster support; implies redis |
sqlx-postgres |
SQLx PostgreSQL store support |
sqlx-mysql |
SQLx MySQL store support |
sqlx-sqlite |
SQLx SQLite store support |
mongodb |
MongoDB session store support |
object_store |
Object storage support, including AWS S3 |
storage-all |
Redis, PostgreSQL, MySQL, SQLite, and MongoDB storage features |
RAG:
| Feature | Description |
|---|---|
rag |
Core RAG context adapter |
qdrant |
Qdrant embedding store backend |
tantivy |
Tantivy backend support |
rag-all |
Enables rag, qdrant, and tantivy |
Queues:
| Feature | Description |
|---|---|
queue |
Core event publisher traits |
nats |
NATS event publisher |
queue-all |
Enables queue, nats, and redis |
Observability:
| Feature | Description |
|---|---|
otel |
OpenTelemetry tracing integration |
Convenience profile:
| Feature | Description |
|---|---|
full |
Opinionated full runtime profile: OpenAI, Anthropic, Redis, Redis Cluster, NATS, PostgreSQL, MongoDB, OpenTelemetry, all RAG backends, all queue backends, and object storage. It intentionally does not enable sqlx-mysql or sqlx-sqlite. |
Example with selected features:
[dependencies]
behest = {
version = "0.4",
default-features = false,
features = ["tls-rustls", "openai", "anthropic", "redis", "queue", "nats"]
}
Error model
behest exposes typed error categories instead of stringly framework failures:
ProviderErrorToolErrorStorageErrorContextErrorRuntimeError- top-level
Error - crate-level
Result<T>
Provider errors distinguish unsupported capabilities, retryable failures, transport failures, invalid responses, and adapter-specific errors.
Tool errors distinguish missing tools, invalid arguments, execution failures, timeouts, and unimplemented external tools.
Lint policy
The crate is intentionally strict:
unsafe_code = "forbid"missing_docs = "deny"unreachable_pub = "deny"clippy::all = "deny"dbg_macro = "deny"expect_used = "deny"todo = "deny"unimplemented = "deny"unwrap_used = "deny"
This project treats public API clarity and failure-path hygiene as part of the runtime contract.
Development
# Format
cargo fmt --all --check
# Check all targets and features
cargo check --all-targets --all-features --locked
# Lint
cargo clippy --all-targets --all-features --locked -- -D warnings
# Test
cargo test --all-features --locked
# Build documentation
RUSTDOCFLAGS="-D warnings" cargo doc --all-features --no-deps --locked
Run the complete local verification set:
cargo fmt --all --check && \
cargo check --all-targets --all-features --locked && \
cargo clippy --all-targets --all-features --locked -- -D warnings && \
cargo test --all-features --locked && \
RUSTDOCFLAGS="-D warnings" cargo doc --all-features --no-deps --locked
License
Licensed under either of:
- Apache License, Version 2.0 (LICENSE-APACHE)
- MIT license (LICENSE-MIT)
at your option.
Reviews (0)
Sign in to leave a review.
Leave a reviewNo results found