Causal-Memory-Layer

mcp
Guvenlik Denetimi
Uyari
Health Uyari
  • License — License: MIT
  • Description — Repository has a description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Gecti
  • Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Gecti
  • Permissions — No dangerous permissions requested

Bu listing icin henuz AI raporu yok.

SUMMARY

CML (Causal Memory Layer) — a foundational memory layer for recording reasons, permissions, and responsibility behind actions, not just events or results. Enables systems in AI, fintech, security, and distributed computing to preserve meaning and causal accountability across time, independent of execution or transport.

README.md

Causal Memory Layer (CML)

CI
Package Validation
PyPI version
Python versions
Production PyPI Install Smoke Test
Status
Audit
License
Safety Eval

CML before-after causal audit visual

Why CML?

Logs show what happened. CML checks why it was allowed.

A workflow can pass every functional test and still be causally invalid: the action succeeded, but the approval, intent, or responsibility lineage is missing, ambiguous, or broken.

ordinary log:  action completed -> OK
CML audit:     parent_cause=approval-42 -> MISSING_PARENT

CML is an open-source causal audit layer for structured action traces, AI-agent workflows, high-trust automation, and reviewable safety infrastructure.

A system may be functionally correct while being causally invalid.

What problem this solves

Modern agent systems can execute tools, call APIs, write files, and send messages faster than humans can review every step.

Most logs tell you that an action happened; they do not prove that the action had a valid upstream approval, task, or responsibility path.

CML adds a small audit primitive for this gap: it checks structured action traces for missing parent causes, ambiguous roots, and broken causal lineage.

This is useful when an agent action succeeds operationally but should still be reviewed because its permission or responsibility chain is missing.

The goal is not to replace observability, policy engines, or security tooling; it is to make causal validity inspectable.

Star this repo if you care about auditable AI agents, deterministic oversight, causal traces, and open-source AI safety infrastructure.

Install

Install from PyPI:

pip install causal-memory-layer

Check the CLI:

cml --help

Install the experimental MCP extra:

pip install "causal-memory-layer[mcp]"
cml-mcp

Current production release:

pip install causal-memory-layer==0.4.0

30-second demo

Run the local API:

docker compose up --build

Then follow the Docker walkthrough:

docs/demo/DOCKER_CAUSAL_MEMORY_WALKTHROUGH.md

Expected example finding:

CML-AUDIT-R1-MISSING_PARENT

The action may look operationally valid, but CML asks whether its causal parent exists.

For an agent-workflow example, run the CrewAI-style causal audit demo:

python examples/crewai_style_causal_audit.py

See docs/integrations/CREWAI_STYLE_CAUSAL_AUDIT.md.

For an approval-lineage example, run:

python examples/agent_approval_lineage_audit.py

This demo shows how CML can flag a synthetic agent trace where a sensitive step lacks a valid policy or human approval ancestor.

Use CML when you need to audit

  • AI-agent tool calls and action chains.
  • Human approval handoffs.
  • Approval-lineage checks for high-trust agent actions.
  • Automation workflows with high-trust actions.
  • Fintech or review-heavy decision paths.
  • Structured traces where responsibility lineage matters.
  • Research benchmarks for causal validity in agentic systems.

Agent-audit MCP integration

CML can also run as an experimental MCP tool server for AI-agent audit workflows.

From PyPI:

pip install "causal-memory-layer[mcp]"
cml-mcp

For local development:

pip install -e ".[mcp]"
cml-mcp

See docs/integrations/MCP_AGENT_AUDIT.md for local MCP client setup and available tools.

For a short coding-assistant setup path, see docs/integrations/CURSOR_MCP_QUICKSTART.md.

How CML differs

System type Usually answers CML adds
Logs What happened? Was the action causally permitted?
Tracing Where did execution go? Did responsibility lineage survive the workflow?
Observability What failed operationally? What succeeded but had broken causal lineage?
Policy checks Is this allowed now? Why was this specific action allowed in this trace?
CML Why was this action allowed? Narrow audit primitive, not a full runtime safety stack.

Audit codes

CML findings are intentionally small and reviewable.

Code Meaning Why it matters
CML-AUDIT-R1-MISSING_PARENT A record points to a parent_cause that does not exist in the trace. The action may have succeeded, but its approval/task lineage is broken.
CML-AUDIT-R2-GAP_NOT_MARKED A record has no parent but is not clearly marked as an observed causal gap. Reviewers cannot tell whether the missing parent is intentional or accidental.
CML-AUDIT-R3-SECRET_NET_MISSING_CHAIN A network/send action follows secret access without a valid causal chain. Useful for reviewing high-risk data-flow and exfiltration-like patterns.
CML-AUDIT-R4-AMBIGUOUS_ROOT A root event label looks malformed or ambiguous. Root authority should be explicit, not guessed from a near-miss string.

These codes do not block execution or certify safety. They make causal-risk patterns visible for review.

See docs/audit/FINDINGS_GLOSSARY.md for more detail.

Fast validation

pip install -e ".[dev]"
pytest
python scripts/run_safety_eval.py

Dashboard:

https://safal207.github.io/Causal-Memory-Layer/

Review links

Current artifact

This repository already contains a working technical artifact, not only a concept.

Current components include:

  • Python causal validation and audit engine;
  • causal chain reconstruction utilities;
  • CLI commands for lineage validation and chain inspection;
  • API layer and store interface;
  • example logs and audit outputs;
  • CrewAI-style agent trace causal audit example;
  • approval-lineage causal audit example;
  • tests for chain logic, audit rules, approval-lineage behavior, and CTAG behavior;
  • API smoke tests for health, audit, and CTAG decode;
  • deterministic safety-eval benchmark with fixtures and tracked results;
  • documentation for vCML semantics and audit rules.

Key implementation entry points:

  • cml/audit.py
  • cml/chain.py
  • cli/main.py
  • api/server.py
  • examples/crewai_style_causal_audit.py
  • examples/agent_approval_lineage_audit.py
  • tests/test_audit.py
  • tests/test_agent_approval_lineage_demo.py
  • tests/test_api_smoke.py

Problem

Many systems record events, outputs, traces, and metrics, but do not validate the causal structure behind authorization and action.

That creates blind spots such as:

  • actions that appear valid but have no grounded parent cause;
  • ambiguous or malformed root authority;
  • actions that succeed operationally while losing approval lineage;
  • state transitions that cannot be tied back to intent, permission, and responsibility.

For agentic systems, this matters because output review alone can miss causally invalid action chains.

What CML does

CML checks whether a high-trust action or state transition was causally valid, not only whether it occurred.

It focuses on:

  • validating causal links between actions and prior authorization;
  • preserving responsibility lineage across multi-step workflows;
  • checking intent and permission continuity across transitions;
  • detecting suspicious or invalid lineage such as missing parents, malformed roots, or broken handoffs;
  • validating causal coherence from structured logs and traces.

Evidence snapshot

  • Production PyPI package: pip install causal-memory-layer
  • Production release: causal-memory-layer==0.4.0
  • Production PyPI install smoke test: pypi-install-smoke.yml
  • Grant evidence pack: docs/evidence/GRANT_EVIDENCE_CML_0.4.0.md
  • CrewAI-style integration example: examples/crewai_style_causal_audit.py
  • Approval-lineage demo: examples/agent_approval_lineage_audit.py
  • CrewAI outreach issue: https://github.com/crewAIInc/crewAI/issues/6063
  • Deterministic benchmark fixtures with expected audit findings: benchmarks/fixtures/
  • Current tracked benchmark result: 6/6 matched
  • Benchmark runner: python scripts/run_safety_eval.py
  • Tracked report: benchmarks/RESULTS.md
  • Reviewer-friendly benchmark interpretation: docs/evidence/BENCHMARK_EVIDENCE_SNAPSHOT.md
  • Larger-grant expansion path: docs/evidence/BENCHMARK_EXPANSION_PLAN_50K_100K.md
  • External validation protocol: docs/evidence/EXTERNAL_VALIDATION_PROTOCOL.md

External reproducibility evidence

External validation notes will be listed here as they are contributed.

Current external validation tasks:

  • Full local validation path: #77
  • MCP demo runner validation: #123

Validation note template:

docs/evidence/external_validation/TEMPLATE_MCP_DEMO_RUNNER.md

This section is intentionally evidence-first: it should only list external notes after they are contributed. It does not claim production safety, compliance readiness, enforcement behavior, or stable Cause Band semantics.

Repository map

  • cml/: core Python implementation
  • cli/: command-line tooling
  • api/: API and store layer
  • vcml/: vCML semantics, format, audit, and boundary docs
  • examples/: sample logs and reports
  • benchmarks/: deterministic benchmark fixtures and results
  • tests/: regression coverage
  • docs/: supporting docs for review, research, and deployment

Scope

CML does not claim to solve all AI safety, security, or compliance problems.

It contributes one focused primitive:

causal-validity checking for structured action traces

See docs/NON_CLAIMS.md for the full scope boundary.

Research direction

The strongest research direction for CML is causal validity checking for agentic oversight.

A useful framing is:

How can we detect actions that appear valid at the surface level but are causally invalid because authorization, approval, or responsibility lineage is missing, ambiguous, or broken?

Bottom line

A system may be functionally correct while being causally invalid.

CML exists to make that distinction inspectable.

Yorumlar (0)

Sonuc bulunamadi