Cognitive Context Layer

The system that decides
what your AI sees.

The context engineering layer for AI agentsOne local binary on both sides of the model. LeanCTX perceives, compresses, remembers, routes, and governs the complete lifecycle of AI context, from file reads to verified outputs.

Get Started Benchmark

Cognitive Context Layer

What is a Cognitive Context Layer?

A Cognitive Context Layer is the infrastructure between your AI tools and your codebase. It controls what files are read, how output is compressed, what knowledge persists across sessions, and whether results meet quality standards before delivery.

AI Agent

LeanCTX Cognitive Context Layer

I/O Intelligence Memory Verify

Your Code & Tools

System Blueprint

The construction drawing

The process topology of the shipped binary: seven entry points, one runtime, one set of local stores. Every box below maps to a real module, port or file on disk.

AGENT CLIENTS

CursorClaude CodeCopilotWindsurfCodexGemini CLI30+ agents

SURFACES — ENTRY POINTS

01 MCP server (stdio) lean-ctx

02 MCP server (HTTP) lean-ctx serve

03 IPC daemon lean-ctx serve --daemon

04 Shell hook lean-ctx -c "<cmd>"

05 API proxy lean-ctx proxy start

06 Web dashboard lean-ctx dashboard

07 Terminal UI lean-ctx watch · gain --live

CORE RUNTIME — SINGLE RUST BINARY

Tool dispatch 81 MCP tools · lazy tool set (5 unified)

Pattern engine 95+ shell patterns · git, cargo, npm, docker, kubectl, …

Read pipeline 10 read modes · auto selection per request

AST layer tree-sitter · 26 languages · regex fallback

Compression entropy · attention U-curve · TF-IDF · query-IB

Adaptive learning 7 online-learning layers · efficacy reporting

LOCAL STORES — XDG DIRS

session cacheBM25 indexproperty graphknowledge (SQLite)savings ledger (Ed25519)config.toml

EXTERNAL PROVIDERS

GitHubGitLabJira (+ OAuth)PostgresMCP bridgeConfig REST (Linear, …)

BM25 chunksgraph edgesknowledge factscache entries

One binary, seven entry points. Surfaces share the same runtime and the same local stores — provider data is consolidated into the identical indexes that file reads use.

Data Flow

What happens to a single read

Sheet 2 traces one request through the runtime, stage by stage, including the cache short-circuit that makes repeated reads nearly free. The shell path runs in parallel with the same accounting.

READ PATH ctx_read(path, mode) · lean-ctx read

PathJail core/pathjail.rs

Canonicalises the path and rejects escapes outside the workspace root before any I/O happens.
Session cache hit → ~13 tokens

Content-addressed lookup keyed by path + mtime/hash. Unchanged files collapse to a stub instead of re-sending content.
AST extraction 26 languages

tree-sitter parses the file into a syntax tree: signatures, imports, call edges — Lua, Luau, Kotlin and GDScript are graph-indexed too. Regex fallback for unsupported languages.
Mode selection 10 modes

auto picks the optimal of 10 read modes (full, map, signatures, diff, task, reference, aggressive, entropy, lines:N-M) from task intent and file size; structure_first biases cold medium-file code reads toward map, and a file flagged suspect on a fix task is forced to full.
Compression adaptive thresholds

Shannon-entropy line filtering, U-curve attention placement (LITM), TF-IDF codebook and query-conditioned Information-Bottleneck fusion — an anti-inflation guard ships the file verbatim whenever framing would cost more tokens than the raw bytes.
Token accounting core/tokens.rs

Exact tiktoken counts (o200k_base; cl100k_base approximation for Claude-family models) on input and output.
Ledger + stats savings sign / verify-batch

Savings are appended to the local ledger (Ed25519-signable), stats and the gain score update, the result streams back.

SHELL PATH lean-ctx -c "cargo test" · IDE bash hook

H1 Allowlist deny-by-default policy

H2 Execute real command, real exit code

H3 Pattern engine 95+ handcrafted patterns

H4 Compressed stdout errors always survive

Both paths end in the same ledger: every compression event is counted with exact tokenizer math and feeds gain, the dashboard and the signed savings ledger.

Specifications

Engineering data sheet

The reference tables behind the drawings: every surface with its transport and lifecycle, the on-disk layout, the adaptive-learning layers, and the security boundaries the runtime enforces.

AProcess model

All surfaces are the same binary in different roles. Nothing requires a cloud connection; everything binds local-first.

REF	SURFACE	TRANSPORT	ENDPOINT	LIFECYCLE	COMMAND
01	MCP server (stdio)	JSON-RPC over stdin/stdout	spawned per editor session	child process of the editor	`lean-ctx`
02	MCP server (HTTP)	MCP Streamable HTTP	localhost, configurable --host/--port	foreground or service	`lean-ctx serve`
03	IPC daemon	Unix Domain Socket	OS data dir, e.g. ~/Library/Application Support/lean-ctx/daemon.sock	launchd / systemd autostart	`lean-ctx serve --daemon`
04	Shell hook	process exec, compressed stdout	wraps IDE bash calls + interactive shells	per command	`lean-ctx -c "<cmd>"`
05	API proxy	HTTP (LLM API pass-through)	localhost:4444 (default)	on demand	`lean-ctx proxy start`
06	Web dashboard	HTTP + bearer token	localhost:3333 (default, --port)	on demand	`lean-ctx dashboard`
07	Terminal UI	TTY (in-place redraw)	live event stream / 1 s refresh	interactive	`lean-ctx watch · gain --live`

BStorage layout — local XDG dirs

Persistent state is plain files under the XDG base directories: inspectable, exportable, deletable. No hidden databases beyond these local folders.

ARTIFACT	FORM	PURPOSE
`config.toml`	TOML	Single config file — integration mode, compression, providers, opt-outs (config dir)
`cache/`	content-addressed	Session file cache; unchanged re-reads collapse to ~13-token stubs (cache dir)
`bm25 index`	inverted index	Lexical search over code chunks + provider documents (data dir)
`context_graph/`	property graph	Imports, calls, types across files and repos — powers map mode + deep queries (data dir)
`knowledge`	SQLite	Persistent facts, decisions, rooms — recalled across sessions, CCP (data dir)
`savings ledger`	append-only JSONL	Every compression event; Ed25519-signable for audit (data dir)
`litm_calibration.json`	JSON	Learned context-position hit rates (lost-in-the-middle calibration) (cache dir)
`events.jsonl`	event stream	Live feed consumed by watch, dashboard and efficacy reports (state dir)

CAdaptive-learning layers

Seven online-learning mechanisms tune compression to your real usage, locally, from quality signals like bounces and edit failures. Deep dive: Adaptive Learning →

L1
Adaptive thresholds Online-learned compression aggressiveness from quality signals (bounces, edit failures, clean runs)
L2
LITM calibration Empirical placement of critical context at positions the model actually attends to
L3
Stigmergic scent field Multi-agent coordination via decaying markers: claimed, done, stuck, hot, avoid
L4
Delta playbook Incremental checkpoint snapshots that survive context compaction
L5
Query-conditioned IB Information-Bottleneck compression fused with query relevance
L6
Theta-gamma chunking Wakeup facts grouped in attention-friendly bursts
L7
Semantic dedup Likelihood-scored redundancy filtering across the session

DSecurity boundaries

Hard guarantees enforced in the runtime. Security model →

PathJail Every file access is canonicalised and confined to the workspace root
IDE config-dir jail Home-level IDE/agent config dirs (~/.claude, ~/.codex, ~/.codebuddy, …) are writable only when allow_ide_config_dirs is opted in; otherwise PathJail blocks them
Shell allowlist Deny-by-default command policy for agent-issued shell executions
Local-first All processing on-device; dashboard binds to localhost and requires a bearer token
Signed evidence Savings ledger entries are Ed25519-signable and batch-verifiable

Integration Modes

One binary. Three ways in.

LeanCTX automatically selects the optimal integration mode for each agent: CLI-Redirect drives the LeanCTX CLI through editor rules with zero MCP overhead, Hybrid combines MCP cached reads with shell compression hooks, and Full MCP provides maximum tool access for protocol-only editors.

CLI-Redirect

For rules-driven and terminal-first agents, plus CI

Editor rules route every read, search and shell call through the CLI. No MCP server, no schema overhead.

lean-ctx -c / read / grep

Hybrid

Default for Cursor, Claude Code, Codex, Windsurf, and 20+ agents

MCP for cached reads (13 tokens), CLI for shell commands and searches, best of both worlds.

MCP cache + CLI shell/search

Full MCP

For JetBrains, VS Code, Neovim, Emacs, Zed

All 81 tools via MCP protocol with lazy tool set, ideal for agents that require MCP.

81 tools via MCP + lazy tool set

Either way, LeanCTX picks the right mode for your editor, automatically. See all 30+ supported tools

Background Daemon

Always on. Always yours.

A small background service keeps your session warm, so cache hits are instant and memory is always there. It starts automatically during setup, restarts itself when you update, and cleans up after itself, nothing to manage.

lean-ctx serve --status

$ lean-ctx serve --status

Daemon running (PID 4139)

Endpoint: ~/Library/Application Support/lean-ctx/daemon.sock (ready)

PID file: ~/Library/Application Support/lean-ctx/daemon.pid

# autostart on login:

$ lean-ctx daemon enable # launchd / systemd

Capabilities

Every Capability, One Binary.

Everything between your code and the AI, handled.

Smart I/O

Deterministic reads, shell compression, search, full context visibility + 99% fewer tokens

14 tools 6 features

Request Compression

An optional local proxy compresses every request to the model — system prompt, history and tool results — prompt-cache safe.

4 tools 5 features

Intelligence

Intent routing, mode selection, adaptive pipeline

13 tools 11 features

Memory

Sessions, project knowledge, graphs, handoffs

12 tools 5 features

Governance

Roles, budgets, SLOs, workflow gates, policies

6 tools 9 features

Verification

Lean4 formal proofs, claim-based verification, Quality Levels 0-4

7 tools 8 features

Integrations

MCP, HTTP, SDK, 29+ IDEs, Cloud, Team Server

6 tools 6 features

Shared Sessions

Workspace & channel-based session sharing across agents

4 tools 5 features

Context Bus

Real-time event stream for context changes via SSE

2 tools 5 features

SDK & API

TypeScript SDK and REST API for external integrations

0 tools 4 features

Verification

Every output carries proof

LeanCTX generates proof artifacts for every session: which files were read, what was compressed, which checks passed, and how tokens were spent. This makes AI work auditable, replayable, and trustworthy.

Cookbook & SDK

Real examples against a running server (<code>/v1/tools/call</code>).

Verification & CI

Guardrails: clippy/tests + output verification.

Memory (Policies)

Feedback, relations, retrieval modes.

Trust

What LeanCTX does, stores, and never does.

The one-paragraph definition

LeanCTX (short for Lean Context) is the open-source context engineering layer for AI agents. One local Rust binary decides what agents read (10 read modes, 60–90% fewer tokens, ~13-token cached re-reads), remembers what they learn (persistent sessions, knowledge graph), guards what they touch (PathJail, secret redaction, budgets, injection detection), proves what they save (Ed25519-signed ledger, reproducible benchmark) and replays what they saw (git-anchored, signed context snapshots you can restore or share); an optional local proxy compresses what they send — every request's system prompt, history and tool output, prompt-cache-safe on the wire. Compression — read-side and wire-side — is one of five subsystems, and every original stays locally retrievable. Works with 30+ AI coding tools via MCP and shell hooks; embeds in any agent via a versioned /v1 API with Python, TypeScript and Rust SDKs. Local use is free forever, enforced by CI.

Read the full story

Take back control of your context.

One local binary on both sides of the model. LeanCTX perceives, compresses, remembers, routes, and governs the complete lifecycle of AI context, from file reads to verified outputs.

Get Started GitHub

The system that decideswhat your AI sees.

What is a Cognitive Context Layer?

The construction drawing

What happens to a single read

Engineering data sheet

AProcess model

BStorage layout — local XDG dirs

CAdaptive-learning layers

DSecurity boundaries

One binary. Three ways in.

Always on. Always yours.

Every Capability, One Binary.

Smart I/O

Request Compression

Intelligence

Memory

Governance

Verification

Integrations

Shared Sessions

Context Bus

SDK & API

Every output carries proof

Take back control of your context.

The system that decides
what your AI sees.