Core Concepts
Adaptive Learning
LeanCTX tunes itself from outcomes: learned compression thresholds, placement calibration, a stigmergic coordination field, delta playbooks — with measurable efficacy.
LeanCTX does not just compress with fixed rules — it learns from outcomes. Seven research-driven layers observe how compression, context placement and multi-agent coordination actually perform on your machine, and adapt continuously. Everything is local-first, bounded and clamped: research-tuned defaults stay the anchor, and learned adjustments decay back toward them when the evidence ages.
Learned compression thresholds
Every compressed read is an implicit experiment. Four outcome signals adjust a per-file-type threshold delta: a bounce (compressed read followed by a full re-read) and a failed edit push compression back; a clean compressed read and a wasted full read push it forward. Deltas are clamped to ±0.15, decay 2% daily, and only apply after 10 observations per extension.
$ lean-ctx learning
Learned compression thresholds:
.rs: delta +0.041 (27 signals) — compresses more
.md: delta -0.060 (11 signals) — backs off Placement calibration (lost in the middle)
LLMs recall the beginning and end of context best. LeanCTX ships research defaults for where to place tasks, anchors and progress — and then measures where your client's recalls actually hit. The begin/end budget share shifts with the evidence (clamped to 35–85%), per client profile.
Stigmergic coordination field
Parallel agents coordinate indirectly — like ant pheromones. Agents deposit
CLAIMED, DONE, STUCK, HOT and
AVOID scents on files and tasks; each kind decays exponentially (10–60 min
half-life). Claims cost zero coordination tokens until they matter:
ctx_agent action="claim" message="src/auth/session.rs"
→ Claimed: src/auth/session.rs (decays in ~10m unless re-claimed)
# A second agent claiming the same file:
→ already claimed by local-48121 (2m ago, still active)
# And ctx_read warns on foreign-claimed files:
[scent: claimed by local-48121 2m ago — coordinate before editing] Every rejected claim is duplicate work prevented — and counted as efficacy evidence. See Multi-Agent Coordination for the direct messaging primitives this complements.
Delta playbook (no more context collapse)
Re-summarizing prior summaries loses detail — the ACE research line measured an 18k → 122 token
collapse with −29% accuracy. LeanCTX checkpoints instead distill sessions into itemized entries
with stable IDs (Strategy, Pitfall, Fact,
FileRef) that are only appended, confirmed, voted and locally evicted — never
rewritten. Resumed sessions replay the playbook, not a lossy prose summary.
Query-aware compression
Three layers make compression relevance-sensitive: entropy-mode reads fuse token entropy with an IDF-weighted score against your active task or latest semantic query; wakeup facts render as topic-clustered chunks (the theta–gamma working-memory model, ~4 items per chunk); and with the embedding engine active, semantically redundant lines are dropped MMR-style against a sliding window of kept lines.
Self-activating embeddings
Semantic features need a local ONNX embedding model (~30–90 MB). On first semantic need
LeanCTX downloads it in the background (SHA-256 pinned, trust-on-first-use) and
warms the engine — no hot path ever blocks. Air-gapped? Set
[embedding] auto_download = false and semantic layers stay off gracefully;
ctx_metrics always shows the engine status and the reason.
Share learning with your team
Learning state exports as a secret-free bundle — file extensions, client profiles and aggregate numbers only. Merges are idempotent and double-count-safe (sample-weighted deltas, max-counters), so bundles can live in a repo or ship via CI:
$ lean-ctx learning export team.json # on the experienced machine
$ lean-ctx learning import team.json # on the new machine Measured efficacy
Adaptation must earn its place. LeanCTX keeps a 30-day efficacy ring and reports: bounce-rate week over week (from the signed savings ledger), placement hit-rate movement, playbook survival, and duplicate work prevented. If a learning layer does not move its metric, it gets retuned — with data on the table.
Research foundations
- LLMLingua / LLMLingua-2 — perplexity- and classifier-based token pruning
- ACE: Agentic Context Engineering — delta contexts, anti-collapse invariants
- Lost in the Middle — U-shaped attention over long contexts
- StreamingLLM / Heavy-Hitter Oracle — attention sinks, KV eviction
- Theta–gamma coupling — working-memory chunk capacity
- Information Bottleneck — relevance-conditioned compression
- Stigmergy — indirect coordination via environmental signals