COLD START
Context vanishes between sessions; every run starts blind.
Context Engine Platform
Your teams are shipping more AI agents than ever — on the same models as everyone else. RCE is the self-recursive memory layer underneath them: one context graph spanning text, images, and code, with semantic recall and grounded answers — so every agent keeps context across sessions and returns work you can trust.
Early access · Self-host or fully managed · Bring your own model keys
THE PROBLEM
Same model, no memory — and complex tasks fall apart.
Context vanishes between sessions; every run starts blind.
Vector RAG returns lookalike chunks, not connected knowledge.
Stuff everything in the window and quality collapses.
Memory only answers when asked; it never anticipates.
INSIDE THE ENGINE
Every modality, at enterprise scale, on a recursive knowledge graph.
MULTIMODAL MEMORY
Enterprise knowledge doesn't live in one format — and neither does RCE's memory. Every modality lands in a single context graph, governed and recalled the same way.
ENTERPRISE-SCALE MEMORY
Per user, per org, per agent, per domain, per repo — every subject gets its own context graph, served and governed as a single system.
MEMORY SYSTEM
RCE isn't a cache — it's a layered memory architecture that remembers, consolidates, and forgets.
RECURSIVE KNOWLEDGE GRAPH
RCE extracts entities and relations, then embeds them twice — text and structure.
HOW IT WORKS
One loop turns raw context into durable, grounded memory.
Documents → recursive KG + dual embeddings → hybrid recall → grounded answer → consolidate & prune. Then it loops.
SEMANTIC RECALL
RCE maps a knowledge graph of everything your agents see, then pulls only what's relevant — by graph relation, semantic similarity, and keyword.
CURATION
RCE doesn't dump your whole knowledge base into the prompt. It:
Result: the infinite context window — think about shipping, not tokens.
Retrieve only what matters. Compress. Rank. Stay in budget.
2.1k / 32k tokensPROACTIVE
Flat RAG waits for a query. RCE anticipates — enriching episodes and rolling forward the context an agent will need next.
The line marks the query. RCE has already surfaced the context — flat RAG is still catching up.
DURABILITY
Flat retrieval collapses when the window fills. RCE consolidates and prunes, so answers stay sharp across long sessions.
PROOF
Same models — better context wins. On LoCoMo, RCE projects on par with the strongest memory systems, approaching full-context quality at a fraction of the tokens.
Note: pre-release projections, benchmarked against published Mem0 / Zep baselines.
LongMemEval (projected): RCE 74.6%* · Zep 71.2% · full-context 60.2%
* RCE figures are pre-release projections, not measured results. Baselines (Mem0, Zep) are published LoCoMo results. A full benchmark run is pending.
COMPARISON
What each approach can actually do.
| Capability | RCE | Flat RAG | Full-context | Memory systems |
|---|---|---|---|---|
| Persistent cross-session memory | ✓ | ✗ | ✗ | ✓ |
| Knowledge graph (relations) | ✓ | ✗ | ✗ | ◐ |
| Hybrid retrieval (graph + semantic + keyword) | ✓ | ◐ | ✗ | ◐ |
| Proactive surfacing | ✓ | ✗ | ✗ | ✗ |
| Consolidation + forgetting | ✓ | ✗ | ✗ | ◐ |
| Grounded / validated answers | ✓ | ◐ | ◐ | ◐ |
| Multi-tenant + self-host | ✓ | ◐ | ◐ | ◐ |
| Token-efficient context | ✓ | ◐ | ✗ | ◐ |
Capability comparison of approaches, not a benchmark. ✓ supported · ◐ partial / varies · ✗ not supported.
EVERY SURFACE
MCP server · Python SDK · REST API · LangChain & CAMEL adapters.
# install pip install recursive-context-engine # ingest a corpus into memory rce ingest ./docs --store memory # recall, grounded rce recall "how does auth work?"
TRUST & GOVERNANCE
Every fact is controlled, traceable, and isolated by tenant — by default.
Authorization, retention, and audit live in the substrate — not bolted on. Policy applies across every graph, every query, every modality.
Every fact in the graph traces back to the source episode that produced it. Audit any answer back to exactly where it came from.
Each org, user, and agent gets its own isolated context graph. No bleed across tenants — memory stays private to whom it belongs.
ENTERPRISE
Your context is your most sensitive asset. RCE keeps it yours.
Namespaced memory per tenant, with quotas.
Scoped sk_live_ keys, per-project access.
OpenTelemetry traces and Prometheus metrics.
Runs in your own VPC via Docker Compose.
Per-tenant usage tracking and analytics.
Centralized KG registry and domain configuration.