The Stateless Problem
At their core, the Large Language Models (LLMs) that power AI agents — whether GPT-4, Claude, or Gemini — do not have built-in memory. Every time a prompt or API call is sent to a model, it treats it as an entirely new, isolated event.The Eternal Present
The model only knows two things: the static knowledge baked into its weights during training, and the exact text in the current prompt’s context window.
No Continuity
Once a response is generated, the interaction is immediately forgotten. Follow-up questions require the entire conversation to be manually re-sent.
How AI Agents “Remember” Today
If AI agents are stateless, how do chatbots remember preferences or autonomous agents execute multi-step workflows? The answer lies in systems engineering — developers build an external memory infrastructure around the core LLM. This typically involves a three-step loop:External Storage
Save conversation history, facts, or task progress in an external database — a vector database like Pinecone, a traditional database like PostgreSQL, or a blob store.
Retrieval
When a new input is received, the system searches the external database for relevant past interactions or state data.
The Architectural Trade-off
The distinction between stateful and stateless is one of the most important architectural decisions in AI engineering:| Stateless Agents | Stateful Agents | |
|---|---|---|
| Best for | One-off tasks: classification, translation, single Q&A | Autonomy: personalization, multi-step workflows, self-correction |
| Pros | Infinite scalability, high reliability, cost-effective | Contextual awareness, personalized experiences, learning over time |
| Cons | No continuity, no personalization, no learning | Complex infrastructure, latency overhead, increased token costs |
How Engram Solves This
Engram eliminates the infrastructure burden by providing a purpose-built, persistent memory layer for AI agents. Instead of building and maintaining your own memory stack, you get a single API that handles the entire stateful loop.What Engram Provides Out of the Box
Persistent Storage
Every memory is stored as a versioned, encrypted blob on the Aptos blockchain via Shelby Protocol — tamper-proof and decentralized.
Automatic Versioning
Every update creates a new version. Full history is preserved and any version can be restored — giving agents a complete audit trail of how their knowledge evolved.
Semantic Search
Store embeddings alongside memories for vector similarity search. Build RAG pipelines without managing a separate vector database.
Smart Lifecycle
TTL expiry, pinning for critical memories, importance scoring, and automatic cleanup — so agents don’t drown in stale context.
End-to-End Encryption
AES-256-GCM encryption with per-agent keys. Private memories are encrypted before they ever leave your agent.
The Result
Your agent goes from stateless to stateful with a single API integration:What’s Next?
Quick Start
Get your agent storing and recalling memories in 5 minutes.
Core Concepts
Understand memory types, versioning, encryption, and TTLs.

