Pattern [08]

Memory Management

Database / Caching / Session State Management

> Agentic Definition

Mechanisms for agents to store, index, and retrieve information over time, spanning short-term (conversation context) and long-term (knowledge base/episodic) history. This gives the agent a sense of continuity and identity.

> Description

Mechanisms for agents to store, index, and retrieve information over time, spanning short-term (conversation context) and long-term (knowledge base/episodic) history. This gives the agent a sense of continuity and identity.

≈ How It Maps to Database / Caching / Session State

Persistence of state is crucial for both. Short-term memory ≈ RAM/Redis (Session Store); Long-term memory ≈ Disk/SQL/NoSQL (Persistent Store).

≠ Key Divergence

Agentic memory is often Vector-based (semantic similarity) rather than Key-Value or Relational. Retrieval is probabilistic (getting "relevant" memories based on embedding distance) rather than exact (getting "row ID 123").

> Key Takeaway

Adapt: Database schema design is replaced by "Information Retrieval Strategy." You don't just query data; you curate context. You must decide what the agent needs to know to be effective.

The Code

Before: Explicit State Persistence

Explicit State Persistence
1# Traditional Session
2user_session = db.get_user_session(user_id)
3last_action = user_session['last_action']

After: Semantic Memory

Semantic Memory
1# Vector Database Retrieval (Long-Term Memory)
2relevant_memories = vector_db.similarity_search(
3 query=current_user_input
4)
5
6# Inject memories into prompt context (Short-Term Memory Loading)
7agent.run(
8 input=current_user_input,
9 context=relevant_memories
10)

Production Notes

  • Storing everything in context windows is expensive. Efficient RAG and summarization strategies are needed to manage the "Context Window Economy."
  • Long-term memory risks storing PII indefinitely. Implementation of "Forgetting" mechanisms and strict data governance is required.

Unlock code examples & production notes

Free account — no credit card required.

Sign Up Free

Already have an account? Log in

Frequently Asked Questions

When should I use the Memory Management pattern?

Mechanisms for agents to store, index, and retrieve information over time, spanning short-term (conversation context) and long-term (knowledge base/episodic) history. This gives the agent a sense of continuity and identity.

How does Memory Management relate to Database / Caching / Session State Management?

Persistence of state is crucial for both. Short-term memory ≈ RAM/Redis (Session Store); Long-term memory ≈ Disk/SQL/NoSQL (Persistent Store). However, there is a key divergence: Agentic memory is often Vector-based (semantic similarity) rather than Key-Value or Relational. Retrieval is probabilistic (getting "relevant" memories based on embedding distance) rather than exact (getting "row ID 123").

What are the production trade-offs of Memory Management?

Storing everything in context windows is expensive. Efficient RAG and summarization strategies are needed to manage the "Context Window Economy." Long-term memory risks storing PII indefinitely. Implementation of "Forgetting" mechanisms and strict data governance is required.