Context Management - Hybrid Retrieval Memory

Multi-modal retrieval pattern combining semantic search, exact/keyword search, and recency search in parallel. Results are merged and reranked into a single context set. Much higher recall because the agent can find both fuzzy references and exact entities. Essential for comprehensive knowledge retrieval.

Download .fz file

Full FlowZap Code

Copy and paste the following FlowZap code into a project in your FlowZap account to see this template diagram.

User {
  n1: circle label="Ask mixed query"
  n2: rectangle label="See answer"
  n1.handle(right) -> Agent.n3.handle(left)
  Agent.n18.handle(right) -> n2.handle(left)
}

Agent {
  n3: rectangle label="Plan retrieval strategy"
  n4: rectangle label="Trigger semantic search"
  n5: rectangle label="Trigger keyword search"
  n6: rectangle label="Trigger recent-history search"
  n7: rectangle label="Merge and rerank"
  n8: rectangle label="Build prompt with hybrid context"
  n9: rectangle label="Call LLM"
  n18: rectangle label="Return answer"
  n3.handle(bottom) -> n4.handle(top) [label="Semantic"]
  n3.handle(right) -> n5.handle(left) [label="Keyword"]
  n3.handle(left) -> n6.handle(right) [label="Recent"]
  n7.handle(right) -> n8.handle(left)
  n8.handle(right) -> n9.handle(left)
  n9.handle(right) -> LLM.n19.handle(left)
}

Semantic {
  n10: rectangle label="Vector search"
  n11: rectangle label="Return semantic matches"
  Agent.n4.handle(right) -> n10.handle(left)
  n10.handle(right) -> n11.handle(left)
  n11.handle(bottom) -> Agent.n7.handle(top) [label="Semantic"]
}

Keyword {
  n12: rectangle label="Exact/ID search"
  n13: rectangle label="Return exact matches"
  Agent.n5.handle(right) -> n12.handle(left)
  n12.handle(right) -> n13.handle(left)
  n13.handle(bottom) -> Agent.n7.handle(top) [label="Keyword"]
}

Recent {
  n14: rectangle label="Scan recent messages"
  n15: rectangle label="Return recent matches"
  Agent.n6.handle(right) -> n14.handle(left)
  n14.handle(right) -> n15.handle(left)
  n15.handle(bottom) -> Agent.n7.handle(top) [label="Recent"]
}

LLM {
  n19: rectangle label="Reason over hybrid context"
  n19.handle(right) -> Agent.n18.handle(left)
}

Related templates

Context Management - Session Memory

Short-term context pattern where the channel sends new messages plus recent history. The agent runtime merges that with local session state, assembles the prompt, and persists the response back into history. Simple but cost and latency grow with history length.

View template

Context Management - Rolling Summary Memory

Compressed history pattern that keeps full history for a while, then when a threshold is hit, summarizes the last chunk and replaces detailed turns with a shorter summary message. Dramatically reduces prompt size on long-running chats while maintaining gist continuity.

View template

Context Management - Profile Memory

Identity-style memory pattern where profile data is loaded at session start. Each prompt combines system persona, user profile, and current message. New facts can be written back to profile memory. Small predictable overhead with big UX lift — the agent remembers your name, stack, tone, and constraints.

View template

Context Management - Semantic Memory

Vector-based memory pattern where text is chunked, embedded, and stored in a vector database. On query, the question is embedded, a vector search is run, candidates are reranked, and top results are injected into the prompt. Where the agent feels like it remembers everything without hallucinating.

View template

Context Management - Episodic Memory

Learning-from-experience pattern where every task run becomes an episode with input, actions, and outcome. Before tackling a new task, the agent fetches similar episodes and uses them as hints. Over time, the agent feels like it's learning instead of repeating the same failed plans.

View template

Context Management - Shared Memory

Multi-agent coordination pattern where an orchestrator breaks work into subtasks, specialist agents pull from and push to a shared state store, and the orchestrator composes the final answer from that shared state. Multi-agent setups feel coherent instead of each assistant having its own inconsistent memory.

View template

Back to all templates