Skip to main content
Enterprise AI, decodedJanuary 1970

May 13, 2026Research Analysis

HippoRAG 2: From RAG to Memory — Non-Parametric Continual Learning for LLMs

Employees do not retrieve institutional knowledge — they associate it, recalling and connecting facts across sources — and most enterprise AI gets this wrong by treating internal knowledge as a search problem: embed, index, look up. HippoRAG 2 reframes it as a memory problem, borrowing the brain's design: a knowledge graph of LLM-extracted relationships acts as the index (the hippocampus) over a corpus of raw passages (the cortex), and retrieval runs Personalized PageRank from the query's entities to surface passages that are structurally related even when they share no keywords. The enterprise payoff is concrete — new documents fold into the graph incrementally, so a constantly changing knowledge base does not force a full re-index, and reported indexing cost undercuts the leading graph-RAG alternatives. Two limits matter before production: the graph is only as good as triple extraction, and highly connected hub entities can dominate PageRank and crowd out genuinely relevant results. The unresolved gap is access control — different users must see different subgraphs, and that has to be engineered in, not assumed.

Retrieval ArchitectureKnowledge Graphs & OntologyEnterprise Knowledge Systems

§ 01

Summary (what this paper is saying)


HippoRAG 2 reframes RAG not as a retrieval mechanism but as a memory system — specifically non-parametric continual learning. Inspired by how the hippocampus indexes memories in the human brain, it builds a knowledge graph from documents using LLM-extracted triples, then retrieves via Personalized PageRank over that graph. HippoRAG 2 improves on v1 by adding stronger context awareness over the KG and better passage integration. The core shift: instead of treating each query as an isolated retrieval task, the system treats the knowledge base as a persistent, accumulating memory that the LLM can associate across. Evaluated across factual memory, sense-making, and associativity benchmarks, it outperforms GraphRAG, RAPTOR, and LightRAG while using significantly fewer resources for offline indexing.

§ 02

Core Argument


Why do enterprises need both RAG and knowledge graphs?

  • Pure vector RAG treats every retrieval as independent — it has no model of how facts connect across documents. This works for single-hop lookups but fails when the answer requires traversing relationships (supplier → region → risk event → impact).
  • KGs provide the associative structure — the graph is the memory index. RAG provides the grounding — the raw passages contain the detail that the graph compresses away.
  • HippoRAG 2's key claim: the right abstraction is not "retrieval" but "memory." Enterprises don't just need to find documents; they need a system that accumulates knowledge over time and can associate new information with existing context. That requires both a graph (for structure) and text (for fidelity).
  • § 03

    RAG Side (Strengths & Limits)


    § 04

    Strengths:


  • Fast deployment, works on existing document corpora
  • Preserves full textual detail and nuance
  • Low indexing cost relative to full KG pipelines
  • § 05

    Weaknesses:


  • No cross-document association — each retrieval is stateless
  • Cannot accumulate knowledge over time without re-embedding
  • Multi-hop reasoning requires the model to do all the connecting inside the context window, which is unreliable at scale
  • § 06

    Knowledge Graph Side (Strengths & Limits)


    § 07

    Strengths:


  • Explicit associations between entities across documents
  • Personalized PageRank enables associative retrieval — finding relevant nodes even without direct keyword match
  • Persistent memory structure that grows incrementally
  • § 08

    Weaknesses:


  • Incomplete by construction — facts outside the extracted triples are invisible
  • Entity extraction quality determines graph quality
  • Expensive to build at document scale (though HippoRAG 2 is cheaper than GraphRAG/LightRAG)
  • § 09

    Key Insight (the "why both" claim)


    The graph is the index; the passages are the memory. HippoRAG 2 uses the KG to identify which passages are relevant (via PageRank traversal from query entities), then retrieves those passages for the LLM to read. Neither component works alone: the graph without passages loses context; passages without the graph lose associativity. The neurobiological framing is deliberate — hippocampal indexing theory says the hippocampus doesn't store memories itself, it stores pointers to where memories live in the cortex. HippoRAG 2 implements this literally: the KG is the hippocampus, the passage corpus is the cortex.

    § 010

    Mental Model (how to think about it)


    Think of it as a two-layer memory architecture. Layer 1 (KG): a graph of entity relationships extracted from your document corpus — the index of what connects to what. Layer 2 (passages): the actual text, stored as-is. At query time: extract entities from the query → run Personalized PageRank on the KG starting from those entities → retrieve the passages linked to the highest-scoring nodes → pass to the LLM. The PageRank step is what enables associative retrieval — it surfaces documents that are structurally related to the query entities even without direct keyword overlap.

    § 011

    Enterprise Implications


  • Continual knowledge accumulation without retraining: new documents can be added to the graph incrementally. For enterprises with constantly updating knowledge bases (policies, contracts, product docs), this is the critical advantage over fine-tuning.
  • Cheaper than alternatives at scale: HippoRAG 2 uses significantly fewer indexing resources than GraphRAG and LightRAG, making it viable for large enterprise corpora.
  • Associative retrieval enables cross-silo queries: "find all projects that share a vendor with Project X" becomes tractable when vendor relationships are explicit in the graph.
  • The memory framing is important for agentic systems — agents need persistent memory across sessions. HippoRAG 2's architecture is directly applicable as the long-term memory layer for an agent that needs to accumulate and associate knowledge over extended operation.
  • § 012

    Technical Mapping


    RAG → vector retrieval, embeddings, chunking

    Graph → entities, relationships, Personalized PageRank

    How they connect:

  • Offline: LLM extracts (subject, relation, object) triples from passages → build KG → link each triple back to source passages
  • Online: query → entity extraction → PageRank over KG from query entities → retrieve linked passages → LLM generates answer from passages
  • The KG determines retrieval scope; the passages provide the actual evidence
  • § 013

    My Critique


  • The quality ceiling is set by triple extraction. If the LLM misses or hallucinates triples during indexing, the graph is wrong and PageRank surfaces the wrong passages. The paper doesn't deeply analyze extraction failure modes.
  • Personalized PageRank is effective for associative retrieval but can over-surface highly connected hub nodes — in enterprise graphs, certain entities (e.g. the CEO, the main product) appear everywhere and will dominate PageRank scores regardless of actual relevance.
  • The neurobiological framing is compelling but slightly oversold — human memory involves forgetting, consolidation, and reconsolidation that the system doesn't model. The analogy is useful but shouldn't be taken too literally.
  • No discussion of access control — in enterprise settings, different users should see different subgraphs based on permissions. This is a real deployment challenge not addressed.
  • § 014

    When this fails


  • High-velocity document environments where the graph becomes stale faster than it can be updated
  • Queries that require numerical reasoning or aggregation across many documents — graph traversal finds relevant nodes but doesn't aggregate
  • Domains where entity boundaries are ambiguous (legal contracts, scientific literature with complex nested references)
  • When hub entities dominate the graph, PageRank-based retrieval degrades for non-hub queries
  • § 015

    Key Takeaways for the CIO


    Watchlist priority: High — 12-month radar.

    Most enterprise AI deployments treat internal knowledge as a retrieval problem: embed documents, index them, search them. HippoRAG 2 reframes this correctly as a memory problem. Employees don't retrieve institutional knowledge — they associate, recall, and connect it across sources. AI systems need the same capability. Vector search alone cannot provide it.

    The practical case for CIOs:

  • Knowledge bases that change constantly break retrieval-only systems. Policies update, contracts are signed, decisions are made. Systems that require a full re-index every time your knowledge base changes are not production-viable at enterprise scale. HippoRAG 2's incremental graph update model addresses this directly.
  • It is cost-competitive with the leading alternatives. HippoRAG 2 indexes faster and uses less compute than Microsoft GraphRAG and LightRAG — the two most widely evaluated enterprise graph RAG systems. Any formal evaluation of graph-based retrieval should include this as a comparator.
  • The architecture supports agentic systems. As enterprises move toward persistent AI agents that operate across sessions, those agents need a long-term memory layer. HippoRAG 2's accumulating knowledge graph is a viable candidate for that layer, across industries and domains.
  • Unresolved gap: Graph-level access control is not addressed. Different users seeing different subsets of organisational knowledge is a baseline enterprise requirement. This must be engineered before any production deployment — it is not provided out of the box.

    Recommended action: Pilot against one high-value internal knowledge domain. Measure multi-hop query accuracy versus your current search or RAG setup. Use the results to decide whether to expand to a full knowledge graph programme.

    Correspondence

    New essays to your desk.

    By subscribing you consent to receive our newsletter. Unsubscribe at any time via the link in any email. Privacy Policy.

    Sent only when there is something worth reading. Unsubscribe anytime.