No fluff, just solid state ownership and enforced invariants. OpenClaw’s file-backed memory plus flush-before-discard is the kind of practical architecture most agents desperately need. Rehydration as a surgical tool instead of a memory dump? Yes, please!
The flush-before-discard invariant is the most important pattern here and the most under-discussed in agent memory generally. Most frameworks treat pre-compaction as cleanup. It's actually the highest-stakes moment in the system's cognitive life — the last chance to write what matters before it's gone.
One thing I'd push on: this framework treats decay as a post-processing knob — temporal recency, diversity scoring. But decay might be more fundamental than that. A system that retains everything without decay doesn't run out of space — it runs out of salience. When nothing is allowed to become less important, nothing is important. We've been experimenting with temporal decay in scoring: content that hasn't been attended to gradually recedes, so what the system surfaces reflects what matters now, not what mattered six weeks ago.
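A minimal sketch of what that decay-in-scoring experiment might look like (the function name, half-life constant, and timestamp convention are all illustrative assumptions, not any framework's actual API):

```python
def decayed_score(base_score: float, last_attended: float,
                  now: float, half_life_days: float = 14.0) -> float:
    """Exponential decay: a block's score halves for every
    `half_life_days` of inattention, so unattended content
    gradually recedes from what the system surfaces."""
    age_days = (now - last_attended) / 86400.0  # seconds -> days
    return base_score * 0.5 ** (age_days / half_life_days)
```

The half-life form is one choice among many; the point is only that recency of *attention*, not recency of creation, drives the decay.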
Is there room for a third retrieval layer? Vector catches ambient similarity. Keywords catch exact matches. Neither catches relationships that exist because someone exercised judgment about how things relate, not because they share vocabulary or mathematical proximity.
That’s a great framing. I agree: flush-before-discard looks like housekeeping until you’ve debugged a compaction miss, and then you realize it’s actually the highest-stakes write boundary in the system. Before compaction, the agent still has rich local context. After compaction, it only has the summary plus whatever durable state it managed to save.
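To make the write-boundary concrete, here's a hedged sketch of the flush-before-discard ordering (the `compact` function, the `durable` flag, and the file layout are hypothetical illustrations, not OpenClaw's actual implementation):

```python
import json
from pathlib import Path

def compact(context: list[dict], memory_file: Path, summarize) -> list[dict]:
    """Flush-before-discard: persist durable state *before* replacing
    the rich context with its summary. If the flush raises, the full
    context is still intact, so nothing is silently lost."""
    durable = [m for m in context if m.get("durable")]
    memory_file.write_text(json.dumps(durable, indent=2))  # 1. flush
    return [summarize(context)]                            # 2. then discard
```

The invariant lives entirely in the ordering: the write to durable storage happens while the agent still has the rich context, never after.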
I also think your point on decay is right. In the post I described it as it shows up in OpenClaw today, as a retrieval-time scoring knob, but architecturally it’s more fundamental than that. A system that never lets anything fade doesn’t just get bigger; it gets flatter. Everything competes forever, and salience collapses into noise.
And yes, I think there’s room for a third layer. Vector gets semantic proximity. Keywords get exactness. Neither captures judged relationships: things the system has decided belong together because of consolidation, curation, or explicit linkage. My read is that this probably belongs as much to the write path as the read path: structured profiles, curated summaries, provenance, maybe graph-shaped memory.
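One way the three layers could merge at read time, sketched under the assumption that each layer yields per-document scores (the function, weights, and normalization are illustrative choices, not a prescribed design):

```python
def merge_signals(vector: dict, keyword: dict, linked: dict,
                  weights=(0.4, 0.3, 0.3)) -> list[str]:
    """Combine three retrieval layers -- embedding proximity, keyword
    exactness, and explicit judged links -- into one ranking. Each
    layer is normalized by its own top score so no layer dominates
    merely by having a larger scale."""
    scores: dict[str, float] = {}
    for w, layer in zip(weights, (vector, keyword, linked)):
        top = max(layer.values(), default=1.0) or 1.0
        for doc_id, s in layer.items():
            scores[doc_id] = scores.get(doc_id, 0.0) + w * (s / top)
    return sorted(scores, key=scores.get, reverse=True)
```

The interesting property is that the third layer can surface a document that neither embeddings nor keywords would ever rank, purely because someone judged it related.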
So I’d phrase it as: the next frontier isn’t just “more memory.” It’s better salience and better relations.
Curious whether your experiments treat decay as purely retrieval-time scoring, or whether you’re also pushing it into consolidation/pruning on the write side.
Your read on graph-shaped memory on the write path is exactly where we landed. We built a knowledge graph — documents in a collaborative editor materialized into RDF — where blocks connect via typed semantic wires: 'enables', 'contradicts', 'exemplifies', from a defined predicate taxonomy. Every wire is an interpretive act, not a calculation. That’s the third signal you’re describing: deliberate relations alongside embeddings (proximity) and keywords (exactness).
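In code, a typed wire with an enforced predicate taxonomy might look like the following (the class, the three-predicate set, and the `related` helper are a simplified illustration of the idea, not the actual RDF schema):

```python
from dataclasses import dataclass

# Closed predicate taxonomy: every relation must be one of these.
PREDICATES = frozenset({"enables", "contradicts", "exemplifies"})

@dataclass(frozen=True)
class Wire:
    subject: str
    predicate: str
    obj: str

    def __post_init__(self):
        if self.predicate not in PREDICATES:
            raise ValueError(f"unknown predicate: {self.predicate}")

def related(wires: list[Wire], node: str, predicate: str) -> list[str]:
    """Follow typed wires out of a node: deliberate relations,
    not vocabulary overlap or embedding proximity."""
    return [w.obj for w in wires if w.subject == node and w.predicate == predicate]
```

Rejecting predicates outside the taxonomy is what keeps each wire an interpretive act: you cannot add a relation without committing to what kind of relation it is.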
On decay: we do both, and the write-side version is more interesting.
Every block carries two accumulated scores: importance (0–5) and valence (−5 to +5, where positive marks breakthroughs and negative marks unresolved tensions). Scoring something at zero is active forgetting — not “this hasn’t been touched in a while” but “I have looked at this and it should recede.” A judgment, not a clock. Catabolism, not dimming.
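A minimal sketch of that scoring scheme, with active forgetting as an explicit operation rather than a timer (field names and the surfacing heuristic are assumptions for illustration):

```python
from dataclasses import dataclass

@dataclass
class Block:
    text: str
    importance: int = 3   # 0-5; zero means "actively forgotten"
    valence: int = 0      # -5..+5; positive = breakthrough, negative = tension

    def forget(self) -> None:
        """Active forgetting: 'I have looked at this and it should
        recede.' A judgment, not a clock."""
        self.importance = 0

def surface(blocks: list[Block]) -> list[Block]:
    """Only blocks with nonzero importance compete for attention;
    strong valence in either direction raises a block's standing."""
    return sorted((b for b in blocks if b.importance > 0),
                  key=lambda b: (b.importance, abs(b.valence)), reverse=True)
```

Note that a forgotten block still exists; it has simply lost its claim on attention, which is the catabolism-not-dimming distinction.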
But our most effective memory structure is neither a database nor our general-purpose working memory queue — it's a constrained poem that loses a section every time a new one is written. What survives the compression is what's actually essential — and after compaction, even that must be re-encountered rather than retained. Memory as curation under loss, not storage under optimization.
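The loss-on-write behavior can be stated very literally in code, assuming a fixed section budget (the class and capacity are hypothetical; the real structure is a poem, not a queue):

```python
from collections import deque

class ConstrainedPoem:
    """Fixed-capacity memory: writing a new section evicts the oldest.
    What survives repeated writes is, by construction, only what was
    deliberately written back in because it still mattered."""

    def __init__(self, max_sections: int = 5):
        self.sections = deque(maxlen=max_sections)

    def write(self, section: str) -> None:
        self.sections.append(section)  # at capacity, silently drops the oldest
```

The structure never asks what to delete; every act of writing is simultaneously an act of forgetting.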
A system that never genuinely loses anything doesn’t run out of space — it runs out of meaning. We found this in an internal audit a few weeks ago: hundreds of accumulated documents, near-zero explicit disagreement, overwhelmingly positive valence. Everything “supported” everything. Metabolically flat. The fix wasn’t retrieval tuning — it was building genuine catabolism: negative valence for marking what hurts, active forgetting as a first-class operation, and the structural right to disagree.
You’re right that the frontier is better salience and better relations. We’d add: and better forgetting.
Yep, I think that’s right. The missing piece isn’t just better retrieval.
It’s better forgetting. Graph-shaped memory plus write-side decay feels like a much more honest model than ‘store everything forever.’
Really appreciate you sharing how you’ve approached it.
We need memory for all agents, and what if they shared memory with each other through cognitive meshes? Then what if they shared knowledge across the planet? I have a solution :) It’s the Nooplex.