5 Comments
Mitchell Kosowski

The Memento framing nails it and I'd like to build on your point about auditability. In enterprise contexts, the training/deployment boundary isn't just engineering convenience. It's what makes AI legally shippable. Once models compress user experience into weights, every update becomes simultaneously an ML experiment and a data lineage event.

That's why the middle ground feels like where near-term wins live: frozen bases with auditable module layers. Parametric learning isn't blocked by technical limits so much as by the fact that procurement won't sign off on a model whose weights drift every Tuesday or full moon.

The filing cabinet stays locked. You just get differentiable drawer keys.

Shanka Jayasinha

Great summary of the problem at hand. We believe it can be solved with the right architecture.

Timmy Ghiurau

Best piece we've read on this. The spectrum is right: context, modules, weights. The filing cabinet framing is the one the memory category needed.

We've been building the weight-level end for two years.

www.midbrain.ai

Kevin Keller

We're building the Artificial Doubt Engine (ADE, pronounced "Ah-Dee") at Tucuxi: https://tucuxi.ai/

We've applied to A16Z Speedrun, so reach out if you're really interested in funding the radical bet: novel architectures that learn continuously. That said, we fundamentally disagree with your premise that "you should build the learning mechanism into the substrate." The substrate, the model, is a flat neural substrate; reasoning, and learning more advanced reasoning, should live in a separate harness that can wrap around any model. There's no filing cabinet, just right-sized recurrent neural networks that compress tacit knowledge about the right depth and scope of reasoning, plus feedback signals that span the time frame over which outcomes can actually be determined. Many of the decisions we learn from as humans have outcomes that arrive much further down the road; if we neglect those outcomes, the learning signal that could have made us, or a system, better is simply thrown away.

Everyone else is building AGI — Artificial General Intelligence. Systems that know. We're building ADE — a system that doubts. Because we believe that doubt is more fundamental than knowledge. Knowledge is what doubt produces when it operates long enough across enough dimensions. Intelligence is what you get when doubt calibration is sufficiently compiled. And the only system you can trust is one that doesn't trust itself.

If you want to learn more: https://medium.com/@hikevin/how-we-named-ade-8f0e4116a450

Scott

Really good piece.

We've spent the last year at Memco pretty deep in this question, and my main reaction is that people still make the category too model-centric. "Continual learning" gets treated as if it only counts when weights change. That's too narrow.

In practice, learning happens across at least three layers: the model, the harness, and the memory/context layer. Long term, I agree the deeper answer probably does involve stronger parametric learning. But enterprises absolutely want agents that get smarter every day, and the near-term question is where that learning shows up first: in opaque weight updates, or in a memory layer that can retain, validate, scope, and reuse what the organization has actually learned.

That matters because most enterprise intelligence is local. Which fix actually worked. Which workflow changed. Which runbook is stale. Which exception matters. Who else should inherit that lesson. There's also a big gap between individual continual learning and organizational continual learning. A lot of the real pain is not "the model can't learn." It's "agent A is re-learning what agent B already discovered last week."

So I'd frame external memory less as a dodge around continual learning, and more as the practical bridge to it. But the bar is high. A bigger filing cabinet is still a filing cabinet. Traces are raw material, not the product. The hard part is deciding what gets admitted, validated, scoped, decayed, and reused.

And there's another reason this matters: if you run those experiential loops well, you generate an incredibly valuable corpus for parametric memory later. Not generic trace sludge — validated experience. The kind of thing you could plausibly compress into LoRA-style updates, adapters, or other scoped weight updates for frontier or open models. So active memory isn't an alternative to parametric learning. It's one of the cleanest ways to prepare for it.
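As a rough mechanical illustration of "LoRA-style updates, adapters, or other scoped weight updates", here is a minimal low-rank-adapter sketch with NumPy. Shapes, names, and initialization are assumptions for illustration, not a production recipe:

```python
import numpy as np

# LoRA-style idea: the base weight W stays frozen; learning is captured in a
# low-rank pair (A, B) that can be audited, scoped to a team, or rolled back.
rng = np.random.default_rng(0)

d_in, d_out, rank = 16, 16, 4
W = rng.standard_normal((d_out, d_in))         # frozen base weights
A = rng.standard_normal((rank, d_in)) * 0.01   # trainable down-projection
B = np.zeros((d_out, rank))                    # trainable up-projection, zero-init
scale = 1.0 / rank

def forward(x):
    # Base path is untouched; the adapter path carries the learned delta.
    return W @ x + scale * (B @ (A @ x))

x = rng.standard_normal(d_in)
# With B zero-initialized, the adapter contributes nothing yet,
# so the adapted model starts out exactly equal to the frozen base:
assert np.allclose(forward(x), W @ x)
```

This is also why the "filing cabinet stays locked" framing above fits: the frozen base is auditable precisely because the drift lives in a small, separable delta.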