Skip to content
BaseThread
Back to Blog

AI memory vs shared context

Does AI actually remember anything? How AI memory works

Does AI remember? By default, no. A model forgets between sessions. Memory features bolt recall on top. Here is how AI memory really works, in plain English.

April 23, 2026by BaseThread

Does AI actually remember anything? By default, no. A language model has no memory of your past conversations. Each time you start a session, the model begins from a blank slate and only knows what is in front of it right now. When a tool like ChatGPT seems to recall your name or your project from last week, that is a separate memory feature layered on top, not the model holding on to anything by itself.

That distinction trips up almost everyone, and it matters. Once you see how "memory" is actually built, the quirks make sense: why a long chat forgets your early instructions, why memory only works in one tool, and why it never quietly becomes something your whole team can use.

Definition

AI memory

By default a language model is stateless: it remembers nothing between sessions and only sees the text in its current context window. AI memory features add recall by saving notes from your chats outside the model and feeding the relevant ones back in later, so the model appears to remember. It is recall through re-feeding, not the model storing memories on its own.

The default: the model forgets everything

Start here, because it is the part people get wrong. A model is stateless. It does not carry anything from one conversation to the next. Close the chat, open a new one, and you are talking to something that has no idea the first conversation ever happened.

Even within a single chat, the model is not "holding" the conversation in some persistent store. Each time it replies, the tool replays the recent messages into the context window, the fixed amount of text the model can take in at once. The model reads that and responds. Nothing sticks around after.

This is why a fresh session knows nothing, and why your AI assistant cheerfully reintroduces itself to a problem you spent an hour on yesterday. It is not being forgetful. It was never holding on in the first place. We go deeper on this in why AI agents forget.

So how does memory get added?

Memory features get around the blank slate with a simple trick: store notes outside the model, then slip them back in when relevant. Roughly:

  1. Watch and extract. As you chat, the feature notices things worth keeping, such as your name, your tech stack, your preferences, or recurring facts about your work.
  2. Save them. Those notes get written to a store outside the model, attached to your account.
  3. Re-feed them. On a later chat, the relevant notes get quietly inserted into the context window before the model responds, so it answers as if it remembered.

The model still is not remembering anything. It is reading notes someone saved and handed back to it. That is the whole mechanism behind ChatGPT memory, Claude's memory, and Cursor memories. Useful, but worth understanding for what it is.

BaseThread, your team's AI tools finally on the same page. Get started.

Why a long chat still forgets the start

Even with memory on, you have probably watched a long conversation lose track of something you said early on. That is the context window again. The window has a fixed size, and a long chat eventually produces more text than fits. To keep responding, the tool trims or summarizes the oldest messages so the newest ones still fit.

The casualty is usually that constraint you set at the very beginning. It scrolled out of the window, so the model can no longer see it. This is not a bug in your prompting. It is a structural limit on how much the model can hold at once, and no amount of "remember what I said earlier" fixes it once the text is gone from the window.

The three things people call "memory"

The word "memory" gets stretched across three different things, which is half the confusion:

  • Built-in memory. The per-user features inside ChatGPT, Claude, and Cursor that save notes from your chats. Personal, automatic, locked to that one tool.
  • The context window. The model's short-term working memory for one response. Not persistent at all, just what fits right now.
  • Memory infrastructure. Developer tools like Mem0 or Zep that let engineers give the agents they build a place to store and recall information. Plumbing, not a finished product.

These are genuinely different, and conflating them leads to bad expectations. If you want the clean line between "memory" and "context," we draw it in context vs memory in AI.

The one-line version

The model forgets by default. Every form of AI "memory" is a way of putting the right text back in front of it at the right time. Once you see that, the quirks stop being mysterious.

Where memory quietly fails: the team

Built-in memory works well for one person in one tool. The trouble starts the moment you want what a team knows to be available everywhere.

Memory is per-user by design. Your saved notes live in your account, in your tool. They do not reach a teammate's Cursor, and there is no mechanism that folds ten people's personal memories into one shared memory. So a decision your AI watched you make does not show up for anyone else, and a new hire's tools start from zero with none of the team's history. That gap is the whole point of per-user AI memory doesn't compound into team knowledge.

What a team actually needs is the opposite of personal memory: one curated source every tool reads, plus an AI-written record of what shipped and what was decided, kept current as work happens. Tools read it back over a standard connection at the start of a task, and write activity and decisions back to it when they finish. That is shared context, and it is built around the team as the unit instead of the individual. The how it works page walks through the read-and-write-back loop end to end.

TL;DR

By default an AI model remembers nothing: it is stateless and only sees its current context window. Memory features fake recall by saving notes from your chats and feeding the relevant ones back in later. Long chats still forget early details because the oldest messages drop out of the window. And all of this is per-user and single-tool, so personal memory never becomes team knowledge. For that, a team needs shared context: one curated source every tool reads, updated as work happens.

One curated source every tool reads and writes back to, so what the team knows reaches everyone, not just one person in one tool.

See memory that works for a team

Related reading

Frequently asked questions

Does AI remember previous conversations?

By default, no. A language model has no memory of past chats. Each session starts blank, and the model only sees what is in front of it right now. The reason a tool like ChatGPT can sometimes recall your past chats is a separate memory feature layered on top that saves notes and feeds them back in. The underlying model is not remembering on its own.

How does AI memory actually work?

A memory feature watches your conversations, extracts things worth keeping such as your name or preferences, and stores them outside the model. On a later chat it quietly inserts the relevant saved notes back into the context so the model appears to remember. It is recall through re-feeding, not the model holding on to anything between sessions.

Why does AI forget what I told it earlier in the same chat?

Because of the context window, the fixed amount of text a model can hold at once. In a long conversation the oldest messages get trimmed to make room for newer ones, so a detail from the start can fall out of view. The model is not ignoring you. That part of the chat is simply no longer in front of it.

Is AI memory shared across my team?

No. The memory features in ChatGPT, Claude, and Cursor are per-user and locked to one tool. Your saved notes do not reach a teammate's tools, and there is no built-in way to merge everyone's memory into one team memory. Sharing what a team knows across tools needs a deliberate shared layer, not personal memory.

Get your team's AI tools on the same page

BaseThread is the shared context-graph that Claude Code, Cursor, and every AI tool your team uses can read, so no one re-explains the same context twice.

Request access