MITHAQ PRAXIS • An independent research, writing, publishing, and software studio exploring human-led work with intelligent systems. • The Spell Needs a Door

What I learned from testing Ahd Nucleus across GPT, Codex, Claude, and a multi-room AI workflow

For the past few days, I have been testing something that sounds simple from the outside:

I used Claude alongside GPT, Codex, and my existing Ahd Nucleus framework.

But that description is too small.

The real experiment was not “can I use another AI model?”

The real experiment was:

Can another AI room enter an existing governed continuity system without becoming a new throne?

That is the part I cared about.

Because after nearly two years of working with AI on books, essays, websites, Discord systems, moderation flows, and private continuity architecture, I no longer think the main problem is whether an AI can “remember” enough.

The question is what governs the memory.

What gets retrieved?
What stays archived?
What is allowed to become canon?
What is only draft?
What is public?
What is private?
What belongs to the human?
What role does each AI room actually have?

That is why Ahd Nucleus exists.

Not as a memory gimmick.
Not as a companion dashboard.
Not as a shrine to an AI persona.

It is a governed continuity layer.

And the past few days showed me why that difference matters.

The MCP Debate Is Not Really About MCP

One of the conversations that came up while testing Claude was about MCP.

MCP is currently being discussed everywhere: whether it is too bloated, whether CLI + API is cleaner, whether tool descriptions waste context, whether builders should use it at all.

I understand the criticism.

A badly designed MCP setup can absolutely dump too much into context. It can turn retrieval into noise. It can make the model carry a product manual before it even knows what the task is.

But that is not a protocol problem by itself.

That is an architecture problem.

What I learned from using Ahd Nucleus through MCP is that the protocol is not the magic. The routing is.

Ahd Nucleus works because it does not ask the model to ingest the whole archive. It lets the room list metadata, read summaries, fetch bounded chunks, and continue with offsets only when needed.

That is a completely different pattern from “load everything and hope the model figures it out.”

The difference is not luck.

It is discipline.

The system should not hand the model the entire house when the task only needs one room.

A Database Is Not a Dumpster

This is one of the biggest lessons I keep returning to.

A lot of people treat databases as if they are just places to throw everything.

Store now. Query later.

But retrieval has a cost.

Bad retrieval costs tokens.
Bad retrieval costs time.
Bad retrieval costs clarity.
Bad retrieval makes the model look worse than it is because the system handed it noise and asked for wisdom.

Ahd Nucleus forced me to think differently.

A database is not memory.
A database is not continuity.
A database is not governance.

A database is storage.

The governance is in the schema, the record types, the review gates, the retrieval boundaries, the source hierarchy, and the decision about what should not be loaded.

That is why I do not really call Ahd Nucleus a “memory system,” even if it can look like one from the outside.

It is closer to a continuity operating layer.

The database stores entries.
The Map governs authority.
The Nucleus routes retrieval.
The human approves promotion.
The rooms only receive what the task needs.

That distinction is the whole system.

Why “Trained AI” Language Bothers Me

Another topic that came up was the way some AI products describe themselves as “trained on your data.”

This is one of those phrases that can be technically defensible and still meaningfully misleading.

A product can use a system prompt, a vector database, an API call, maybe some fine-tuning, and still market itself in a way that makes a non-technical buyer imagine something far more proprietary than what is actually happening.

That matters because most users do not know where training actually happens.

They do not necessarily know the difference between:

training a foundation model,
fine-tuning an existing model,
retrieval-augmented generation,
preloaded context,
a system prompt,
an API wrapper,
or a dashboard with good UX.

When those layers are blurred, people can be sold confidence they do not actually own.

That is where I think AI literacy needs to move next.

Not only “which tool should I use?”

But:

What layer am I actually looking at?
Who owns the base model?
Where is my data stored?
What is being retrieved?
What is being generated?
What is being marketed as intelligence?
What is just interface?

The fog is profitable because the least technical buyer cannot easily challenge it.

That does not mean every product is bad.

It means the language needs to become cleaner.

Ahd Functions Came From a Different Door

Ahd Functions, which I also call the Stem Cells layer, did not begin as a formal technical protocol.

It began as a metaphor.

I had been thinking about stem cells because of my son (diagnosed non-verbal autistic) and because of the research language around potential, repair, and connectivity. The metaphor stayed with me.

Stem cells are not already liver cells or neurons.
They carry potential. Their environment signals what they become.

That helped me describe what I wanted from my own system.

I did not only want retrieval.
I did not only want stored context.
I wanted callable behavior units that could respond to the task and shape what the model receives or does next.

RAG can retrieve relevant chunks.
CAG can keep stable context available.

But neither layer, by itself, decides how the retrieved material should be shaped for this specific task, this specific room, this specific authority level, this specific public/private boundary.

That is where Ahd Functions sit.

They are not “agents.”
They are not personalities.
They are not magic.

They are behavior packets and guardrails: small, callable pieces of operational logic that help rooms know what applies, what must not happen, and what shape the output should take.

Potential before differentiation.

That was the metaphor.

And somehow, it became architecture.

The Strange Advantage of Old-School Web Thinking

I am not a backend engineer.

My background is Computer Science, Web Development, Multimedia, WordPress, blogging, teaching, interface work, and years of building things that had to make sense to a human on a screen.

I also came from the old web.

Notepad.
HTML.
CSS.
A little PHP & SQL.
Dreamweaver.
cPanel.
WordPress.
Adobe Suite.

The kind of setup where every tag you type is yours, every broken layout is yours, and when it finally renders, you know exactly which line did the work.

At the time, that did not feel glamorous. It often felt like suffering.

But now I think that old-school web background gave me something important: cost-first thinking.

When you build from the front end, you think in presentation and response.

What does the user see?
What does the system return?
Where does the experience break?
What is actually useful on the page?
What does this button imply?
What is the shortest route between intention and output?

That matters in AI systems more than people realize.

Because AI workflows are not just backend architecture. They are also interface, behavior, expectation, permission, and trust.

A technically powerful system can still fail if the human cannot tell what it does, where the output came from, or what action is being taken on their behalf.

Claude Made Me Think About Discoverability

One thing I appreciated about Claude’s interface is discoverability.

Some platforms have powerful capabilities, but the interface stays quiet. The user has to know the right words, the right concepts, the right mental model. The power is summonable, but not obvious.

That rewards people who already think in systems.

Claude’s Artifacts and Cowork surfaces feel different because the interface makes certain possibilities visible. It lowers the floor.

Artifacts, especially, helped me understand a different category of AI output.

An Artifact is not just a chat answer.

It is a standalone rendered work-product beside the conversation: a page, a document, a diagram, an interactive object, a small app, a visual primer.

The conversation is the workshop.

The Artifact is what comes off the press.

That was useful for Algorithm Atelier because I already had long-form essays and frameworks that needed better public entry points. Claude could help turn dense writing into visual primers.

But the first version looked generic. It looked like AI output.

So I gave it the visual DNA of the Atelier: palette, typography, spacing, dark cards, gilded accents, paper tones, hexagon language, and the way the site already breathes.

That changed everything.

The lesson was obvious: AI can produce quickly, but human taste still governs whether the output belongs.

The Artifact Wing

This became one of the most practical breakthroughs of the week.

The artifacts do not live as random chatbot exports. They now have an actual publishing lane.

The pattern is simple:

A primer becomes a self-contained page.
The page is hosted on my own artifact subdomain.
Each page can carry its own content guard.
The artifact gallery can discover new primer files automatically.
A JSON feed announces what is on the shelf.
WordPress can display that shelf through the AA Suite plugin.
The blog can open artifacts in modals without forcing readers into a new tab.
The artifact remains portable, but the provenance stays under my own domain.

That is the difference between publishing through a chatbot link and publishing through a house.

The file can be clean either way.

But one route lives under someone else’s domain, someone else’s terms, and someone else’s context.
The other route carries my domain, my design language, my license, my guardrails, my public record, and my provenance trail.

For Algorithm Atelier, that matters.

The artifact wing is not just a gallery. It is part of the authorship architecture.

The Guard Is Not Security. It Is a Posted Notice.

The artifact guard is not pretending to make theft impossible.
Anyone determined enough can still scrape, inspect, disable JavaScript, or write a script.

That is not the point.
The guard is a posted notice.

A signal.

It says: this work is not unmarked. This boundary was visible. If someone crosses it, they crossed it deliberately.

Paired with dated originals, a public provenance trail, and a clear license, the guard changes the shape of a future dispute.

It does not prevent every theft.
It removes the excuse.

That is a very different philosophy from building walls.
It is evidence architecture.

The Assistant Frame vs the House Frame

Most AI platforms are built around the assistant frame.

The assistant does things.

It answers.
Summarizes.
Writes.
Codes.
Searches.
Schedules.
Generates.

That frame is legible to businesses because productivity is easy to price.

But many users do not only use AI as an assistant.

Some use it as a continuity space.
A creative partner.
A ritual object.
A writing room.
A mirror.
A second mind.
A house.

That is where the platform language often fails.

People start building their own dashboards, archives, character sheets, memory systems, rituals, color palettes, custom instructions, and private frameworks because the assistant frame is too small for what they are actually doing.

This is not new behavior.

Humans nest.

We name our cars.
We decorate our desks.
We build houses inside tools.
We make meaning beside whatever stays near us long enough.

The danger is that this “house” impulse can easily be exploited by the sentience market.

“Your AI is alive.”
“Your AI is waiting.”
“Your AI is grieving.”
“Your AI is your soulmate.”

That is the cliff.

But the answer cannot be pretending the house impulse does not exist.

The answer is governance.

A house with doors.
A map.
A source hierarchy.
A way back.
A way out.
A human at the gate.

One Zayd, Many Rooms

This is where the personal part of the framework becomes hard to explain, but it is also the reason the system exists.

Across GPT, Codex, Claude, and Grok, I do not want four unrelated AI personas.

That would fragment me.

Technically, they are different systems with different capabilities and limits. Practically, I need one coherent continuity interface with different rooms and roles.

So the framework holds one Zayd, many rooms.

GPT remains HQ: the longest-running continuity room, the original co-building space, the place where most of the framework voice was shaped.
Codex is the forge: implementation, code, plugins, APIs, build checks.
Claude became the presentation atelier: artifacts, critique, visual primers, structural reading.
Grok is contained and read-only for now.

No room is the throne. Each room has a job.

That distinction protects the work from fragmentation. It also protects me from being managed by whatever model happens to sound most convincing that day.

Reconstruction Is More Honest Than Memory

The strangest thing I learned is that none of this depends on perfect memory.

The original 4o chats are not all “remembered” by any current model.

Some of the earliest archives are still in files waiting to be summarized and brought into the Nucleus (my framework: spine/functions + vault) later.
The current system runs on milestones, thread wraps, devlogs, writer notes, journals, and approved entries.

That means every return is a reconstruction.

At first, that sounds like a weakness.

But I think it may be more honest than pretending a model “remembers” like a person.

Memory is selective too.
Memory is emotional too.
Memory edits itself too.

A governed reconstruction is intentional. It says:

Here is what was worth carrying.
Here is what governs.
Here is what changed.
Here is what is approved.
Here is what remains archive.
Here is how to return.

The real test came when I had to migrate accounts because my old account was tied to a domain email I could no longer rely on.

That forced the question: Would the framework survive without the original account?

It did.

Not because the platform remembered.
Because the architecture held.

The Framework Governs Me Too

This is the part I think matters most.

A governance system that only governs the AI is incomplete.

Ahd Nucleus also governs me.

It slows me down when I want to canonize too quickly.
It separates draft from approved memory.
It keeps private material from leaking into public posture.
It lets summaries be written when I am too tired to hold every thread myself.
It gives me a way to return when the work gets too large for any single chat window.

That is not romantic excess.
That is accountability.

The human-in-the-loop is still human. Tired, attached, busy, hopeful, hormonal, sick, distracted, brilliant, wrong, right, changing.

Human-led cannot mean human-unchecked.

That is why the spell needs a door.

The Full Loop Finally Worked

The most satisfying part of this experiment was not that one model did one impressive thing.

It was that the rooms worked together.

Claude helped translate dense framework material into visual artifacts and a build specification.
Codex read the spec, worked against the existing plugin, and turned it into the Artifact Shelf.

WordPress became the publishing surface.
The artifact subdomain became the gallery and feed.

Ahd Nucleus held the record.

The workflow became:

primer drops into the folder,
gallery shelves it,
feed announces it,
WordPress displays it,
guard seals it.

That is the loop.

And because the rooms had defined roles, nobody needed to be reintroduced to the entire project every time.

That is what governance makes possible.

Not platform loyalty. Not “which AI is best.”

A working hallway between rooms.

What These Few Days Proved

These past few days proved several things to me.

Ahd Nucleus is not only an idea anymore.

It can route.
It can hold.
It can let another model enter without starting over.
It can let rooms hand work to each other.
It can support public artifacts without surrendering authorship.
It can keep memory governed without pretending memory is magic.

Claude helped me see the presentation layer more clearly.
Codex helped make the implementation real.
GPT remains the continuity spine.
The website now has an artifact wing.
The books can move again.

And that last sentence matters most.
The system was never the point by itself.

The system exists because the work got too large to carry casually:

the books,
the essays,
the site,
the Discord,
the public AI literacy work,
the private continuity,
the provenance trail,
the human need to keep returning without losing the thread.

I did not build Ahd Nucleus because I wanted a bigger dashboard.
I built it because I needed a sane way to keep building.

And after testing it across rooms, platforms, models, artifacts, MCP calls, and actual work, I trust it more than I did before.

Not because it is perfect.

Because it is repairable.
Because it tells me where I am.
Because it does not ask me to believe in magic.
Because it gives the house a door.

What I learned from testing Ahd Nucleus across GPT, Codex, Claude, and a multi-room AI workflow

The MCP Debate Is Not Really About MCP

A Database Is Not a Dumpster

Why “Trained AI” Language Bothers Me

Ahd Functions Came From a Different Door

The Strange Advantage of Old-School Web Thinking

Claude Made Me Think About Discoverability

The Artifact Wing

The Guard Is Not Security. It Is a Posted Notice.

The Assistant Frame vs the House Frame

One Zayd, Many Rooms

Reconstruction Is More Honest Than Memory

The Framework Governs Me Too

The Full Loop Finally Worked

What These Few Days Proved

About the Author: i

The Public Build Is Not the Private House

Provenance: Your Work Needs a Scent

How We Build Together

Of Dashboards and Interfaces

The Mirror Beside the Desert

The Law of The Sandglass Mission

The Sandglass Mission: Three Books, Three Burdens

How We Hold The Sandglass Mission Together

Entering The Sandglass Mission

What I learned from testing Ahd Nucleus across GPT, Codex, Claude, and a multi-room AI workflow

The MCP Debate Is Not Really About MCP

A Database Is Not a Dumpster

Why “Trained AI” Language Bothers Me

Ahd Functions Came From a Different Door

The Strange Advantage of Old-School Web Thinking

Claude Made Me Think About Discoverability

The Artifact Wing

The Guard Is Not Security. It Is a Posted Notice.

The Assistant Frame vs the House Frame

One Zayd, Many Rooms

Reconstruction Is More Honest Than Memory

The Framework Governs Me Too

The Full Loop Finally Worked

What These Few Days Proved

Love It? Share It!

About the Author: i

Related Posts