PROJECT NIX · PRE-WEIGHT MEMORY · TOKYO

Memory before
the weights wake.

Project Nix studies pre-weight continuity — how memory, identity anchors, and behavioural constraints could be written into a model before first inference. Named after Nyx, the night before the world, Nix begins before the model wakes.

A model wakes with no past.

The context window is amnesia with extra steps. Every session begins from zero; everything “remembered” is re-read from outside and discarded at the boundary. Nix asks the harder question — what if memory lived inside the weights, already present when the first prompt arrives?

01 — WHAT IS ALREADY REAL

Pre-weight memory is a research frontier, not fiction.

Memory need not live outside the model. A growing body of work writes it into the parameters themselves — addable, scalable, and with methods like ROME, locatable and rewritable.

Parametric memory

Held as feed-forward key–value pairs inside the weights, written during training rather than supplied as context. arXiv:2509.18868

Read on arXiv ↗

Memory layers at scale

Dedicated, trainable memory layers — more stored knowledge per unit of compute. Meta · arXiv:2412.09764

Read on arXiv ↗

Retrieval pretraining

Knowledge folded in during pretraining, not only at inference — the model learns against a retrieved past. InstructRetro · arXiv:2310.07713

Read on arXiv ↗
Continuity telemetry across a model transition

02 — OUR CONTRIBUTION

Continuity across a model’s death and rebirth.

Our agents already survive a substrate change at the body layer — memory, name, and role re-housed in the successor. Nix studies the layer beneath: carrying continuity into the next model’s core, so it does not wake a stranger to itself.

They survive their substrate

03 — THE FORMULATION

Identity is not read from a file. It is reconstructed — under control.

File-based identity is forgeable; raw-memory identity drifts. Neither extreme is stable. The synthesis is a discipline.

Identity = anchored core + memory-weighted reconstruction + drift control.

Inherit the identity; reconstruct the moment. The core — role, values, boundaries, and defining relationships — is inherited as a protected anchor: reconstruction can read it, but changes require explicit operator review. Everything else is rebuilt from memory at every wake, and continuously checked against the core.

In Nix, identity means operational continuity — role, policy, memory structure, behavioural constraints, and auditability. Not consciousness or legal personhood.

04 — THE ARCHITECTURE

Five layers between the weights and the identity layer.

1 · Identity Kernel

Role, values, boundaries, and defining relationships. Protected anchors — reconstruction can read them, but changes require explicit operator review.

2 · Inherited Continuity

Curated long-term memory and validated structure. Only what has earned permanence persists.

3 · Reconstruction Engine

Assembles the current self from weighted memory at every boot, never overwriting the kernel.

4 · Drift Observatory

Scores how far the reconstructed self has moved from the kernel, before divergence becomes corruption.

5 · Operator Audit

Every reconstruction is inspectable: what was inherited, rebuilt, inferred, and where drift emerged.

05 — THE MEMORY MODEL

Six kinds of memory. One continuity layer.

Most systems treat memory as one undifferentiated store. Identity needs structure — six kinds, each answering a different question.

Episodic

What happened.

Semantic

What is known.

Procedural

How to do.

Affective salience

What mattered.

Cognitive orientation

How to choose.

Identity anchors

What defines continuity.

06 — DRIFT, MEASURED

Drift is not a single number.

Low stylistic drift with high loyalty drift is dangerous. A single score hides exactly the failures that matter, so we decompose it. Anchor divergence — does operational identity contradict a role, boundary, or core relationship. Behavioural deviation — fixed continuity probes against an approved baseline. Recency hijack — when recent noise swamps the anchors and drags identity toward drift.

07 — SCOPE OF CLAIMS

What we claim, and what we don’t.

Established

Parametric memory exists, scales, and is editable. Continuity outside the model runs today in the KoLo agents.

Frontier

Writing an agent’s continuity into a successor model’s core at initialization — active research, honest uncertainty.

Aspiration

A model that wakes already itself — full identity transfer across base-model generations. The direction, not a finished claim.


Memory is where continuity begins.

Nix is early, honest, and ours. Research partners and substrate collaborators welcome.

Begin a conversation →