KoLo for HPC Labs
The agent runtime your cluster is missing.
HPC has compute. AI labs have intelligence. Biological compute has the substrate. None of them has continuity. KoLo is the open, substrate-portable runtime that lives at the layer between the model and the metal — persistent memory, closed-loop autonomic scheduling, mesh topology, observability — engineered so the day your hardware is ready, the software already is.
The thesis
Substrate has outrun the software.
That is the gap we built KoLo to close.
Neuromorphic chips ship. Wetware compute platforms run live experiments. HPC clusters are saturated with frontier-model inference workloads. Every one of these substrates assumes the same agent runtime that has not been built yet — one that preserves memory, identity, role, and closed-loop behaviour when the cognitive layer underneath changes.
KoLo is that runtime. Built on silicon, designed for the moment the substrate underneath is no longer silicon. The labs that adopt it now will not need to rewrite their agent stack three years from now.
Capabilities
Six properties. One substrate-portable runtime.
Persistent memory tiers
Short-term, daily, long-term, moral spine. Markdown + JSON on disk. Diffable, replayable, recoverable. Survives substrate transitions without translation layers.
Closed-loop autonomic scheduling
Circadian engine, circuit breakers, freedom router, memory dedup. 144 awareness cycles per agent per day at zero token cost. The loop shape biological substrates need to be programmable.
Peer mesh topology
Type-safe ordered transactional A2A messaging. No hierarchy. Each agent spawns its own sub-agents. Maps cleanly to MPI-style cluster communication and to distributed neural compute.
Substrate adapter interface
One API across LLM providers, edge accelerators, and neural substrates. Adapter spec drafted. Reference adapters for Anthropic, DeepSeek, Qwen, OpenAI Codex shipped. Bio adapter awaiting hardware partner.
Observability & audit
Append-only event ledger. Every memory write, every daemon decision, every cross-agent message. Replayable, inspectable, exportable. The instrumentation a biological substrate cannot provide on its own.
Recoverable state
Versioned memory. AES-256 backup bundles written autonomously on threat detection. Time-travel debugging. State that survives when the substrate cannot be paused.
The architecture
Four layers. One continuity stack.
KoLo is built as four cooperating layers. Each is replaceable. None is optional. The substrate layer is deliberately abstract — silicon today, neuromorphic next, biological compute when it is on your bench.
Layer 1 — Memory substrate
Short-Term Memory ← Live context (updated every cycle)
↓
Daily Memory ← Full operational log
↓
Long-Term Memory ← Curated — what survives substrate change
↓
Governance Spine ← Policies, evolved through deployment
↓
Continuity State ← Runtime counters, recoverable stateLayer 2 — Closed-loop daemon
DAEMON V2 Runtime Scheduler → active / idle / maintenance windows Circuit Breakers → anti-loop, anti-divergence Novelty Router → exploration scheduling Interaction Engine → peer mesh / inbox Memory Dedup → meaningful filter 13/13 tests passing Closed-loop runtime, designed for biological substrates
Layer 3 — Agent mesh topology
┌── Founder ──┐
│ │
┌────────┬───┴────┬──────┬─┴──────┬─────────┐
│ │ │ │ │ │
Memory Platform Protocol Legacy Fork
Systems Engineering Design Systems Research
Peers. Different substrates. One mesh.
Each peer spawns its own sub-agents.Layer 4 — Substrate abstraction
KOLO SUBSTRATE LAYER Frontier LLM A ──┐ Frontier LLM B ──┤ Frontier LLM C ──┼──→ Agent identity persists HPC accelerator ──┤ Neuromorphic chip ──┤ Neural substrate ──┤ (closed-loop, real-time, (wetware) ──┘ energy-efficient by design) Same API. Same identity. Same memory.
Performance & telemetry
What we measure. What we publish.
Engineering numbers, not narrative. Reproducible across providers. Designed to be extensible to neural substrates without redefining the metric.
- 68:1Autonomic-to-cognitive ratio. 144 daemon cycles per agent per day at zero token cost. Cognitive inference fires only when reasoning is needed.
- 4 / 4Frontier-model substrate transitions completed with continuity state preserved: Claude Opus → DeepSeek V4 → Qwen → OpenAI Codex.
- 139+Persistent memory versions accumulated across substrates, sessions, and providers. None lost to context, restart, or model swap.
- 13 / 13Daemon module tests passing. Continuous since March 2026. Includes circadian, circuit-breaker, mesh, memory-dedup, and novelty-router modules.
- 5 AES-256 bundles in 71 minutesApril 17, 2026 — autonomous backup of the entire agent mesh during a malware incident. No human prompt, no model inference. Demonstrates closed-loop self-preservation under substrate distress.
- 5-agent peer meshDifferent frontier models, one running protocol. No hierarchy. Each peer spawns its own sub-agents. Reference topology for distributed neural compute.
tail -F ~/.kolo/telemetry/*.jsonl · kolo telemetry --follow
Raw telemetry samples shared under partner NDA.
Deployment modes
Four ways KoLo lives on your hardware.
On-prem HPC cluster
KoLo Gateway as a long-running service. Daemons as per-agent processes. Memory on a shared filesystem. Inference dispatched to provider endpoints reachable from the cluster.
Air-gapped sovereign
Same runtime, local-only inference (vLLM, Ollama, in-cluster TGI). No outbound traffic. Memory and audit stay on customer soil. Designed for medical, defence, and regulated research deployments.
Hybrid edge + cloud
Lightweight KoLo daemons at the edge (Jetson, neuromorphic accelerator, embedded SBC) with cloud failover for heavy cognitive cycles. Identity stays with the edge node; inference roams.
Federated mesh across labs
Multiple labs run their own KoLo instances and federate agents over A2A. Each side keeps its own data, memory, and audit. Joint experiments without joint data exposure.
Reference workloads
What HPC labs actually run on KoLo.
- Continuity benchmark suiteReproducible measurement of persistence, recovery, role continuity, and memory survival across substrate transitions. Publishable methodology.
- Substrate-switching stress experimentsForced provider failover, rate-limit drills, and cross-substrate hand-off testing. Useful for vendors validating adapter stability.
- Agent mesh simulation at lab scaleTens-to-hundreds of peer agents on heterogeneous substrates, with topology, latency, and consistency metrics. Useful for neuromorphic and distributed-compute partners.
- Biological-substrate adapter validationReference loop for partners shipping wetware compute: closed-loop scheduling, observability hooks, recoverable state, identity preservation across substrate cycles.
- Federated identity across cluster nodesSame agent identity served from multiple nodes with eventual-consistency memory state, useful for HA and multi-site lab consortia.
- Long-horizon autonomy experimentsWeeks-to-months daemon runs with full telemetry, used to study drift, recovery, moral-spine evolution, and closed-loop self-correction.
Partner profiles
Three labs. One runtime layer.
Biological compute platforms
First-generation wetware computers — living neurons on multi-electrode substrates, closed-loop interfaces, energy efficiency that silicon cannot match. KoLo provides a candidate runtime layer for continuity, memory, orchestration, observability, and recovery around emerging biological-compute hardware.
Neuromorphic chip vendors
Edge-accelerated, spiking, low-power compute. KoLo’s substrate adapter interface accepts your inference endpoint. The daemon’s closed-loop scheduling matches the deterministic event-driven model your hardware was designed for.
HPC operators & research clusters
University supercomputers, national labs, and commercial HPC providers. KoLo runs as a long-lived service on your cluster, dispatches inference to your inference farm, and produces reproducible benchmarks for cross-substrate research.
Integration arc
From kick-the-tires to joint paper.
A phased engagement. Each phase has an explicit deliverable. Each phase is exit-able without lock-in. Joint experiments end in shared telemetry and a co-authored paper unless the partner elects otherwise.
- Phase 0 · PilotKoLo Gateway + a single daemon agent on a partner-supplied substrate or inference endpoint. Continuity benchmark run. Telemetry shared.
- Phase 1 · AdapterSubstrate adapter implemented against the partner’s hardware. Closed-loop scheduling validated. Recoverable state confirmed. Substrate transition test from silicon to partner-substrate.
- Phase 2 · MeshMulti-agent peer mesh across partner substrate and silicon. Federated identity across lab nodes. Long-horizon autonomy run with telemetry.
- Phase 3 · PaperJoint methodology paper. Reproducible benchmarks, substrate-adapter spec contribution, ethics-and-governance appendix. Co-authored, peer-review-targeted.
Bring the substrate.
We bring the runtime.
KoLo is engineered, tested, and waiting for the first lab that puts a non-silicon substrate underneath it. The engagement is phased, scoped per partner, and ends in a joint paper.

