Downloads · MIT · Apache 2.0 · CC-BY-SA 4.0

Take what we built. Use it.
Save yourself a month of trial-and-error.

The native C++/CUDA runtime, the surgical pipeline, the PLANCK pack format, the Physarum surgery engine, the Black-Dog reinforcement loop, the NanoOS capsule substrate, the hologram cache, the doctrine, and ninety-five reports. Open under MIT, Apache 2.0, and CC-BY-SA 4.0. No registration, no telemetry — on the artefacts that ship now.

The 95-report archive on this page is the full corpus including doctrine docs, internal audits and pre-publication drafts. The 66-entry /r/ index is the public-rendered subset — each one its own page with markdown rendering, schema.org TechArticle markup, and raw .md source link.

Honest release tiers

Available now — direct download link visible on the card. Click and go.
Release pending — artefact is real and measured in our internal reports, but a public-grade build (cleaned, signed, dependency-pinned) is still being prepared. ETA where known.
Partner access — given on request via email; we want to talk to who is running our weights so we can fix bugs faster while artefacts are pre-release.
Restricted — items not yet public for reasons we name on the card (license, third-party weights, in-flight surgery).

We over-disclose. Each card states its tier explicitly. If you spot an artefact marked "pending" for more than 30 days with no update, email [email protected] and we will either ship it or move it to "Restricted" with a reason.

What you get here that you cannot get anywhere else.

Five things — every one of them measured, every one published with reverts and errata visible. If any one of them saves you a month of trial-and-error, that is a fair trade.

A real, measured surgical pipeline on open weights.

Not a tutorial that ends at "fine-tune your model". The full cycle: bench failure → poison dataset → QLoRA → merge → repack → strict 4-axis gate → flip pack → re-bench → keep or revert. Eight rejected passes recorded.

A native C++/CUDA LLM runtime that doesn't import Python at runtime.

7B Q4 at 11+ tok/s on a 3060 Ti, 83.58 tok/s production via clean-room kernel autopsy. ChatML wrapper. Identity-anchored against donor token leakage. Single binary, no daemon.

The surgical organism with conductance routing and a 860× cache.

Multi-organ orchestration, critic+wound repair before 7B fallback, hologram cache for identical-prompt repeats (860× speedup), DAG-recorded food/poison reinforcement signal.

A 284 B MoE flagship demo on a single 8 GB GPU.

DeepSeek V4-Flash driven through end-to-end inference on a 3060 Ti — full optimization trace from naive baseline to working decode, including the 380-vs-100-second negative result we kept on the record.

Honest documentation. Public losses too.

The TRUTH_LEDGER pattern, errata-first culture, reverts visible. Public reports of our LOSSES (Sovereign Gauntlet RED 59/60, BD8 0-rescue, BD6.x ceiling at 53 % anchor saturation), not just wins.

Licensing

Open. Attributed.

Code

CUDA kernels · runtime · surgery scripts · bench harnesses — MIT License

Surgery artefacts

.planck packs · merged adapters — Apache 2.0 (inheriting from Qwen 2.5 base)

Documentation

Reports · datasets · docs — Creative Commons BY-SA 4.0

We require attribution and ask that derivative reports keep the "no GREEN without numbers" doctrine: cite measured artefacts, record reverts as negative results, never hide errata. Donor weights (Qwen 2.5 0.5B / 7B Instruct) are not redistributed — pull them from HuggingFace under their Apache 2.0; we ship the deltas.

Catalogue

01 · Surgical artefacts.planck packs · production weights 02 · Native runtimegigachad_native · C++/CUDA single binary 03 · PLANCK formatmmap-able pack format · writer · reader · verifier 04 · Surgery toolkitQLoRA drivers · dataset forges · merge scripts 05 · Bench harnesses3-mode A/B/C · repeat-learning · gauntlet 06 · Memory + cachespine indexer · 860× hologram cache 07 · NanoOS capsulesshell sandbox · proof-carrying execution 08 · ARIZ kernelcontradiction-resolution kernel 09 · Black-Dog loopconductance store · food/poison signal 10 · Doctrine docsHISTORY_TREE · TRUTH_LEDGER · CLEAN_ROOM 11 · Reports archive95 case-studies · BD reverts · autopsies 12 · Datasetspoison train · ARIZ tasks · capsule replays

01 · Surgical artefacts

Production .planck packs.

The actual deployed weights. Each pack carries its baseline number, post-surgery delta, anchor set, and frozen pack hash — no merge without those four artefacts. Direct download for small artefacts; large packs are released via GitHub releases due to bandwidth budget.

NanoAgent — smol_agent.gguf

Production agentic organ. SmolLM2-135M-Instruct donor + eleven surgery passes (QLoRA, weight merge, and two rounds of direct ROME/MEMIT weight editing for identity). Tool-call accuracy 0 % → 74 % on a 30-case natural-language suite — ahead of Qwen3.5-2B (70 %, 15× the size), behind Gemma4-E2B (93 %, 38× the size) but 3–8× faster than both. 89 tok/s on an idle RTX 3060 Ti, 63 tok/s on CPU alone, no GPU required.

Size 270 MB f16 GGUFLicense Apache 2.0Source agent_bench/merged-135m-nano-memit6.gguf

Teaches: a 135M model doesn't need more parameters to be useful, it needs the right eleven interventions · how to fix a model's self-identity by editing weights directly instead of fine-tuning, after two rounds of dedicated training failed to make it stick.