Documentation · Version log

Version changelog

Every patch of the hub + website, newest first. Click a version to expand it — the current build is already open. Sub-letter bumps (e.g. the whole v0.7.8z series) are concatenated into one entry per letter-series rather than listed individually.

v.Chronos.Marsyas.0 · The H.1.x benchmark/scoring rehaul arc (legacy vH.1.23)

Ancient Holdings — Changelog

This log groups every shipped patch under the Genesis phase that contained it. Each phase has a codename (Cassandra, Prometheus, Pythagoras, Hydra, Medusa, ...) and a pinned version range; everything that landed inside that range belongs to that phase. The newest phase appears first.

Inside each phase you'll find the mission, a short list of headline features, optional operator notes, an audit link when the phase has a published report, and a collapsible patch log with every numbered patch ordered newest-first.

The pre-Genesis legacy log (version-grouped, one entry per patch) lives in CHANGELOG.legacy.md — it is preserved verbatim for historical reference but no longer updated.

▸Gluon (“the orderless maintenance era”)orderless · by KIND, not by sequence · 4 buckets · 7 maintenance entries (date-sorted for display only)

▸Aegis· v0

security/hardening

▸v.Gluon.Aegis.0· Genesis· 3 entries

v0.5.5
v0.7.11c
v0.7.11b

▸Atlas· v0

deps/infra

▸v.Gluon.Atlas.0· Genesis· 0 entries

▸Angelia· v0

deploy/CI/release

▸v.Gluon.Angelia.0· Genesis· 1 entry

v0.6.0

▸Lethe· v0

cleanup/refactor-glue

▸v.Gluon.Lethe.0· Genesis· 3 entries

v0.5.6
v0.7.11a
v0.7.11

▸ChronosRelease C · 13 entries

▸Marsyas· v0

Expose the hub's existing chainweb fleet to Pythia as a metered service — Pythia reads the fleet as IP-keyed slots, the hub records the reads Pythia routes through them, and each slot self-heals via per-IP failover, with an operator-reward scaffold staged inert until an Ancient admin arms it.

▸v.Chronos.Marsyas.0· Genesis· 0 entries

The hub now lends its chainweb fleet to Pythia as a metered service.

POST-to-read node listing.

Store-as-it-comes usage metering.

Autonomous per-IP failover.

Bounded forfeit/retry accrual.

Per-container Eye badge.

Inert operator-reward scaffold.

Published integration how-to + reality-refresh.

▸Arachne· v0

Turn the hub into a central OpenID Connect / SSO identity provider so many consumers (Pythia first) can thread their logins through one identity fabric instead of each re-implementing auth against the hub.

▸v.Chronos.Arachne.0· Genesis· 0 entries

The hub is now a central OpenID Connect / SSO identity provider.

Central OpenID Connect provider.

Opaque, stable subjects.

Ancient-gated client registry.

The authorization-code flow.

The id_token claim set + ancient gate.

CORS, rate limit, break-glass.

Pythia wired end-to-end.

Published integration how-to + consumer harness.

▸Narcissus· v0

Rehaul the pool payout so a miner is paid ONCE from the true cross-chain reflection of the pot — consolidate every chain into the pot before paying, pay a single bulk amount no matter which chains the blocks were mined on, and never pay more than the reconciled pot actually holds.

▸v.Chronos.Narcissus.0· Genesis· 0 entries

The mining pool now pays each miner ONCE from the true cross-chain state of the pot: before any payment is computed the pool sweeps every miner-bearing chain back into the coinbase pot, then makes a single bulk payment no matter which chains a miner's blocks were mined on, opens and funds a brand-new payout account in the same run when needed, and a reconciliation guard makes sure the run can never pay out more than the pot actually holds.

Cross-chain consolidation sweep.

Single bulk threshold-gated payment.

New-account create-and-fund.

Never-overpay reconciliation guard.

Operator-armed daily run.

Operator preview/control panel.

> **Coordinate minted.** This phase landed the full hub-integrated StoaChain

▸Galatea· v0

Prepare the Stoicism economy for its real launch: make going live an irreversible clean-slate event, let an Ancient admin lock it down, and stamp every mint with its test-or-live provenance.

▸v.Chronos.Galatea.0· Genesis· 0 entries

Going live is now a real launch event: flipping the Stoicism scoring from shadow to live hard-resets the whole ledger so the live economy starts from zero, an Ancient admin can lock the live state so it can't be accidentally reverted and wiped, and every Stoicism-Minter fire is stamped TEST or LIVE so anyone can tell the rehearsal mints from the real ones.

Codex drop-in (ouronet-codex 0.5.x).

Codex Cronotons.

Adaptive gas floor (stoa-core 4.3.6).

Shadow→live HARD RESET.

Tiered go-live warmup.

Live-state lock.

TEST/LIVE mint provenance.

Public-surface parity.

▸Caduceus· v0

Found the sovereign-Ouronet hub-codex + manual-tx + cronoton rail that lets the hub act as the herald carrying signed transactions between the off-chain control plane and the on-chain Pact execution layer.

▸v.Chronos.Caduceus.0· Genesis· 0 entries

The hub can now hold an encrypted Ouronet codex, derive a Prime Codex Seed + CodexPrime account from a BIP-39 mnemonic, sign Stoa Pact transactions against that codex from server-side code, back up + delete the whole codex from an admin UI, and re-encrypt every codex-blob write under the hub master key.

1-a — server foundation.

1-b1 — page shell + bootstrap landing.

1-b2 — active bootstrap flows.

1-c — entity display.

1-d — backup + delete.

1-e — rip Caduceus.0.

1-f — this commit.

Cronoton handler wiring.

Manual transaction console.

Rotation modals.

Add-seed / add-ouro-account flows.

Import from V1.2 JSON.

Pure keypairs + address book CRUD.

v.H.1.19

▸Charon· v10

Carry the bench off the fragile long-lived SSH session onto a detached push-primary transport that NAT cannot kill.

▸v.Chronos.Charon.10· 7 entries

Every score in the hub is now computed once on the server and shown identically on every page

The single-source score program is complete.

The benchmark score card no longer runs its own competing colour system.

No two code paths produce a score entity's headline number or colour independently.

v.H.1.18c

v.H.1.18b

v.H.1.18a

▸v.Chronos.Charon.9· 4 entries

"EarnScore" now means one thing everywhere

One canonical EarnScore.

EarnScore is the take-home capacity score everywhere.

The earnings page's live accrual rate is a separate, clearly-labelled figure.

▸v.Chronos.Charon.8· 4 entries

A real node's score and colour are now computed once on the server and rendered identically everywhere.

One canonical computation of a real node's display.

A real container with an incomplete disk benchmark is now orange everywhere.

The three duplicate live-score wrappers collapse to one.

▸v.Chronos.Charon.7· 4 entries

A virtual prime's score and colour are now computed once on the server and rendered identically everywhere.

One canonical computation of a virtual prime's display.

The Argus and Triton panels can no longer disagree.

The per-server card stopped recomputing.

▸v.Chronos.Charon.6· 6 entries

The per-server benchmark page now shows a virtual prime's red score in agreement with Argus, and every server always has at least one eligible disk.

Per-server virtual-prime redness now matches Argus.

A server always has at least one eligible disk.

The "↻ Force-fresh re-bench" button now actually forces a rebench.

The Mount Capacity card is readable.

v.H.1.14a

▸v.Chronos.Charon.5· 5 entries

A virtual prime that can't run a single chainweb container now shows red, and the Argus Status column is a clean gold ✓ / red ✗ icon with a one-click rebench.

A 0-commitment virtual prime renders RED.

Argus Status column is an icon.

Every Status-cell rebench is a forced rebench.

v.H.1.13a

▸v.Chronos.Charon.4· 6 entries

Bulk bench-all skips servers it cannot reach, the Triton panel stops mislabelling broken scores, and Bench-all no longer asks for a password.

Bench-all skips unreachable hosts.

Real score instead of "missing score".

No "✓ current" on a broken score.

Score column live-updates after a bench.

Bench-all uses a lightweight confirm.

▸v.Chronos.Charon.3· 6 entries

The Stoicism page now shows which servers actually earn, which are warming up, and which earn nothing

Phantom-pending fix.

Red/orange/gold earning-state model.

EarnScore is the headline metric.

Entities reordered to PERSONAL SERVER POOL order.

De-noised rate display.

▸v.Chronos.Charon.2· 21 entries

Stoicism-score display rework on /hub/nodes/

New CONTAINERSCORE column

ServerScore is now the sum of non-red (gold + orange) containers

EarnScore = sum of GOLD-only containers

Sync + flags gate

v.H.1.10l

v.H.1.10k

v.H.1.10j

v.H.1.10i

v.H.1.10h

v.H.1.10g

v.H.1.10f

v.H.1.10e

v.H.1.10a

v.H.1.10d

Tunneler fee badge.

Explicit ServerScore.

Explicit EarnScore.

Badges never wrap to two lines.

ServerScore shows an explicit 0 when every container is red.

EarnScore is a per-container column.

▸v.Chronos.Charon.1· 33 entries

Restored the Triton Panel, folded fleet rebench into Fleet Distribution, server-grouped Disk Entities, added per-row host-reachability bullets, and a live Bench Progress Panel.

The 4-tab admin panel is the "Triton Panel" again.

Whole-fleet rebench folded into Fleet Distribution.

Disk Entities are server-grouped.

Per-row host-reachability bullet that gates rebench.

A live Bench Progress Panel.

v.H.1.9aa

v.H.1.9z

v.H.1.9y

1-a

Client-bundle `node:net` leak fix.

1-b

Reachability bullet now reflects hub→host SSH access.

1-c

Disk-bench salvaged scoring, orange "incomplete" state, auto-refresh, and force-rebench.

1-d

Changelog per-forest-ordinal tiers plus renamed-codename unification.

1-e

Disk-bench UX

1-f

Disk-bench batch: auto-select fix, golden/red "bench this disk" gating, and virtual-prime golden disk-entity score capture.

1-g

Virtual-prime reflects the picked disk's golden score live

1-h

Fixed the `node:fs` client-bundle leak in the virtual-prime view by splitting the shared `SAFETY_MARGIN_GB` constant into a server-import-free leaf.

1-i

Full disk-raw-observations breakdown on the Disk Entities Stoicism card, auto-clearing bench-enqueued banner, two-button + mismatch-class-coloured Real Container rows, and the Stoicism Eligibility "Force-fresh re-bench" now uses the shared force-rebench wiring.

1-j

Suppressed the redundant Force Rebench button on red Disk rows, fleet-wide canonical disk-entity score capture for virtual primes, and a slice version-stamp touchup on the version-gold skip path so a stuck red row no longer self-perpetuates.

1-k

Cross-surface score convergence for chainweb-container entities (Medusa / Triton / Argus now read the same canonical recompute), Triton tab-switch resets selection to the new tab's first entry, six-decimal score precision in the Triton Panel, and a stronger orange token for incomplete disk scores.

1-l

Unified score-color propagation everywhere (golden / orange / red, yellow reserved for Earnscore), Triton virtual-prime entry-list converges to the canonical-disk projection (third surface paired with the per-node card + Argus fleet), dd-read fallback + WHICH/WHY tooltip enrichment for /dev/md* mdraid devices, and a verification pass on the Earnscore → tunneler-fee zero-gate (already correct on both worker `accrueTip` and display `computeEarningScore` paths

1-m

Finished the score-color propagation: every composite-score render surface now reads from the 5-subscore SET (cpu/net/ram/disk/commitment) instead of the array-of-one collapse Charon.1-l left behind

1-n

Composite score color is now computed ONCE server-side per entity and threaded onto every payload, every render surface consumes that single string

1-o

Virtual-prime DISK · RAW OBSERVATIONS table now sources its per-step rows from the same canonical disk-entity bench the DISK subscore tile reflects

1-p

Disk-bench per-step `normalised`/`contribution` now route through `linearRatio` (uncapped

1-q

Disk RAW OBSERVATIONS table NORMALISED column carries the raw uncapped ratio and CONTRIBUTION the actual value that lands in the score, and the autonomic-commit engine reads the same free-bytes source the Drive Breakdown panel displays so the per-container sum equals "Actual maximum possible" exactly.

1-r

Operator-corrected revert of the q column swap (NORMALISED carries the score-bound value in gold, CONTRIBUTION carries only the integer shares annotation), TOTAL row publishes the subscore plus a "~N% baseline" line, all benchmark values render in European format (comma decimal, dot thousand) with raw at 4 decimals and scores at 6, and the detail-panel grid stays uniform across CPU/DISK/NET/RAM tabs.

1-s

Disk-only entity card + virtual-prime card now re-derive the per-row NORMALISED + CONTRIBUTION values from the persisted raw measurements at READ time so pre-Charon.1-p bench rows display the current linearRatio output without a force-rebench, and the same uncapped values the slice card already showed render byte-identical on every view.

1-t

Disk subscore now equals the sum of its per-row contributions by construction

1-u

Disk-score peripheral surfaces

1-v

The legacy `host_drive_benchmarks.raw_score_unscaled` column is dropped (migration 090) now that every live read path derives the disk subscore from the per-sub-test raw measurement columns via the Samsung-baseline helper

1-w

Per-region download baseline `BASELINE_NET_DOWNLOAD_MBPS_FULL: 50 → 100 Mbps` (the operator-driven 50 Mbps anchor compressed the top of the curve too much

1-x

Bench-this-container feedback now auto-clears + score auto-updates when the bench actually finalizes.

▸v.Chronos.Charon.0· Genesis· 39 entries

The actual push-primary architecture migration.
The actual root cause.
One-character fix to v.H.1.0ai's bench-script curl helpers: `-sf` → `-sfL`.
Thirty-fifth quick patch on the Hipparchus arc
Thirty-fourth quick patch on the Hipparchus arc
Thirty-third quick patch on the Hipparchus arc.
Thirty-second quick patch on the Hipparchus arc.
Thirty-first quick patch on the Hipparchus arc.
Thirtieth quick patch on the Hipparchus arc.
Twenty-ninth quick patch on the Hipparchus arc.
Twenty-eighth quick patch on the Hipparchus arc.
Twenty-seventh quick patch on the Hipparchus arc.
Twenty-sixth quick patch on the Hipparchus arc.
Twenty-fifth quick patch on the Hipparchus arc.
Twenty-fourth quick patch on the Hipparchus arc.
Twenty-third quick patch on the Hipparchus arc.
Twenty-second quick patch on the Hipparchus arc.
Twenty-first quick patch on the Hipparchus arc.
Twentieth quick patch on the Hipparchus arc.
Nineteenth quick patch on the Hipparchus arc.
Eighteenth quick patch on the Hipparchus arc.
Seventeenth quick patch on the Hipparchus arc.
Sixteenth quick patch on the Hipparchus arc.
Fifteenth quick patch on the Hipparchus arc.
Fourteenth quick patch on the Hipparchus arc.
Thirteenth quick patch on the Hipparchus arc.
Twelfth quick patch on the Hipparchus arc.
Eleventh quick patch on the Hipparchus arc.
Tenth quick patch on the Hipparchus arc.
Ninth quick patch on the Hipparchus arc.
Eighth quick patch on the Hipparchus arc.
Seventh quick patch on the Hipparchus arc.
Sixth quick patch on the Hipparchus arc.
Fifth quick patch on the Hipparchus arc.
Fourth quick patch on the Hipparchus arc.
Third quick patch on the Hipparchus arc.
Second quick patch on the Hipparchus arc.
First quick patch on the Hipparchus arc, following the letter-suffix convention introduced at v.G.1.1 Cerberus.
The benchmark-score rehaul.

The actual push-primary architecture migration.

Pillar A — DB migration `082_bench_token_script_body.sql`

Pillar B — `pages/api/bench-script/[token].ts`

Pillar C — `lib/wait-for-bench-finalize.ts`

Pillar D — `lib/bench-tokens.ts`

Pillar E — `lib/handlers/benchmark-node.ts` rewired.

Pillar F — `buildBenchScript` finalize trap

Pillar G — `lib/hub-config.ts`

Patch log (1 entry — H.1.0al)

The actual root cause.

Pillar A — `client.connect(...)` in `lib/ssh.ts:newClient` gains `keepaliveInterval: 15_000` + `keepaliveCountMax: 6`.

Pillar B — regression guard test

Patch log (1 entry — H.1.0ak)

One-character fix to v.H.1.0ai's bench-script curl helpers: `-sf` → `-sfL`.

Bench script `__ancient_phase_done` curl gains `-L`

Bench script `__ancient_finalize` curl gains `-L`

Regression guard test

Patch log (1 entry — H.1.0aj)

Thirty-fifth quick patch on the Hipparchus arc

Pillar A — DB migration `081_bench_push_progress.sql`

Pillar B — `lib/bench-tokens.ts`

Pillar C — `lib/bench-progress-events.ts`

Pillar D — `POST /api/bench/progress`

Pillar E — `POST /api/bench/finalize`

Pillar F — `lib/handlers/benchmark-node.ts` handler wiring

Patch log (1 entry — H.1.0ai)

Thirty-fourth quick patch on the Hipparchus arc

Pillar A — `lib/handlers/benchmark-node.ts:1471` reverts to the v.H.1.0ac shape.

Pillar B — `tests/integration/benchmark-node-uses-detached-shell.test.ts` deleted.

Patch log (1 entry — H.1.0ah)

Thirty-third quick patch on the Hipparchus arc.

Pillar A — chunked start-phase upload in `runRemoteDetachedShell`

Patch log (1 entry — H.1.0ag)

Thirty-second quick patch on the Hipparchus arc.

Pillar A — `[DETACHED-SHELL]` diagnostic logging at every step of `runRemoteDetachedShell`

Pillar B — script body transferred via base64 instead of heredoc

Pillar B cont. — `#!/bin/bash` shebang prepended when absent + explicit `bash ${scriptPath}` invocation

Pillar B cont. — start command verifies `START_OK` beacon before entering poll loop

Pillar C — fast launch-failure exit

Pillar D — `[BENCH-NODE]` diagnostic logging around the `runRemoteDetachedShell` call

Patch log (1 entry — H.1.0af)

Thirty-first quick patch on the Hipparchus arc.

Bug A fix — explicit `---DETACHED-STATUS---` marker between log tail and status echo

Bug B fix — initial poll fires immediately after start, not after `pollIntervalMs`

Bug C fix — virtual partial-failure banner copy refreshed for v.H.1.0ad+

Patch log (1 entry — H.1.0ae)

Thirtieth quick patch on the Hipparchus arc.

New `runRemoteDetachedShell` primitive

Prime bench wired through `runRemoteDetachedShell`

Slice bench inspected — no migration needed.

Patch log (1 entry — H.1.0ad)

Twenty-ninth quick patch on the Hipparchus arc.

Virtual persist overrides commit + contention before write

Partial-bench-failure diagnostic banner inside the virtual `2 · Benchmark` sub-section

SSH-bench-script resilience for Prime

Patch log (1 entry — H.1.0ac)

Twenty-eighth quick patch on the Hipparchus arc.

Realistic placeholder breakdown values for virtual entries

Host capacity threaded through the virtual roster decorator

DRIVE BREAKDOWN read-only panel inside the provisioning section

Danger zone hidden when ServerScoreCard renders in virtual mode

`BenchSubstepList` moved BELOW the `ServerScoreCard`

`[VIRTUAL-POLL]` operator-visible trace + persistent render through queued → running → terminal

Patch log (1 entry — H.1.0ab)

Twenty-seventh quick patch on the Hipparchus arc.

`ServerScoreCard` renders UNCONDITIONALLY in the virtual `2 · Benchmark` sub-section

Run benchmark button moves to the section header (top-right)

`BenchSubstepList` renders ABOVE the ServerScoreCard during an in-flight bench

`[VIRTUAL-WRITEBACK]` operator trace on the persist helper

Patch log (1 entry — H.1.0aa)

Twenty-sixth quick patch on the Hipparchus arc.

Live BenchSubstepList during a virtual bench

Full ServerScoreCard render after a completed virtual bench

Worker writeback to `virtual_containers.last_*`

Patch log (1 entry — H.1.0z)

Twenty-fifth quick patch on the Hipparchus arc.

Remove the "Panel 1 / Panel 2" wrapper labels

New `VirtualStoicismEligibility` component

`ServerBenchmarkLayout` mounts NodeScoringCard / VirtualStoicismEligibility directly

Drop "Panel N ·" prefix on the Triton header

Patch log (1 entry — H.1.0y)

Twenty-fourth quick patch on the Hipparchus arc.

Redundant "Chainweb Containers" panel removed

Max-slaves cap counts ONLY virtual slaves

Eligible-drives endpoint wires up the Triton drive dropdown

Real container selection mounts NodeScoringCard exactly once

Pool math: virtual commits SEPARATE from real commits

Slim provisioning panel: no commit editor for virtual entries

Patch log (1 entry — H.1.0x)

Twenty-third quick patch on the Hipparchus arc.

Three-panel Triton layout replaces the single `ServerPrimeBenchCard` surface

Default selection = Virtual Prime.

Stoicism sub-tab regression preserved.

Patch log (1 entry — H.1.0w)

Twenty-second quick patch on the Hipparchus arc.

Server Prime Bench button feedback + auto-polling

Chainweb container view: 3 top-level tabs

Container Overview shows container-specific resource data

Patch log (1 entry — H.1.0v)

Twenty-first quick patch on the Hipparchus arc.

Preview-mode bench bypasses the commitment gate

`[Run all drive benches]` surfaces failures

Chainweb container view collapses to Overview + StoaChain

Patch log (1 entry — H.1.0u)

Twentieth quick patch on the Hipparchus arc.

Multi-drive preview-candidates composition

`ServerPrimeBenchCard` multi-drive UI

Prime/slice container benches unchanged.

Patch log (1 entry — H.1.0t)

Nineteenth quick patch on the Hipparchus arc.

New `Benchmark` sub-tab inside Overview

Server Prime Bench card with preview-mode commitment baseline

Container scores list

Patch log (1 entry — H.1.0s)

Eighteenth quick patch on the Hipparchus arc.

`BASELINE_CPU_SYSBENCH_MT` reverted from 20_000 → 5_000

`BASELINE_CPU_SYSBENCH_ST` lowered from 5_000 → 3_000

Patch log (1 entry — H.1.0r)

Seventeenth quick patch on the Hipparchus arc.

Slice CPU contribution from ghost cache scales linearly

Sysbench baseline split into per-thread-mode constants

Closest-region picker bash multi-file glob fix

Bench-script emits `[net-up WARN]` marker on empty hub URL

Patch log (1 entry — H.1.0q)

Sixteenth quick patch on the Hipparchus arc.

Net-pre `CLOSEST_REGION` picker excludes errno files

`net.upload` skip-vs-failed contract pinned end-to-end

RAW/BASELINE cell drops `baselineMin`

Per-row Contribution shows ONLY parts notation; absolute moves to TOTAL only

RAW/BASELINE cell gains `<N>% baseline` annotation

RAM rehaul — single `ram.bandwidth` row from sysbench memory

Patch log (1 entry — H.1.0p)

Fifteenth quick patch on the Hipparchus arc.

Disk + Network step-record emission uses `linearRatio` instead of `clampUnit`

DiskDetails panel removes four legacy DetailRow blocks below the StepRowTable

Network panel drops the 8 per-region latency rows; net weights rebalance

`BASELINE_HASH_VERIFY_OPS` lowered from 4_000 to 200 ops/sec

`parseStressNgStreamMbps` accepts a second output format

Website upgrade flow removes the password reauth

Patch log (1 entry — H.1.0o)

Fourteenth quick patch on the Hipparchus arc.

Linear contribution scaling

stress-ng RAM bench switches from `--memrate` to `--stream` (McCalpin STREAM)

Detail-panel sub-weight labels change from "(N.NN%)" to "<parts>/<categoryParts> parts of <CATEGORY>"

Tile face >100% baseline rendering verified

`ram_install_stress_ng` row `what:` label clarification

Patch log (1 entry — H.1.0n)

Thirteenth quick patch on the Hipparchus arc.

Decimal.js precision in the StepRowTable sub-weight percent

Color emphasis swap

`netSubtestInputs` wired into `computeServerScore` on Prime

Disk step records emitted on Prime

RAM quantity step record emitted on Prime

Patch log (1 entry — H.1.0m)

Twelfth quick patch on the Hipparchus arc.

Root bug — Prime + Slice handlers never passed `cpuSubtestInputs` to `computeServerScore`

Tile face shows `<N>% baseline` instead of "mean baseline %"

Verdict thresholds rebuilt

Patch log (1 entry — H.1.0l)

Eleventh quick patch on the Hipparchus arc.

stress-ng option drift — `--stream-mbm` doesn't exist

OpenSSL 3.x requires `-rawin` for Ed25519 pkeyutl ops

Detail-panel column semantics rehauled

TOTAL row at the bottom of every category step table

Install-probe rows render `—` in Normalised + Contribution

Patch log (1 entry — H.1.0k)

Tenth quick patch on the Hipparchus arc.

Ed25519 parser preamble-exclusion

Stress-ng RAM bench switches from `--vm-method all` to `--stream`

Per-region network StepRecords prefer librespeed evidence over 5-second preflight

Prime RAM substep visible in progress card

Tile face mean baseline %

Detail-panel column rehaul

Patch log (1 entry — H.1.0j)

Ninth quick patch on the Hipparchus arc.

Hipparchus markers wrap the Prime inline bench's three CPU sub-test blocks

`parseBlake2s` tolerates 5 OR 6 `k`-suffixed columns

`parseEd25519` tolerates the OpenSSL 3.2+ "253 bits EdDSA (Ed25519)" prefix

stress-ng `--vm-method memmove` → `--vm-method all`

Per-region net-pre errno diagnostic

Patch log (1 entry — H.1.0i)

Eighth quick patch on the Hipparchus arc.

Prime handler keyed `steps` object

Slice synthesis steps emission

SLICE_PHASES dead CPU substeps removed

Collapsed CPU tile face new format

Stale-pre-rehaul header math fixed

Per-region network progress

Slice handler RAM progress emissions

Patch log (1 entry — H.1.0h)

Seventh quick patch on the Hipparchus arc.

Legacy CPU summary block deleted

RAM-mirrored header strip above the StepRowTable

Stale-pre-rehaul fallback

CPU identity rows + penalty / normalised / verdict rows preserved

Test fixture alignment

Patch log (1 entry — H.1.0g)

Sixth quick patch on the Hipparchus arc.

Shared `lib/handlers/cpu-step-records.ts` module (NEW)

`benchmark-ghost-cpu.ts` refactor to use the shared module

Prime `benchmark-node.ts` CPU emission wiring

NO detached docker run on Prime, NO new SSH calls, NO UI work

Patch log (1 entry — H.1.0f)

Fifth quick patch on the Hipparchus arc.

New SSH primitive `runRemoteDetached` in `lib/ssh.ts`

`benchmark-ghost-cpu` handler refactor to use the new primitive

`GhostBenchProgress` component

`NodeScoringCard` wiring

`runStressNgRamBench` helper in `benchmark-node.ts`

Patch log (1 entry — H.1.0e)

Fourth quick patch on the Hipparchus arc.

`docker image inspect` pre-check before pull

`docker pull -q` (quiet mode) when pull is needed

Distinct errno for SSH stream-killed pulls

CPU detail panel banner variant for `EBENCH_PULL_STREAM_KILLED`

Existing `EBENCH_PULL_FAILED` banner preserved

Patch log (1 entry — H.1.0d)

Third quick patch on the Hipparchus arc.

Silent capacity-only fallback removed in `lib/handlers/benchmark-segregated-slice.ts`

Three new errnos on the RAM speed step record

RAM detail panel 15+4 split labels

Stress-ng failure banner

Pre-existing latent bug context

Patch log (1 entry — H.1.0c)

Second quick patch on the Hipparchus arc.

`ah-bench-cpu` renamed to `stoa-bench` in the StoaChain GHCR namespace

GitHub Actions auto-publish workflow

Install-time pre-pull stage

Pull-failure handling in `benchmark-ghost-cpu`

ServerScoreCard pull-failed banner

Two stale `ah-bench:cpu` / `ah-bench-cpu` literals fixed

Patch log (1 entry — H.1.0b)

First quick patch on the Hipparchus arc, following the letter-suffix convention introduced at v.G.1.1 Cerberus.

Slice handler plumbs RAM-split inputs into `computeServerScore`

Network detail panel per-region row density

CPU detail panel ghost-bench-pending banner

Force-fresh re-bench button label

Patch log (1 entry — H.1.0a)

The benchmark-score rehaul.

CPU subscore split into 8 weighted sub-tests

RAM subscore split into quantity + speed

Network subscore split into 5 sub-tests with chainweb-minimum baselines

CPU quality badge

Ghost-container CPU bench for segregated slices

Per-step row contract in the live-run UI

Three opener-line variants per dimension

Force-fresh fleet rebench

Hub upload-sink endpoint

CPU model display fix

Manual `ah-bench:cpu` GHCR build + push

Patch log (1 entry — H.1.0)

▸Cadmus· v1

Found the ChronVer versioning system the whole release history is now recorded in.

▸v.Chronos.Cadmus.1· 7 entries

Shortened and refined the changelog logs (short per-entry lines, a résumé per Genesis bundle, Gluon buckets unified as codenames) and superseded the Q22 model so each codename carries real numbered versions.

Real per-codename patch-number derivation (Q22 superseded).

The changelog era tiers render top→bottom Gluon → Chronos → Boreas → Aether → Chaos

Naming refinements.

The foundation dogfoods itself again.

1-a

Changelog short entries, Genesis résumés, and Gluon-as-codenames render hotfix.

1-b

C5 codename-header highest-version chip fix.

▸v.Chronos.Cadmus.0· Genesis· 0 entries

Perseus gave the project one canonical coordinate-addressed forest and made the build stamp derive itself from the last shipped node.

The ChronVer display grammar `v.<era>.<named-patch>.<patch-number>-<hotfix-letter>`.

The Chaos origin era and the Gluon orderless maintenance era.

Two sanctioned migration-point folds.

The Hipparchus → Charon rename.

The deterministic placement skill.

The foundation dogfoods itself.

▸Perseus· v0

Replace the four hand-typed version surfaces with one canonical coordinate-addressed forest every surface is derived from.

▸v.Chronos.Perseus.0· Genesis· 1 entry

The version stamp used to be a hand-typed constant: a human edited `VERSION`, `PHASE_CODE`, and `PHASE_NAME` in `lib/version.ts` every release, and the same scheme was re-typed by hand into the changelog, the roadmap, the releases index, and the canonical page

The version stamp used to be a hand-typed constant: a human edited `VERSION`, `PHASE_CODE`, and `PHASE_NAME` in `lib/version.ts` every release, and the same scheme was re-typed by hand into the changelog, the roadmap, the releases index, and the canonical page

One canonical coordinate-addressed forest.

The four-designator grammar `v.<Release>.<Era>.<Name>-<fix>`.

Four re-derived surfaces + one unified canonical page.

The retroactive-fix rule.

The version stamp is derived, never hand-stamped.

The change dogfoods itself.

Tests

Patch log (5 entries — v.H.1.6 P1–P5)

▸Nemesis· v0

Close the economic hole where a red, stale, or red-component node kept minting stoicism as if it were gold.

▸v.Chronos.Nemesis.0· Genesis· 1 entry

A node whose Earn Score is red, stale, or built on a red sub-component used to keep accruing stoicism anyway

A node whose Earn Score is red, stale, or built on a red sub-component used to keep accruing stoicism anyway

Single-source red determination, both accrual paths gated.

Forward-only economic gate (no clawback).

Migration 088 — red-skip event + a latent bug fix.

Red is shown red where the score is read.

Triton is triple-tabbed with a disk-tweaked eligibility view.

Jobs page reads in plain language.

Argus rebench is an actionable per-entity control.

Tests

Patch log (5 entries — v.H.1.5 P1–P5)

▸Triptolemus· v0

Turn benchmark reuse from a hard-coded always-rebench into a real greenlight-gated decision and propagate it fleet-wide.

▸v.Chronos.Triptolemus.0· Genesis· 1 entry

Benchmark reuse stops being a hard-coded "always rebench from scratch" and becomes a real, greenlight-gated decision made at the moment a bench runs: a result that is **gold** (stamped with the live greenlit version) and still **fits** the target is reused/skipped instead of being re-measured, while anything **red, missing, or stale** is benched fresh

Benchmark reuse stops being a hard-coded "always rebench from scratch" and becomes a real, greenlight-gated decision made at the moment a bench runs: a result that is **gold** (stamped with the live greenlit version) and still **fits** the target is reused/skipped instead of being re-measured, while anything **red, missing, or stale** is benched fresh

One shared greenlight-gated reuse decision.

Force-default flip + universal force-fresh toggle.

Drive recency demoted to a secondary safety net.

CPU reuse can finally be gold-gated.

Four bulk "Bench all" endpoints.

Argus bulk controls.

The reuse gate is visible where the score is read.

Tests hardening, no scoring-math regression.

Tests

Patch log (5 entries — v.H.1.4 P1–P5)

▸Daedalus· v0

Make the disk a first-class measured benchmark entity instead of a number inferred from the containers on top of it.

▸v.Chronos.Daedalus.0· Genesis· 1 entry

The disk stops being a derived afterthought computed from container rows and becomes a first-class benchmark entity with its own Score, Stamped version, Last benched, Time since, and Status

The disk stops being a derived afterthought computed from container rows and becomes a first-class benchmark entity with its own Score, Stamped version, Last benched, Time since, and Status

Disk is a first-class benchmark entity.

Single-source disk subscore.

N=1 Prime / ÷N slave equivalence.

Drive benches now carry a bench-version stamp.

Argus admin panel is now a four-tab surface:

Triton user-facing panel is now a three-group surface:

Tests hardening.

Tests

Patch log (5 entries — v.H.1.3 P1–P5)

▸Argus· v0

Give the operator a single watchful oversight surface over the whole bench-versioning system.

▸v.Chronos.Argus.0· Genesis· 15 entries

Operator-reported, while rebenching StoaAncientTwo: (1) StoaAncientTwo now has the exact same score as StoaAncientOne; (2) `benchmark-host-drive failed` in the logs; (3) ghost `hash+verify pipeline` still `EBENCH_PARSE`.
Operator-reported: StoaAncientTwo, StoaAncientThree and StoaAncientFour all showed "54 minutes ago benched"
Operator-reported color inconsistency: StoaAncientOne (= AncientOne-chainweb-001) showed a **red** score (4.401161) in the Argus panel but a **yellow/gold** score in the per-node benchmark panel
Three follow-ups after the v.H.1.2j rebench of AncientOne-chainweb-001, plus the panel UX rework the operator asked for.
Five issues surfaced by the operator rebenching AncientOne-chainweb-001 (the chainweb whose `cluster-id` flag is "StoaAncientOne").
Operator-reported: `StoaEnclaveSix` (chainweb-006 on StoaNodePrime) showed `missing score` in Argus even though the entity's `ServerScoreCard` (per-node page) rendered a real partial score (~3.713).
Operator-reported: missing-score rows showed `(pre-versioning)` in the Stamped Version column even though they have no bench at all (no score, no timestamp, no stamp).
Three coordinated changes: the **Greenlight system** (operator-driven design pivot
Two refinements from operator smoke-testing of v.H.1.2e:
Operator-reported: `StoaAncientFour` rendered as "missing score" in the Argus all-entities table while `/hub/nodes` showed its real score.
Two operator-reported corrections from the v.H.1.2c live deploy:
Operator-driven Argus refinement: replace the 20-oldest stale list + unified all-entities list with two split paginated cards (Real Entities + Virtual Entities), and surface the "every server has a Virtual Prime by default" invariant by auto-creating missing prime rows during the fleet endpoint read.
Argus all-entities surface
Five operator-reported follow-ups from the live v.H.1.2 deploy: two Argus refinements + three v.H.1.1nd staleness-UX bugs the operator caught while smoke-testing the panel and benching real containers.
Bench-version oversight admin panel.

Operator-reported, while rebenching StoaAncientTwo: (1) StoaAncientTwo now has the exact same score as StoaAncientOne; (2) `benchmark-host-drive failed` in the logs; (3) ghost `hash+verify pipeline` still `EBENCH_PARSE`.

(1) Identical score

(2) `benchmark-host-drive failed`

(3) ghost `hash+verify` EBENCH_PARSE

Tests

Patch log (1 entry — v.H.1.2n)

Operator-reported: StoaAncientTwo, StoaAncientThree and StoaAncientFour all showed "54 minutes ago benched"

Root cause

`pages/api/admin/bench-versioning/fleet.ts`

This completes the **score (v.H.1.2e) / version (v.H.1.2l) / date (v.H.1.2m)** source-consistency set: all three dimensions for segregated children are now sourced exactly the way the per-node benchmark panel sources them, so Argus and the per-node card always agree.

Patch log (1 entry — v.H.1.2m)

Operator-reported color inconsistency: StoaAncientOne (= AncientOne-chainweb-001) showed a **red** score (4.401161) in the Argus panel but a **yellow/gold** score in the per-node benchmark panel

Root cause

`pages/api/admin/bench-versioning/fleet.ts`

A segregated child with **no** slice bench (score wholly inherited from drive + parent net, e.g.

Patch log (1 entry — v.H.1.2l)

Three follow-ups after the v.H.1.2j rebench of AncientOne-chainweb-001, plus the panel UX rework the operator asked for.

CPU Subscore still 0.000 in the Container-Score detail panel.

"Ghost CPU Running 57% elapsed 0m0s".

StoaEnclaveSix (= chainweb-006) showed a score but a blank "Last benched".

Panel UX rework.

Patch log (1 entry — v.H.1.2k)

Five issues surfaced by the operator rebenching AncientOne-chainweb-001 (the chainweb whose `cluster-id` flag is "StoaAncientOne").

Root cause of "score still red after rebench"

Deploy procedure reloads BOTH PM2 processes.

Slice + virtual-slave handlers stamp the LIVE version.

Transitive rebench.

Ghost-CPU `perf stat` IPC fix.

CPU 0.000

Patch log (1 entry — v.H.1.2j)

Operator-reported: `StoaEnclaveSix` (chainweb-006 on StoaNodePrime) showed `missing score` in Argus even though the entity's `ServerScoreCard` (per-node page) rendered a real partial score (~3.713).

ServerScoreCard

`/hub/nodes` registry + Argus fleet endpoint (pre-v.H.1.2i)

Surface partial slice scores in Argus.

Patch log (1 entry — v.H.1.2i)

Operator-reported: missing-score rows showed `(pre-versioning)` in the Stamped Version column even though they have no bench at all (no score, no timestamp, no stamp).

Stamped Version column distinguishes three states:

For **chainweb children** (segregated containers), v.H.1.2g preloads `segregated_slice_benchmarks.benchmarked_at` and uses it when the nodes column is null.

For **full-hosts** and **virtuals**, the timestamp lives on the same row as the score (`server_score_benchmarked_at` and `last_benched_at` respectively).

1 new panel render test (`T-A-R-STAMP1`)

Patch log (1 entry — v.H.1.2h)

Three coordinated changes: the **Greenlight system** (operator-driven design pivot

Greenlight system — promote button on Argus.

Distribution includes never-benched in `(pre-versioning)` bucket.

Chainweb-child `lastBenchedAt` sourcing.

Patch log (1 entry — v.H.1.2g)

Two refinements from operator smoke-testing of v.H.1.2e:

Real entities filtered to chainweb-bearing rows only.

`score.serverScore` fallback for older breakdown shapes.

Patch log (1 entry — v.H.1.2f)

Operator-reported: `StoaAncientFour` rendered as "missing score" in the Argus all-entities table while `/hub/nodes` showed its real score.

Mirror `/hub/nodes` score resolution.

Virtual containers get the same fallback for symmetry.

Patch log (1 entry — v.H.1.2e)

Two operator-reported corrections from the v.H.1.2c live deploy:

Virtual Prime auto-create is now host-only.

Migration 084 cleans up the bogus prime rows.

(prime) chip on real entities.

Patch log (1 entry — v.H.1.2d)

Operator-driven Argus refinement: replace the 20-oldest stale list + unified all-entities list with two split paginated cards (Real Entities + Virtual Entities), and surface the "every server has a Virtual Prime by default" invariant by auto-creating missing prime rows during the fleet endpoint read.

Real vs Virtual entities split.

Auto-create missing Virtual Primes.

`staleEntities[]` retired from the endpoint response.

Patch log (1 entry — v.H.1.2c)

Argus all-entities surface

`AllEntitiesCard` — paginated all-entities list.

Per-kind distribution split.

Composed `displayName` server-side.

`Slave{NNN}` auto-naming convention.

Fleet endpoint shape additions

Patch log (1 entry — v.H.1.2b)

Five operator-reported follow-ups from the live v.H.1.2 deploy: two Argus refinements + three v.H.1.1nd staleness-UX bugs the operator caught while smoke-testing the panel and benching real containers.

Argus stale-entities — parent host context on virtuals.

Fleet-rebench affordance on Argus.

ServerScoreCard `useLive` color carve-out removed.

Slice bench stamps `benchVersion` end-to-end.

Virtual-slave writeback stamps `benchVersion`.

Slice + ghost-CPU combined bench lifecycle.

Patch log (1 entry — v.H.1.2a)

Bench-version oversight admin panel.

`lib/bench-version.ts`

`pages/api/admin/bench-versioning/drift.ts`

`pages/api/admin/bench-versioning/fleet.ts`

`pages/hub/bench-versioning.tsx`

Operator workflow

Patch log (1 entry — v.H.1.2)

▸Pheidippides· v0

Graduate the push-primary bench architecture into its own release line and add the bench-version semver discipline.

▸v.Chronos.Pheidippides.0· Genesis· 19 entries

Bench-version semver system + segregated-slice handler bug fixes from operator-reported AncientOne-chainweb-001 real-container bench.
Bench-progress UX refinement after v.H.1.1nb stabilised the bench-event channel.
Revert
Two pre-existing bugs operator caught from a live bench run + my v.H.1.1n deploy.
Phase 2 of the bench-progress UX rebuild: **progressive per-dimension tile fill** during in-flight runs.
Bench progress panel becomes **persistent**
net.upload subtest **retired**; 2 parts redistributed to jitter (1 → 2) and loss (1 → 2).
Five operator-driven baseline recalibrations + one upload-row debuggability fix.
Network bench restructure operator drove after reading the v.H.1.1i bench output.
Three score-card polish fixes operator caught after staging v.H.1.1h.
Score-card per-dimension "vs baseline" labels rewritten.
Disk baseline alignment to Samsung 870 EVO spec **plus a critical baseline-divergence bug fix** the operator caught while triaging the next disk-score iteration.
Network scoring restructure + two real bench-script bugs the operator caught.
CPU baseline recalibration
Disk baseline recalibration to a good SATA SSD reference + per-substep completion-time labels in the live bench view.
Bench score calibration + two real bench-script bugs caught from the first successful v.H.1.1b run.
Bench data-quality refinements against the first successful end-to-end Pheidippides bench (v.H.1.1a) on AncientOne.
First quick patch on the Pheidippides arc.
Bench transport rehaul.

Bench-version semver system + segregated-slice handler bug fixes from operator-reported AncientOne-chainweb-001 real-container bench.

`lib/bench-version.ts`

`computeServerScore` stamps every breakdown

ServerScoreCard color-grading + banner

LastBenchDisplay color-grading

CI drift guard

Per-sub-test disk rows for slices

RAM quantity row + richer bandwidth inheritance attribution

Deferred — CPU 0.000 root cause

Patch log (1 entry — v.H.1.1nd)

Bench-progress UX refinement after v.H.1.1nb stabilised the bench-event channel.

`formatRelativeGranular(iso)`

`formatAbsoluteISO(iso)`

`<LastBenchDisplay />`

`BenchmarkPanel.tsx`

Patch log (1 entry — v.H.1.1nc)

Revert

Bash `__ancient_phase_done` reverted to the pre-v.H.1.1n minimal shape.

Phase 2 (per-dimension progressive tile fill) is DEFERRED.

The v.H.1.1na callback-form replace stays.

Forward path (NOT v.H.1.1nb)

Patch log (1 entry — v.H.1.1nb)

Two pre-existing bugs operator caught from a live bench run + my v.H.1.1n deploy.

Bash `__ancient_phase_done` was producing `/proc/$/fd/1` (single `$`)

`FULL_HOST_PHASES` vs `PHASE_PROGRESS` drift.

Patch log (1 entry — v.H.1.1na)

Phase 2 of the bench-progress UX rebuild: **progressive per-dimension tile fill** during in-flight runs.

Bash bench-script `__ancient_phase_done` upgrade.

New endpoint `GET /api/admin/nodes/[id]/bench-progress-snapshot`.

ServerScoreCard `liveBenchJobId` prop + polling.

NodeScoringCard wires `benchmarkJobId` through.

Patch log (1 entry — v.H.1.1n)

Bench progress panel becomes **persistent**

Bench-progress panel persistence.

"Last run" metadata block.

ServerScoreCard `inFlight` prop.

Patch log (1 entry — v.H.1.1m)

net.upload subtest **retired**; 2 parts redistributed to jitter (1 → 2) and loss (1 → 2).

net.upload subtest retired.

Expand-collapse dropdown on `net.download` row.

`PHASE_PROGRESS` map updated.

Documentation updates.

Patch log (1 entry — v.H.1.1l)

Five operator-driven baseline recalibrations + one upload-row debuggability fix.

CPU MT baseline 20_000 → 15_000

CPU IPC baseline 1.5 → 1.0.

Ed25519 baseline 15_000 → 10_000.

Hash-verify baseline 100 → 80.

Net download baseline 100 → 50 Mbps.

Upload row debuggability fix.

Patch log (1 entry — v.H.1.1k)

Network bench restructure operator drove after reading the v.H.1.1i bench output.

Pillar A — Per-region `net.download.<region>` rows collapsed into a single `net.download` step record.

Pillar B — Direction-of-merit flag on the StepRecord type.

Pillar C — Net.loss display quantization.

Patch log (1 entry — v.H.1.1j)

Three score-card polish fixes operator caught after staging v.H.1.1h.

Pillar A — Dimension tile baseline label prefix fix.

Pillar B — Baseline-summary line color bumped

Pillar C — Summary header added to Disk / Network / Commitment

Patch log (1 entry — v.H.1.1i)

Score-card per-dimension "vs baseline" labels rewritten.

What landed

Patch log (1 entry — v.H.1.1h)

Disk baseline alignment to Samsung 870 EVO spec **plus a critical baseline-divergence bug fix** the operator caught while triaging the next disk-score iteration.

Pillar A — Critical bug fix: `diskComposite` now uses the constants.

Pillar B — Samsung 870 EVO baselines.

Patch log (1 entry — v.H.1.1g)

Network scoring restructure + two real bench-script bugs the operator caught.

Pillar A — Network download: top-5-of-8 regions instead of average-of-8.

Pillar B — Upload-sink HTTP 413 fixed.

Pillar C — Jitter measurement robustness.

Patch log (1 entry — v.H.1.1f)

CPU baseline recalibration

What landed

Patch log (1 entry — v.H.1.1e)

Disk baseline recalibration to a good SATA SSD reference + per-substep completion-time labels in the live bench view.

Pillar A — Disk baselines bumped to a good SATA SSD reference.

Pillar B — Per-substep completion-time labels.

Patch log (1 entry — v.H.1.1d)

Bench score calibration + two real bench-script bugs caught from the first successful v.H.1.1b run.

Pillar A — Network baselines recalibrated.

Pillar B — RAM bandwidth baseline recalibrated.

Pillar C — Jitter bench `--max-time` dropped 5 s → 1 s.

Pillar D — Upload-sink URL DNS bug fixed.

Patch log (1 entry — v.H.1.1c)

Bench data-quality refinements against the first successful end-to-end Pheidippides bench (v.H.1.1a) on AncientOne.

Pillar A — network sub-tests no longer skip wrongly.

Pillar B — RAM bandwidth measures peak instead of latency.

Pillar C — virtual commitment scales with the chosen drive.

Patch log (1 entry — v.H.1.1b)

First quick patch on the Pheidippides arc.

Pillar A — finalize POST moved out of the bench script into the kickoff wrapper.

Pillar B — progress UI stuck at 95 %.

Pillar C — elapsed time in `VirtualBenchLiveProgress`.

Patch log (1 entry — v.H.1.1a)

Bench transport rehaul.

Flow

Mid-bench SSH stream death.

The whole class of "what kills SSH on this host?" debugging.

Score math + breakdown JSON shape + UI surfaces.

Existing handlers.

Plan reference

Patch log (1 entry — v.H.1.1)

▸Boreas (“Genesis”)Release B · 4 entries

▸Iris· v0

Let one physical server carry N public hostnames so each container can present its own identity to the chainweb P2P layer.

▸v.Boreas.Iris.0· Genesis· 29 entries

Small UI fix on the hub registry page (`/hub/nodes`).
After v.G.1.4z's diagnostic tool fired `[CRITICAL] D-HAIRPIN-VERIFY` on chainweb-002, the operator pointed out a counter-example that broke the hairpin theory cleanly: StoaBastionTwo + StoaBastionThree are syncing fine despite identical hairpin conditions on the website-hub box.
After the v.G.1.4y deploy the operator reinstalled `StoaEnclaveTwo` (chainweb-002 on StoaNodePrime) cleanly via the hub, and the new container stalled at cut height 0
After the v.G.1.4x deploy the operator hit a stuck `stoachain-control` job: action `restart` on `StoaNodePrime-chainweb-002` (a segregated child whose container is named `stoa-node-slave-002`) sat at progress 80% / step "waiting for chainweb-node inside container" and eventually failed at the 240 s `DOCKER_HEALTHY_TIMEOUT_MS`
After the v.G.1.4w deploy, the operator removed StoaEnclaveTwo + StoaEnclaveThree to reinstall them fresh.
After the v.G.1.4v deploy, the operator tried to change a chainweb's peer port via the Medusa Panel's "Ports → Apply" button and got `Validation failed: admin_confirm_required`.
The Seeds dashboard's "Active downloads" table didn't update live when reseeds were triggered from another browser tab.
The per-node "Reseed from snapshot" modal in the StoaChain control panel was taking 5+ seconds to populate its 4 display fields (donor, cut height, size, promoted-at).
After the v.G.1.4s deploy, the operator surfaced two operator-visible inconsistencies on the seeds dashboard. (1) `/admin/seeds` showed "cycling through 3 references" while `/admin/tip-references` settings showed only 2 checked. (2) An active reseed of a chainweb container (operator clicked "Re-seed from current") didn't appear in the dashboard's "Active downloads" table. v.G.1.4t closes both.
After installing AncientOne (a fresh single-disk home server) and starting a chainweb container, the operator saw a red Disk panel that survived three re-benches.
The v.G.1.4q auto-enqueue actually fired on the user's StoaNodePrime slave-bench retry
The v.G.1.4p resilient lookup added fallback paths for the slave's network inheritance, but only helps when the parent has SOMETHING to fall back from (`net.composite > 0` or at least one valid `net.perServer[]` entry).
After installing a fresh chainweb container on StoaNodePrime, the per-slave Network panel rendered red even though the parent host's network was measurably fine, and re-running the SLAVE benchmark didn't fix it.
The diagnostics added in v.G.1.4n revealed the real root cause of the StoaNodePrime install reachability failures: the operator's `node1.stoachain.com` is a **multi-A DNS** pointing to TWO different servers (StoaNodeOne at `129.212.143.119` and StoaNodePrime at `85.215.141.198`). `dns.lookup` returns whichever A-record wins round-robin, so the probe was dialing StoaNodeOne's firewall (closed for these ports) instead of StoaNodePrime's (open). v.G.1.4o stops dialing the DNS hostname for reachability checks and dials the install target's actual IP directly.
Diagnostic depth for the install-wizard reachability probe.
Two install-wizard fixes surfaced during a slave install on StoaNodePrime: the preflight's `sudo -n available` check was going yellow even when the docker NOPASSWD rule was correctly in place, and the reachability probe was showing red on the allocated slave ports (17891 / 18481) even when the operator had added Cerberus Panel UFW rules for them.
Finishes the IdentityStep desync fix that v.G.1.4k started but missed.
A small but operator-impacting fix in the chainweb-install wizard's Step 6 DNS picker.
EarnScore graduates from a stacked sub-line under ServerScore (v.G.1.4i layout) into its own dedicated table column on `/hub/nodes`.
Two coupled fixes for the ServerScore column on `/hub/nodes`. (1) The single ServerScore display splits into TWO economic entities
The big follow-up to v.G.1.4g: finishes the Personal/Foreign Pool work by adding the actual two-section bucket-split rendering on `/hub/nodes`.
Two contained UI/semantic fixes for `/hub/nodes` that fall out of v.G.1.4f live testing on ancientholdings.eu and prepare the data layer for the bigger v.G.1.4h sort + grouping + pagination overhaul.
Four follow-up fixes layered on top of v.G.1.4e Iris that close the issues surfaced during AncientMiner live testing of the post-v.G.1.4d benchmark UI.
One surgical fix that closes the last autonomic-toggle gap on Tunnelees with legacy install state.
Three follow-up fixes that cleanly address the remaining autonomic-toggle gaps after v.G.1.4c live testing on ancientholdings.eu.
Two surgical follow-up fixes layered on top of v.G.1.4b Iris that fall directly out of post-deploy investigation: the autonomic-toggle eligibility tooltip introduced in v.G.1.4a was never reaching the UI on the per-node Mount Capacity card, and full-host installs (Prime containers) never had their `data_drive_id` set so the eligibility predicate's chainweb-pinned axis failed universally.
Three follow-up data-integrity + UX fixes layered on top of v.G.1.4a Iris, surfaced during live testing on ancientholdings.eu.
Eight follow-up UX/bug fixes layered on top of v.G.1.4 Iris, surfaced during live testing on ancientholdings.eu.
The multi-DNS-per-server spec

Small UI fix on the hub registry page (`/hub/nodes`).

BucketPill repositioning

Tip alignment restored

Pre-existing v.G.1.4aa test fix

Patch log (1 entry)

After v.G.1.4z's diagnostic tool fired `[CRITICAL] D-HAIRPIN-VERIFY` on chainweb-002, the operator pointed out a counter-example that broke the hairpin theory cleanly: StoaBastionTwo + StoaBastionThree are syncing fine despite identical hairpin conditions on the website-hub box.

Cert obtain failure is now FATAL

Asclepius (the diagnostic tool, named this version)

Two new diagnoses

D-HAIRPIN-VERIFY downgraded

Asclepius naming + future companion

Source-level regression test

Patch log (1 entry)

After the v.G.1.4y deploy the operator reinstalled `StoaEnclaveTwo` (chainweb-002 on StoaNodePrime) cleanly via the hub, and the new container stalled at cut height 0

`lib/diag/chainweb-sync.ts`

Knowledge-base of named diagnoses

`scripts/diag-chainweb-sync.ts`

Patch log (1 entry)

After the v.G.1.4x deploy the operator hit a stuck `stoachain-control` job: action `restart` on `StoaNodePrime-chainweb-002` (a segregated child whose container is named `stoa-node-slave-002`) sat at progress 80% / step "waiting for chainweb-node inside container" and eventually failed at the 240 s `DOCKER_HEALTHY_TIMEOUT_MS`

`waitForContainerChainweb` now takes a `containerName` parameter

Function exported for the regression test

Error message includes the container name

Patch log (1 entry)

After the v.G.1.4w deploy, the operator removed StoaEnclaveTwo + StoaEnclaveThree to reinstall them fresh.

Auto-clear `host_drives.autonomic_commit_enabled` on last-child uninstall

Decision rationale

Patch log (1 entry)

After the v.G.1.4v deploy, the operator tried to change a chainweb's peer port via the Medusa Panel's "Ports → Apply" button and got `Validation failed: admin_confirm_required`.

Ports → Apply now wraps in `confirmFresh()`

Modal portal mounted in `MedusaEntry`

Confirm-dialog text

Patch log (1 entry)

The Seeds dashboard's "Active downloads" table didn't update live when reseeds were triggered from another browser tab.

New SSE endpoint

Shared activeDownloads helper

`/api/admin/seeds` refactored

Seeds page subscribes

Patch log (1 entry)

The per-node "Reseed from snapshot" modal in the StoaChain control panel was taking 5+ seconds to populate its 4 display fields (donor, cut height, size, promoted-at).

New lightweight endpoint

Reseed modal switches to the lean endpoint

Patch log (1 entry)

After the v.G.1.4s deploy, the operator surfaced two operator-visible inconsistencies on the seeds dashboard. (1) `/admin/seeds` showed "cycling through 3 references" while `/admin/tip-references` settings showed only 2 checked. (2) An active reseed of a chainweb container (operator clicked "Re-seed from current") didn't appear in the dashboard's "Active downloads" table. v.G.1.4t closes both.

Tip-reference filter mismatch

Active reseed not surfacing on seeds dashboard

Patch log (1 entry)

After installing AncientOne (a fresh single-disk home server) and starting a chainweb container, the operator saw a red Disk panel that survived three re-benches.

Drive bench scratch path on root-mount drives

Fix: route under `$HOME` when mount_point is `/`

Patch log (1 entry)

The v.G.1.4q auto-enqueue actually fired on the user's StoaNodePrime slave-bench retry

`segregatedHostMode` payload flag

Auto-enqueue path now sets the flag

Operator-facing log line

Tradeoff

Patch log (1 entry)

The v.G.1.4p resilient lookup added fallback paths for the slave's network inheritance, but only helps when the parent has SOMETHING to fall back from (`net.composite > 0` or at least one valid `net.perServer[]` entry).

Auto-enqueue parent's `benchmark-node`

De-dup against in-flight parent benches

Skip silently when parent has no SSH key

Audit trail

Skips silently when parent already has good net stamp

Patch log (1 entry)

After installing a fresh chainweb container on StoaNodePrime, the per-slave Network panel rendered red even though the parent host's network was measurably fine, and re-running the SLAVE benchmark didn't fix it.

`readParentNetContribution` helper

Returns 0 only when genuinely no signal

Patch log (1 entry)

The diagnostics added in v.G.1.4n revealed the real root cause of the StoaNodePrime install reachability failures: the operator's `node1.stoachain.com` is a **multi-A DNS** pointing to TWO different servers (StoaNodeOne at `129.212.143.119` and StoaNodePrime at `85.215.141.198`). `dns.lookup` returns whichever A-record wins round-robin, so the probe was dialing StoaNodeOne's firewall (closed for these ports) instead of StoaNodePrime's (open). v.G.1.4o stops dialing the DNS hostname for reachability checks and dials the install target's actual IP directly.

`canonicalHostIp` exposed by install-topology

Reachability probe dials the IP, not the hostname

Backward-compat fallback

Patch log (1 entry)

Diagnostic depth for the install-wizard reachability probe.

Explicit DNS lookup before TCP dial

Errno-specific branches for previously-collapsed errors

Per-port UI shows resolved IP + errno

Patch log (1 entry)

Two install-wizard fixes surfaced during a slave install on StoaNodePrime: the preflight's `sudo -n available` check was going yellow even when the docker NOPASSWD rule was correctly in place, and the reachability probe was showing red on the allocated slave ports (17891 / 18481) even when the operator had added Cerberus Panel UFW rules for them.

Sudo NOPASSWD regex now spans line wraps

Port probe uses the operator's picker hostname

Patch log (1 entry)

Finishes the IdentityStep desync fix that v.G.1.4k started but missed.

Banner hostname follows the picker

Source caption preserved

Patch log (1 entry)

A small but operator-impacting fix in the chainweb-install wizard's Step 6 DNS picker.

Defensive sync `selectedDnsHostnameId` → `p2pHostname`

Mechanism

Patch log (1 entry)

EarnScore graduates from a stacked sub-line under ServerScore (v.G.1.4i layout) into its own dedicated table column on `/hub/nodes`.

EarnScore as its own column

Renamed `Score` → `ServerScore` in the sort dropdown

Sort axes are now distinct

EarningScore declarations lifted into the row outer scope

Patch log (1 entry)

Two coupled fixes for the ServerScore column on `/hub/nodes`. (1) The single ServerScore display splits into TWO economic entities

EarningScore line

Three EarningScore cases

Sort-by-score uses EarningScore

Aggregate ServerScore helper

Earning-tooltip

Patch log (1 entry)

The big follow-up to v.G.1.4g: finishes the Personal/Foreign Pool work by adding the actual two-section bucket-split rendering on `/hub/nodes`.

Personal Server Pool / Foreign Server Pool sections

Tunneler-tree atomic placement

Per-bucket pagination at 25 servers/page

Per-bucket independent collapse toggles

Global Collapse Tunnelees toggle

Sort within each bucket

Cleanup #1: Server-column ROLE chip retired

Cleanup #2: Tunneler/Tunnelee chip tones

Cleanup #3: Ancient-admin reorder buttons + column retired

Patch log (1 entry)

Two contained UI/semantic fixes for `/hub/nodes` that fall out of v.G.1.4f live testing on ancientholdings.eu and prepare the data layer for the bigger v.G.1.4h sort + grouping + pagination overhaul.

Personal Pool / Foreign Pool count semantic.

3-slot tag scheme (Role / Connectivity / Mode) on each server row.

Patch log (1 entry)

Four follow-up fixes layered on top of v.G.1.4e Iris that close the issues surfaced during AncientMiner live testing of the post-v.G.1.4d benchmark UI.

Locale-neutral bench numbers via `export LC_ALL=C`.

Softer Network panel rendering on legitimate failure.

MedusaPanel Prime-detection gate regression fix from v.G.1.4d.

🆕 ⚙ Auto chip on chainweb rows in `/hub/nodes`.

Patch log (1 entry)

One surgical fix that closes the last autonomic-toggle gap on Tunnelees with legacy install state.

Autonomic `data_drive_id` backfill + `host_drives` synthesis fall back to `stoachain_runner_path` when `stoachain_data_path` is NULL.

Patch log (1 entry)

Three follow-up fixes that cleanly address the remaining autonomic-toggle gaps after v.G.1.4c live testing on ancientholdings.eu.

Autonomic eligibility predicate simplified to a single axis.

Full-host parents synthesised as their own chainweb-child for the consumption overlay.

`host_drives` synthesis from `system_probe_json.disk[]` for chainweb-data mounts.

Patch log (1 entry)

Two surgical follow-up fixes layered on top of v.G.1.4b Iris that fall directly out of post-deploy investigation: the autonomic-toggle eligibility tooltip introduced in v.G.1.4a was never reaching the UI on the per-node Mount Capacity card, and full-host installs (Prime containers) never had their `data_drive_id` set so the eligibility predicate's chainweb-pinned axis failed universally.

`NodeTabs` `eligibilityReason` wire-through.

`data_drive_id` backfill on probe.

Patch log (1 entry)

Three follow-up data-integrity + UX fixes layered on top of v.G.1.4a Iris, surfaced during live testing on ancientholdings.eu.

SEED chip never on parent shell, always on the chainweb-bearing label.

Tip Reference Menu data-integrity guard.

`host_drives` backfill on probe.

Patch log (1 entry)

Eight follow-up UX/bug fixes layered on top of v.G.1.4 Iris, surfaced during live testing on ancientholdings.eu.

Per-container SEED chip placement.

BenchSubstepList extended to five categories.

Per-substep measurements inline.

CPU card chainweb-aware shape.

Server-card score topology-correct sourcing.

Network rebench fail-loud instrumentation.

RAM measurement in slice bench.

Autonomic-toggle eligibility tooltip.

Patch log (1 entry)

The multi-DNS-per-server spec

N hostnames per node.

Per-container DNS pick at chainweb install time.

DuckDNS continuity.

Destructive migration 075

Legacy singular endpoint retired.

UI rewrite

Public docs updated

Patch log (1 entry)

▸Antikythera· v1

Measure CPU performance against the actual chainweb-node hot path instead of an abstract sysbench number.

▸v.Boreas.Antikythera.1· 1 entry

The cpu-benchmark-chainweb-tuning spec.

The cpu-benchmark-chainweb-tuning spec.

Six chainweb-correlated workloads

Chainweb-aggregate score formula

Four-bucket chainweb-friendliness label

IPC promoted to a scored signal at ~10% weight

Slice bench full mirror

Sudoers refresh

Silent-IPC-null bug fixed

Schema migrations 072 + 073

Operator UX — live substep list

Bucket-label visibility

Single-source bucket primitive

Public benchmarking docs

Internal scoring reference

Runtime-configurable baselines

240/240 spec tests pass

Patch log (1 entry)

▸v.Boreas.Antikythera.0· Genesis· 0 entries

▸Asclepius· v0

Reserve the canonical landing place for audit-cycle work — the mender, where audit cycles land.

▸v.Boreas.Asclepius.0· Genesis· 0 entries

Asclepius release page at `/docs/releases/asclepius`

Scope-update addenda in spec.md (10 fixes + 1 dead-code note: B-4-014 / B-4-020 / B-4-021 / B-4-023 / B-4-024 / B-4-027 / B-4-028 / B-6-010 / B-6-003 / CI-001 / CI-008)

requirements.md F11 row amended to align with the agreed comment-whitelist regex

▸Cerberus· v0

Drive the full UFW firewall lifecycle for managed nodes from the hub, end-to-end.

▸v.Boreas.Cerberus.0· Genesis· 5 entries

v.G.1.1

New `location_groups` table + nullable `nodes.location_group_id` FK column (migration 092); the port allocator widens `nextFreeSlotIndex` + `allocateSegregatedSet` to scope by location-group membership via an IN-list expansion that stays byte-equivalent for ungrouped servers (the dominant case at ship time).

New `/hub/locations` bulk page lists every group with per-member container count + slot range; the add-server form gains a Location step (standalone / join existing / create new); per-server detail page adds a `Location: <group>` row with inline picker.

The per-server PortPoolCard renders a "Location pool

Join-time collision warnings: PATCH `/api/admin/nodes/[id]` with `location_group_id` returns `warnings.affectedContainers` listing any slot the joiner's existing allocations would collide with on the target group

Cerberus firewall control.

Four endpoints: `/api/hub/nodes/[id]/firewall/bootstrap`, `/api/hub/nodes/[id]/firewall/reconcile`, `/api/hub/nodes/[id]/firewall/state`, plus the existing drift-status surface.

Probe-cycle drift integration

Full audit-trail guarantee: every mutation goes through `admin_audit` with operator email, target node, action, and result.

Preset templates

Preset apply/remove endpoints

Custom-rule endpoints

F12 unsafe-port helper

Per-node firewall UI

Bulk-apply page at `/hub/firewall/bulk` (Ancient Admin only)

Bulk dry-run preview endpoint `POST /api/hub/firewall/bulk/dry-run`

Bulk apply endpoint `POST /api/hub/firewall/bulk/apply` with sequential per-pair fan-out and 2 + 2P audit-row contract

applyPresetToNode helper extraction at `lib/firewall/apply-preset.ts` (Phase 3 T3.2 internal refactor; existing tests preserved)

New `firewall.bulk_apply` audit kind

Bulk selector + preview + progress UI components (`<BulkSelector>`, `<BulkPreview>`, `<BulkProgress>`)

lib/audit-query.ts extended with parent-chain CTE so per-node audit views surface bulk-apply umbrella rows

Cerberus public docs chapter at `/docs/tools/firewall`

Tools pillar landing card linking to the Cerberus chapter

Patch log (5 entries)

▸Aether (“Foundation”)Release A · 4 entries

▸Medusa· v0

Make segregated (slice) containers a fully scored, first-class entity alongside full-host containers.

▸v.Aether.Medusa.0· Genesis· 104 entries

v0.7.12m30
v0.7.12m29
v0.7.12m28
v0.7.12m27
v0.7.12m26
v0.7.12m25
v0.7.12m24
v0.7.12m23
v0.7.12m22
v0.7.12m21
v0.7.12m20
v0.7.12m19
v0.7.12m18
v0.7.12m17
v0.7.12m16
v0.7.12m15
v0.7.12m14
v0.7.12m13
v0.7.12m12
v0.7.12m11
v0.7.12m10
v0.7.12m9
v0.7.12m8
v0.7.12m7
v0.7.12m6
v0.7.12m5
v0.7.12m4
v0.7.12m3
v0.7.12m2
v0.7.12m1
v0.7.12k4a
v0.7.12k4
v0.7.12k3
v0.7.12k2
v0.7.12k1
v0.7.12k
v0.7.12j19
v0.7.12j18
v0.7.12j17g
v0.7.12j17f
v0.7.12j17e
v0.7.12j17d
v0.7.12j17c
v0.7.12j17b
v0.7.12j17
v0.7.12j16
v0.7.12j15
v0.7.12j14
v0.7.12j13
v0.7.12j12
v0.7.12j11
v0.7.12j10
v0.7.12j9
v0.7.12j8
v0.7.12j7
v0.7.12j6
v0.7.12j5
v0.7.12j4
v0.7.12j3
v0.7.12j2
v0.7.12j1
v0.7.12i30
v0.7.12i29
v0.7.12i28
v0.7.12i27
v0.7.12i26
v0.7.12i25
v0.7.12i24
v0.7.12i23
v0.7.12i22
v0.7.12i21
v0.7.12i20
v0.7.12i19
v0.7.12i18
v0.7.12i17
v0.7.12i16
v0.7.12i15
v0.7.12i14
v0.7.12i13
v0.7.12i12
v0.7.12i11
v0.7.12i10
v0.7.12i9
v0.7.12i8
v0.7.12i7
v0.7.12i6
v0.7.12i5
v0.7.12i4
v0.7.12i3
v0.7.12i2
v0.7.12i1
v0.7.12i
v0.7.12h
v0.7.12g1
v0.7.12g
v0.7.12f
v0.7.12e
v0.7.12d
v0.7.12c
v0.7.12b
v0.7.12a
v0.7.12
v0.7.12j15-events
v0.7.12d-pa

Patch log (104 entries)

▸Hydra· v0

Unify the install / convert / migrate wizards and make many slaves on one host correct.

▸v.Aether.Hydra.0· Genesis· 50 entries

v0.7.12l26
v0.7.12l25
v0.7.12l24
v0.7.12l23
v0.7.12l22
v0.7.12l21
v0.7.12l20
v0.7.12l19
v0.7.12l18
v0.7.12l17
v0.7.12l16
v0.7.12l15
v0.7.12l14
v0.7.12l13
v0.7.12l12
v0.7.12l11
v0.7.12l10
v0.7.12l9
v0.7.12l8
v0.7.12l7
v0.7.12l6
v0.7.12l5y
v0.7.12l5x
v0.7.12l5w
v0.7.12l5v
v0.7.12l5u
v0.7.12l5t
v0.7.12l5s
v0.7.12l5r
v0.7.12l5q
v0.7.12l5p
v0.7.12l5o
v0.7.12l5n
v0.7.12l5m
v0.7.12l5l
v0.7.12l5k
v0.7.12l5j
v0.7.12l5i
v0.7.12l5h
v0.7.12l5g
v0.7.12l5f
v0.7.12l5e
v0.7.12l5d
v0.7.12l5c
v0.7.12l5b
v0.7.12l5a
v0.7.12l5
v0.7.12l4
v0.7.12l3
v0.7.12l2

Patch log (50 entries)

▸Prometheus· v0

Build the Tunneler/Tunnelee NAT-traversal stack and the install-wizard + DNS-hostname foundation.

▸v.Aether.Prometheus.0· Genesis· 70 entries

v0.7.10z37h
v0.7.10z37g
v0.7.10z37f
v0.7.10z37e
v0.7.10z37d
v0.7.10z37c
v0.7.10z37b
v0.7.10z37a
v0.7.10z37
v0.7.10z35
v0.7.10z34
v0.7.10z33
v0.7.10z32
v0.7.10z31
v0.7.10z30
v0.7.10z29
v0.7.10z28
v0.7.10z27
v0.7.10z26
v0.7.10z25
v0.7.10z24
v0.7.10z23
v0.7.10z22
v0.7.10z21
v0.7.10z20
v0.7.10z19
v0.7.10z18
v0.7.10z17
v0.7.10z16
v0.7.10z15
v0.7.10z14
v0.7.10z13
v0.7.10z12
v0.7.10z11
v0.7.10z10
v0.7.10z9
v0.7.10z8
v0.7.10z7
v0.7.10z6
v0.7.10z5
v0.7.10z4
v0.7.10z3
v0.7.10z2
v0.7.10z1
v0.7.10z
v0.7.10y
v0.7.10x
v0.7.10w
v0.7.10v
v0.7.10u
v0.7.10t
v0.7.10s
v0.7.10r
v0.7.10q
v0.7.10p
v0.7.10o
v0.7.10n
v0.7.10m
v0.7.10l
v0.7.10k
v0.7.10j
v0.7.10h
v0.7.10g
v0.7.10f
v0.7.10e
v0.7.10d
v0.7.10c
v0.7.10b
v0.7.10a
v0.7.10d-e

Patch log (70 entries)

▸Cassandra· v0

Ship the audit-transparency system and CGNAT-friendly operator onboarding.

▸v.Aether.Cassandra.0· Genesis· 13 entries

v0.7.9p
v0.7.9o
v0.7.9n
v0.7.9m
v0.7.9l
v0.7.9k
v0.7.9j
v0.7.9i
v0.7.9h
v0.7.9g
v0.7.9f
v0.7.9e
v0.7.9d

Patch log (13 entries)

▸Chaos (“the pre-roster origin era”)Origin · 7 arcs· before Release A

▸Pygmalion1 historical entry · patch-number 0

Pygmalion Genesis · 1 historical entry · patch-number 0

v.Chaos.Pygmalion.0-a·git:pre-hub

Pre-hub public marketing site — sourced from version-control history (first git commit 2026-03-27); no CHANGELOG entry exists for this arc, so its single Genesis-0 member is the git provenance anchor.

▸Chiron2 historical entries · patch-number 0

Chiron Genesis · 2 historical entries · patch-number 0

v.Chaos.Chiron.0-a·v0.2.0

Second phase of v1. Lays the plumbing every later phase needs to drive remote
servers (SSH runner, driver interface, job worker, live log streaming). Ships
with a `noop` placeholder driver that exercises the full pipeline without
touching any real service — the F2 exit proof.

**What shipped**

- `ServiceDriver` interface + registry (`lib/drivers/`) — the common contract
  real drivers (SC1, MN1, WEB1, etc.) implement later. F2 includes only the
  `noop` driver for pipeline testing.
- SSH runner (`lib/ssh.ts`) — `ssh2`-based; `runRemote()` streams output via a
  callback for live log tails; `runRemoteStrict()` throws on non-zero exit.
- Job queue primitives (`lib/jobs.ts`) — `enqueueJob`, `claimNextJob`,
  `updateJobProgress`, `completeJob`, `failJob`, `cancelJob`, `reapStaleJobs`.
  Rolling 16 KB log tail per job.
- Schema migration `002_jobs_heartbeat.sql` — adds `heartbeat_at` + `worker_id`
  columns. Stale-heartbeat reaper fails orphaned jobs after 60 s.
- Worker process (`worker/index.ts`) — separate from the web tier. Polls the
  queue, dispatches to the matching driver, heartbeats every 5 s. Run with
  `npm run worker` (dev) or `npm run worker:watch` (auto-restart on file change).
  In production: a new PM2 app `ancientholdings-worker`.
- Admin job APIs — `GET /api/admin/jobs`, `POST /api/admin/jobs` (enqueue),
  `GET /api/admin/jobs/[id]`, `DELETE /api/admin/jobs/[id]` (cancel),
  `GET /api/admin/jobs/[id]/stream` (Server-Sent Events; live progress + log tail).
- Admin UI — `/admin/jobs` (list with progress bars + status badges) and
  `/admin/jobs/[id]` (live detail page with SSE streaming).
- Test fixture — `/admin/test` button enqueues a no-op job and redirects to
  its detail page so you can watch the full pipeline run in ~8 seconds.
- Admin landing page now links to Jobs queue + Driver test fixture.

**What intentionally did NOT ship**

- No real service drivers (SC1/MN1/WEB1 still ahead).
- No monitoring (MON1 — next after F2 per plan).
- No node registry UI (P7).
- F2's SSH runner is present but only exercised in SC1+; the `noop` test
  driver doesn't actually connect anywhere.

**Dependencies added**

- `ssh2` — SSH client used by the runner.
- `@types/ssh2` — TypeScript types.
- `tsx` — runs the worker directly from `.ts` source in dev and prod.

**Operational notes**

- Worker crash recovery: if the worker process dies mid-job, `reapStaleJobs()`
  (run from both the worker loop and the API) marks running jobs whose
  `heartbeat_at` is older than 60 s as `failed` with a clear error. No jobs
  get stuck in `running` forever.
- Dev smoke test: `npm run worker` in one terminal, browse to `/admin/test`,
  click "Run on fake-node-1", watch the job progress live.

---

v.Chaos.Chiron.0-b·v0.1.0

First phase of v1. Establishes the admin plumbing every later phase builds on.
Public site is unchanged; the foundation is invisible to non-admins by design.

**What shipped**

- SQLite database at `./data/app.db` (override via `APP_DB_PATH`) with a migration runner
  and the full v1 core schema: `nodes`, `node_services`, `mail_accounts`, `pins`, `jobs`,
  `backups`, `secrets_vault`, `admin_audit`, plus the provisioning columns (`provisioned_bytes`,
  `used_bytes`, `backup_capable`, `mount_point`) agreed during planning.
- Secrets vault (`lib/vault.ts`) — `seal()` / `unseal()` via libsodium `crypto_secretbox`,
  master key in `SECRETS_MASTER_KEY`.
- Admin whitelist (`lib/admin.ts`) driven by the `ADMIN_EMAILS` env var. Non-admin
  requests to admin routes get **404, not 403** — zero surface leakage.
- Admin password re-confirm: `POST /api/admin/confirm` revalidates against IMAP and
  stamps `session.adminConfirmedAt`. 5-minute freshness window via
  `requireFreshAdminConfirmApi()` for destructive endpoints.
- Audit log writer (`lib/audit.ts`) — every admin action writes a row to `admin_audit`.
- `/admin` landing page — empty state until P7 ships the node registry.
- `/admin/changelog` (this page) — renders `CHANGELOG.md` server-side, admin-only.
- Global version indicator shown on every page (`v0.1.0 · F1`).
- "Admin" link in the account menu, visible only when an admin email is signed in.

**What intentionally did NOT ship**

- Any in-house mail UI — that's P1 (inbox) onward.
- Any service drivers or SSH runner — that's F2.
- Any service monitoring — that's MON1.
- Any node registry UI — that's P7.
- Customer portal — that's v2 (CP1+).

**Dependencies added**

- `better-sqlite3` — synchronous SQLite driver.
- `libsodium-wrappers` — sealed-box secrets vault.
- `react-markdown` — renders this page.

**Env vars introduced**

- `ADMIN_EMAILS` — comma-separated list of addresses allowed into `/admin/*`.
- `SECRETS_MASTER_KEY` — base64-encoded 32-byte master key for the secrets vault.
- `APP_DB_PATH` — optional override for the SQLite file location (default `./data/app.db`).

▸Echo3 historical entries · patch-number 0

Echo Genesis · 3 historical entries · patch-number 0

v.Chaos.Echo.0-a·v0.4.0

Fourth phase of v1. The first phase where the admin UI actually shows live,
meaningful data pulled from a managed node. Built around netdata — the agent
installs over SSH with one click, our UI proxies netdata's JSON API for live
charts, the system probe inventories what's on the box without installing
anything, and an `apt upgrade` button keeps the OS current.

**What shipped**

- Schema migration 005: adds `system_probe_json` + `system_probe_at`
  columns on `nodes` to cache the last detection result.
- **System probe handler** (`lib/handlers/system-probe.ts`): SSH battery
  of detection commands that reports OS, kernel, arch, CPU, memory, disk
  mounts, nginx/docker/netdata/ipfs/chainweb/mailcow status, running
  docker containers, GNU screen sessions, listening ports. Structured
  JSON stored on the node row.
- **"Services detected" panel** (`components/admin/ServicesDetected.tsx`)
  on `/admin/nodes/[id]`: renders the probe output — core service rows
  with green/red/amber indicators, a collapsible docker-container list,
  screen sessions, disk-mount breakdown.
- **netdata install handler + button**: idempotent upstream `kickstart.sh`
  run (stable channel, telemetry disabled, non-interactive), then a Python
  edit of `/etc/netdata/netdata.conf` to force the `[web]` block to bind
  to `127.0.0.1:19999` (loopback only — no public port opens). Wires into
  the services-detected panel: netdata shows amber with an "Install"
  button when not active.
- **apt upgrade handler + button**: non-interactive
  `apt-get update && upgrade && autoremove` with the hold-config options.
  Streamed live to the jobs page.
- **Metrics proxy** (`lib/netdata.ts` + `pages/api/admin/nodes/[id]/metrics/[...netdataPath].ts`):
  admin-only passthrough that SSH-runs `curl 127.0.0.1:19999/api/v1/...` on
  the target and returns the JSON. Whitelisted endpoints
  (`info`, `data`, `charts`, `alarms`, `allmetrics`). Query strings
  sanitized against shell-meta characters.
- **Live charts** (`components/admin/NodeMetrics.tsx`): four recharts
  panels — CPU, load, RAM, network — polling the proxy every 5 seconds
  for the last 5 minutes of data. Renders only when the probe detects
  netdata active; otherwise shows a "Install first" hint.
- **"Ubuntu / Debian only" hint** on `/admin/nodes/new` so users know
  what's supported.
- Tunneling architecture: deliberately **not** a persistent SSH port
  forward. Each metrics API request opens a fresh SSH channel, runs curl
  to localhost, and returns the JSON. Trade-off: ~300 ms latency per
  request vs. a stateful tunnel. Sufficient for 5 s polling and keeps the
  hub stateless across restarts. If we ever need sub-second streaming or
  websockets from netdata, we'll build a real tunnel manager then.

**Deliberately not yet installed**

- Per-service driver actions (install StoaChain, install miner, install
  IPFS, install Mailcow, install website). Those stay with SC1 / MN1 /
  P12 / future-mailcow / WEB1 respectively. The probe tells you what's
  there; the drivers land next.
- Full-pod one-click install — that's v3 V4.

**Verified against production**

Smoke-tested end-to-end against the live 85.215.122.215 box. Probe correctly
detected Ubuntu 24.04, 12-core Ryzen 5 PRO 3600, 31 GB RAM, nginx active,
docker 28.2.2, netdata missing, IPFS with 2 pins, chainweb at cut height
1,566,039, mailcow present (19 containers), 3 screen sessions (StoaNode,
StoaMiner, cronoton), 5 disk mounts including the 2 TB `/mnt/nvmedrive`.

---

v.Chaos.Echo.0-b·v0.3.1

Patch on top of P7. Previously the only way to register a node was to paste
an existing SSH private key into the Advanced form. That works, but it&apos;s
friction-heavy for first-time setup. Added a password-based bootstrap flow
that generates a fresh keypair on the hub and installs it on the target.

**What shipped**

- Schema migration 004: adds `ssh_public_key` column to `nodes` so the UI
  can show "this is the key authorizing the hub on the server" and the user
  can find + revoke it manually if needed.
- Keypair generator (`lib/ssh-keygen.ts`) — ed25519 via Node&apos;s built-in
  crypto; emits both PKCS8 PEM (for ssh2 to consume) and single-line
  `ssh-ed25519 AAAA…` format (for authorized_keys). No native deps.
- `bootstrapNode()` in `lib/nodes.ts` — the end-to-end flow:
  1. connect with password
  2. generate keypair
  3. idempotent append of the public key to `~/.ssh/authorized_keys`
  4. verify by reconnecting with the new key only
  5. seal private key in the vault
  6. insert the node row with both `ssh_key_id` and `ssh_public_key`
  The password is used in-memory once and never stored anywhere.
- New API `POST /api/admin/nodes/bootstrap` — synchronous; the response
  includes the generated private key exactly once so the user can download
  or copy it as an emergency backup copy.
- Redesigned `/admin/nodes/new` with two tabs:
  - **Easy setup (password)** — the new flow above; success screen shows
    the private key with Download .pem + Copy buttons and a prominent
    "will not be shown again" warning.
  - **Advanced (paste key)** — the original flow, unchanged.
- Fixed several hydration bugs: locale-dependent `toLocaleTimeString()` on
  jobs pages replaced by a stable `Intl.DateTimeFormat('en-GB', hour12:false)`;
  relative timestamps ("2m ago") moved into a client-only `RelativeTime`
  component to avoid Date.now() drift between server render and client
  hydration; `<title>` with interpolated strings rewrote to template
  literals to satisfy React strict-title checking.

**Pubkey annotation**

The public key installed on the server carries a comment of the form
`ancientholdings-hub:<node-uuid>`. To revoke the hub&apos;s access from
outside the hub, remove that line from `~/.ssh/authorized_keys` on the
target.

---

v.Chaos.Echo.0-c·v0.3.0

Third phase of v1. First shippable admin feature: register the servers the hub
manages via the UI with proper SSH credential handling. Subsequent phases
(MON1 monitoring, P8 backups, service drivers) read nodes from the registry
instead of requiring manual SQL seeds.

Scope was tightened to "generic node registry + SSH connectivity test". Per-service
probes travel with each service driver as they land — not in P7.

**What shipped**

- Schema migration 003 adds `last_test_at`, `last_test_status`, `last_test_detail`
  columns to `nodes` for caching connectivity-test results.
- Node helpers (`lib/nodes.ts`) — `createNode`, `getNode`, `listNodes`, `deleteNode`,
  `publicNode` (redacts the vault key id before sending to the client).
- Job handler framework (`lib/handlers/`) — sits alongside the service driver
  registry; the worker dispatches by `job.kind`, checking handlers first, then
  falling back to drivers. Lets node-level ops (connectivity test, future
  netdata install) share the job queue without abusing the ServiceDriver
  contract.
- `node-test` handler — SSHes into the node, runs `uname / id / uptime / df`,
  parses the output, writes result onto the `nodes` row. Full output streams
  live to the UI via the F2 SSE pipeline.
- Admin API — `GET/POST /api/admin/nodes`, `GET/DELETE /api/admin/nodes/[id]`,
  `POST /api/admin/nodes/[id]/test`. Keys are sealed via `lib/vault.ts`; the
  key id never leaves the server.
- `/admin/nodes` list page with role badges + last-test status.
- `/admin/nodes/new` form — label, host, port, SSH user, private key paste,
  role picker (master-mixed / storage-fullcopy / ouronet-validator / customer-miner /
  utility), notes. Enqueues connectivity test on submit, redirects to detail.
- `/admin/nodes/[id]` detail — live connectivity test via SSE, retest button,
  service placeholder, danger zone with fresh-admin-confirm gated delete.
- Admin landing page upgraded with a direct "Add your first node" CTA.

**What intentionally did NOT ship**

- SSH key *generation* in-app (for customer-provisioned nodes): CP4 territory.
- Per-service health checks: ride with each service driver (SC1 / MN1 / WEB1 / P11).
- netdata auto-install on node add: lands in MON1 (next phase).
- Any visualization of service state: services don't exist yet.

**Dev notes**

- The `node-test` handler is the first real exercise of the SSH runner from F2.
- Worker dispatcher now checks handlers first, then drivers — symmetric with
  how the admin UI will grow (node-level ops vs service-level ops).

---

▸Proteus5 historical entries · patch-number 0

Proteus Genesis · 5 historical entries · patch-number 0

v.Chaos.Proteus.0-a·v0.5.4

Fixes a small lie from v0.5.0 (the `.ahbk` download set
`Accept-Ranges: bytes` but didn't actually parse Range headers) and
closes the gap for the `.tar.gz` download path (unresumable because the
secretstream decrypt pipeline is sequential). Also relocates the app
version label from each admin page into the global navbar.

**What shipped**

- **Proper HTTP Range support for `.ahbk`**: the download endpoint now
  parses `Range: bytes=N-M` (including suffix ranges `bytes=-N`),
  returns 206 Partial Content with `Content-Range`, streams only the
  requested window via `fs.createReadStream({start, end})`. Invalid
  ranges get 416 with a `Content-Range: bytes */<size>` hint. Download
  managers, wget -c, and browsers that retry from partial state now
  actually work.
- **Materialize-then-serve flow for `.tar.gz`**: since the libsodium
  secretstream decryptor has sequential state (can't seek), we can't
  make the in-flight decrypt Range-capable. Instead, a new flow:
    - `POST /api/admin/backups/:id/prepare-targz` kicks off a background
      decryption into `<archive>.tar.gz.ready.tmp`, atomically renames
      to `<archive>.tar.gz.ready` on completion.
    - `GET /api/admin/backups/:id/prepare-targz` polls staging status —
      `none` / `preparing` (with currentBytes vs innerBytes for %) /
      `ready` (with TTL expiry).
    - `GET /api/admin/backups/:id/download?decrypt=1` serves the ready
      file as a regular static file with Range support. 409 if not
      prepared, telling the UI to offer the Prepare button.
    - `DELETE /api/admin/backups/:id/prepare-targz` for manual discard.
    - Worker reaper sweeps stale `.tar.gz.ready` + `.tar.gz.ready.tmp`
      files older than 30 min, alongside the existing `.ahbk` expiry
      reaper.
- **`lib/targz-staging.ts`**: encapsulates the phase machine (none /
  preparing / ready), path helpers, idempotent `startPrepare` (two
  concurrent calls don't race — uses `open(path, 'wx')` as a lock),
  discard + reap.
- **Single-button 3-state UI** on `/admin/backups`:
    - `.ahbk` row action: always "Download .ahbk" (static, resumable).
    - `.tar.gz` row action: one button that transforms through
      "Prepare .tar.gz" → "Preparing X%" (disabled, live polls for
      progress) → "Download .tar.gz" (green, with expiry subtext and a
      Discard link). Position doesn't change, label does.
- **Auto-delete on completion**: both formats follow the same rule as
  before — full (non-Range) response that finishes flushed triggers
  `deleteBackup()`. Range responses never trigger auto-delete (a
  resumed session may span multiple requests and we don't track
  cross-request coverage), so the admin either manually discards or
  the TTL reaper catches it.
- **Honest info block** on `/admin/backups` explaining both formats'
  resumability behavior, the preparation step for `.tar.gz`, and the
  30-min staging TTL.
- **Version label relocated to navbar**: admin-only; appears next to
  the "Ancient Holdings" wordmark when signed in as admin, linking to
  `/admin/changelog`. Removed the duplicate version spans from every
  admin page title row (saves a line of visual noise on 9 pages).

**Safety & correctness notes**

- Staged `.tar.gz` is plaintext at rest on the hub during its TTL
  window. Single-admin hub: modest risk, we called it out in the UI
  copy. If this becomes concerning we could add "encrypt at rest with
  an ephemeral key held only in memory during the prepare→download
  cycle" but that complicates resumability.
- Range downloads don't auto-delete. This is deliberate — we can't
  tell from one request whether the user has now finished downloading.
  The 30-min reaper + Discard button handle cleanup.
- Staging file lifecycle is managed by filesystem state (no DB column
  needed). This keeps the DB as the source of truth for the archive
  itself while the plaintext staging is a fungible cache.

**Verified with**

- Type-check clean.
- Range parsing handles `bytes=0-`, `bytes=100-499`, `bytes=-100`,
  rejects `bytes=5000000-` on a 1 MB file with 416.
- `.tar.gz` prepare flow: empty state → Prepare click → .tmp file
  appears → status flips to preparing with bytes-in-progress → on
  completion, `.ready` exists + status flips to 'ready' → Download
  serves the file with Range → successful full-file download deletes
  both `.ahbk` and `.ready`.

---

v.Chaos.Proteus.0-b·v0.5.3

Closes the last operational gap on key rotation. Before this, rotating
the master key left the worker process holding the **old** key in its
`process.env` until you remembered to restart `npm run worker` (or
`pm2 restart`). v0.5.3 makes the whole cycle self-healing: the rotation
flow signals the worker to hold off, the worker polls `.env.local` for
the new key, and the UI shows a live "worker in sync ✓" indicator so
the admin can confirm the propagation without checking logs.

**What shipped**

- **Migration 008**: new `system_state` (key/value flag table — first
consumer is `rotation_in_progress`) and a `master_key_fingerprint`
column on `worker_leadership` so the worker can publish the short
sha256 prefix of its current master key on every heartbeat.
- **`lib/system-state.ts`**: tiny `getFlag` / `setFlag` / `clearFlag` /
`isFlagSet` helper. Generic enough to reuse for future coordination
(rotation_in_progress today, maintenance mode / probe-lock / etc.
later).
- **`lib/rotation.ts`** now wraps the rotation body in `setFlag` /
`clearFlag` (`rotation_in_progress`) via `try/finally` — flag is
always cleared even if rotation throws.
- **Worker (`worker/index.ts`)**:
- On boot and every 10 s thereafter, re-reads `.env.local` for
`SECRETS_MASTER_KEY`. If the on-disk value differs from the
in-process copy, updates `process.env` and logs
`[worker] SECRETS_MASTER_KEY reloaded (<oldfp> → <newfp>)`.
- Before each job-claim attempt, checks `rotation_in_progress` — if
set, logs `holding off on new jobs` and sleeps. Logs `resuming`
when the flag clears. Closes the race where a queued job could
transition to `running` mid-rotation.
- On every lease renewal, publishes its current master-key
fingerprint into `worker_leadership.master_key_fingerprint` for
the web tier to see.
- **`GET /api/admin/worker-status`**: admin-only endpoint returning
`{hub: {masterKeyFingerprint}, worker: {...fp, isFresh}, rotationInProgress, keyInSync}`.
UI uses it to flip the ✓ / ⚠ indicator.
- **`/admin/security` rotation card**: after a successful rotation,
polls `/api/admin/worker-status` every 1 s for up to 30 s. Shows
three states: pulsing "waiting for worker" (with divergent
fingerprints displayed), green "✓ worker picked up the new key",
or amber "⚠ worker didn't pick up within 30 s" with a one-line
restart hint. Wait window = ~3× the worker's env-poll cadence.

**Result**

End-to-end rotation is now a single click with zero manual follow-up
on a healthy hub. The old "restart the worker after rotating" footgun
is gone.

**Deliberately not in this patch**

- Graceful worker restart via PM2 signal. The poll-the-env-file
approach already solves the problem; a signal-based path is only
worth building if we later find `.env.local` polling doesn't work
for some deployment (e.g. Docker secret mounts).
- Key *versioning* in archive headers ("which master key was used to
wrap this?"). Useful for forensics / cross-hub archive portability;
separate feature.

**Verified with**

- Type-check clean.
- Worker env-file parser unit-tested inline via the same approach used
for `upsertEnvVar` — correct handling of unquoted, single-quoted,
double-quoted values.
- Integration test planned: rotate via UI, watch worker log print the
`SECRETS_MASTER_KEY reloaded` line, then the UI indicator flip ✓.

---

v.Chaos.Proteus.0-c·v0.5.2

Second half of the master-key story. v0.5.1 made the key exportable;
v0.5.2 makes it replaceable — in place, on a live hub, without
re-uploading any archive body. Retires the "the key I minted 6 months
ago is probably still fine" posture.

**What shipped**

- **`lib/rotation.ts`** — core rotation logic. Generates a fresh 32-byte
  key, walks the vault + known `.ahbk` archives, unwraps everything with
  the OLD master into memory (in-memory plan), then applies:
    1. archive header rewrites in place (only `wrapped.nonce` +
       `wrapped.ciphertext` change — identical JSON byte length by
       construction; verified before any write)
    2. vault re-seal as a single DB transaction
    3. `.env.local` update via the new `lib/env-file.ts` upsert helper
       (preserves comments + other vars)
    4. `process.env.SECRETS_MASTER_KEY` flipped in memory so the running
       hub uses the new key immediately — no PM2 restart required
  On any failure, previously-rewritten archive headers are restored
  from memory; the DB transaction rolls itself back.
- **`lib/env-file.ts`** — small upsert-in-dotenv helper with
  safety rails (no newlines, valid env-var name pattern, atomic write
  at `0o600`).
- **`POST /api/admin/security/rotate-master-key`** — fresh-confirm gated
  endpoint. Requires body `{acknowledgedExport: true}` so a rotation
  without a backed-up key is impossible via the UI. Returns counts +
  first-16-hex-chars SHA-256 fingerprints of both keys (so the admin
  can visually confirm the key actually changed without exposing it in
  the response). Every call audit-logged, including the fingerprints.
- **Pre-flight guard**: rotation refuses if any job is `running`. Avoids
  mid-run vault unseal against the new key.
- **`/admin/security` rotate card**: explainer text, safety model,
  two required checkboxes ("I have exported the current key",
  "I understand the consequences"), and a success panel showing row
  counts, duration, and old/new fingerprints.

**Safety notes**

- Archive header re-writes are verified to preserve byte length before
  any write happens; if a serialization quirk would change the length
  the rotation aborts with a clear error rather than corrupt the file.
- The body of each `.ahbk` never changes — only the JSON header's
  wrapped-key fields. So a pre-rotation download with the old key still
  works for archives already stored locally; re-downloads through the
  hub use the new key.
- Key rotation does **not** migrate archives that belong to a different
  hub (different master key). The rotation aborts loudly if any
  completed archive doesn't unwrap with the current master.

**Deliberately not in this patch**

- Importing a known key (disaster-recovery seed) — still manual via
  `.env.local`.
- Rotation *scheduling* / policy enforcement (e.g. "rotate every 90
  days"). Admin-driven for now.
- Archive re-encryption with a new *content* key (would require reading
  + re-writing archive bodies). Not needed for master-key rotation; only
  worth it if a specific content key is suspected compromised.

**Verified with**

- Type-check clean.
- `lib/env-file.ts` smoke-tested inline: upserts existing var in place,
  appends when missing, preserves surrounding vars/comments.
- Rotation plan validates `newHeaderBytes.length === headerLen` before
  any write; planned but untested path: rollback of partial archive
  rewrites. Relies on old header bytes held in memory for the duration
  of the rotation call.

---

v.Chaos.Proteus.0-d·v0.5.1

Follow-up patch to v0.5.0 closing the "what happens if the hub disappears"
gap on `.ahbk` archives. Before this, the hub's `SECRETS_MASTER_KEY` lived
only in `.env.local` on one machine — lose the file, lose every backup
forever. This patch makes the key exportable and the format
reimplementable from scratch.

**What shipped**

- **`docs/ahbk-format.md`** — authoritative byte-layout + JSON header
  schema + crypto-primitive spec for the `.ahbk` format. Any competent
  engineer with libsodium and this doc can reimplement the decoder from
  scratch. Ships alongside the reference encoder (`lib/archive.ts`) and
  the reference decoder (`bin/dr-tool.mjs`).
- **`bin/dr-tool.mjs`** — standalone CLI decryptor. Node + libsodium,
  nothing else. Subcommands: `info` (print header, no key needed) and
  `decrypt` (unwrap with master key, write inner `.tar.gz`). Intended for
  keeping on the USB stick next to your exported key + archive so you can
  restore from any machine with Node installed.
- **`/admin/security`** page — new admin route. Shows what the master key
  is, what it secures, why to export, and a warning block treating it
  like a root password. "Reveal master key" button triggers the existing
  fresh-admin-confirm flow (5-minute window); on success shows the base64
  in a monospace block with Copy and "Download as .txt" actions.
- **`/api/admin/security/master-key`** endpoint. GET returns the key as
  JSON (view mode) or a plaintext download (`?mode=download`). Gated by
  `requireFreshAdminConfirmApi`. Every call — success or failure — writes
  a `security.master_key.reveal` row to the admin audit log with mode
  (view/download), so repeated reveals are visible in the trail. Response
  headers set `Cache-Control: no-store` to defend against intermediate
  caches.
- Admin landing page gains a 🔑 Security entry.

**Deliberately not in this patch (→ v0.5.2)**

- Key *rotation* (regenerate + re-wrap every vault row + every `.ahbk`
  header). Sequenced after this patch because you should have a backed-up
  copy of the current key before attempting rotation.
- *Importing* a key on first boot (disaster-recovery seed). You still
  paste the value into `.env.local` manually today.

**Verified with**

Type-check clean. Reveal + copy + download round-tripped against the v0.5.0
worker. `dr-tool.mjs info` against a freshly-produced archive prints the
expected header; `dr-tool.mjs decrypt` round-trips to the original `.tar.gz`.

---

v.Chaos.Proteus.0-e·v0.5.0

Ships the first hub-orchestrated backup flavor end-to-end: click "Backup now"
on the StoaChain tab, the worker runs the full flow, you get an encrypted
.ahbk archive on `/admin/backups` you can download to your Windows machine.
Port of the user's `stoa-backup-now.sh` / `stoa-remote-daily-backup.sh` into a
hub job handler, but with a Node-native pipeline (no rsync/zstd binary
dependency on the hub).

**What shipped**

- Schema migration 007: extends `backups` with `status`, `label`, `local_path`,
  `started_at`, `completed_at`, `expires_at`, `run_kind`, `error`,
  `remote_backup_id` + a couple of indexes.
- `lib/backups.ts`: CRUD helpers (create / get / list / delete /
  markCompleted / markFailed), auto-prune of expired archives (default
  **1 day** for manual, 14 for scheduled — manual is short because the
  download auto-deletes anyway), plus a two-tier archive path resolver:
  primary dir `APP_BACKUPS_DIR` (defaults to `./data/backups` next to the
  app), with spillover to `APP_BACKUPS_SPILL_DIR` when the primary
  filesystem doesn't have ~1.2× the estimated archive size free. Lets the
  hub live on a small partition while spilling large archives onto a
  bigger mount (e.g. `/mnt/nvmedrive/StoaBackups` in production).
- `lib/archive.ts`: the `.ahbk` encrypted archive format. Magic "AHBK" +
  version byte + length-prefixed JSON header + libsodium `secretstream`
  body. Envelope encryption: per-archive random 32-byte key wrapped with
  the hub's `SECRETS_MASTER_KEY` via `crypto_secretbox`. Gzip compression
  (Node native) sits between the tar source and the encryption stream,
  so body bytes are compressed-then-encrypted. Outer sha256 stored on the
  backup row for download integrity; inner sha256 + plaintext byte count
  stored in the archive header for future offline verification by dr-tool.
- `lib/handlers/backup-stoachain.ts`: the first backup flavor. Ports the
  user's bash scripts:
  - `POST /chainweb/0.0/stoa/make-backup?backupPact` over SSH+curl (no public
    HTTP), gets the backup id
  - polls `/check-backup/<id>` every 15 s, handling `backup-in-progress` /
    `backup-done` / `backup-failed` with proper logging
  - opens a fresh `ssh2` connection, runs `tar c -C .../backups/<id> .`
    on the remote, and streams stdout directly into the local encrypted
    archive builder — no rsync, no staging dir on the hub, no SSH key on
    disk
  - records remote backup id, inner sha256, plaintext size in the archive
    header metadata
- API routes:
  - `POST /api/admin/nodes/[id]/backup` — enqueues a backup job
    (flavor-dispatched; only `stoachain-backup-api` implemented here, others
    return 400 until their flavors ship)
  - `GET /api/admin/backups` — list
  - `GET /api/admin/backups/[id]` — single backup
  - `DELETE /api/admin/backups/[id]` — gated by fresh-admin-confirm
  - `GET /api/admin/backups/[id]/download` — raw `.ahbk`; add `?decrypt=1`
    to stream the decrypted `.tar.gz` instead (hub-side decrypt; useful
    before `dr-tool` lands). After a fully-flushed response the archive
    auto-deletes (server-side copy is transient; the download *is* the
    point). Client-aborted downloads preserve the archive so the user
    can retry.
- `/admin/backups` index page: table with status badges, size, created,
  Download .ahbk + Download .tar.gz actions per completed backup
- "Backup" section on the StoaChain tab with a **Backup now** button and
  an expandable "How it works" block documenting the whole 6-step pipeline
  + disk usage + retention. Pre-flight warning when `--enable-backup-api`
  isn't in the last probe's startup flags (detection source explained
  inline + nudge to re-probe if stale)
- **Granular live progress**. chainweb's `check-backup` only returns
  `backup-in-progress` / `backup-done`, so we observe the checkpoint dir
  with `du -sb` every 15 s and report bytes written vs. a live-measured
  `/mnt/nvmedrive/StoaNodeData` baseline. During the streaming phase,
  progress + throughput (MB/s) come from the in-flight byte count. The bar
  moves through all five phases instead of freezing during the longest two.
- Link to `/admin/backups` from the admin landing page
- `backup-stoachain` handler registered in the handlers registry

**Fully Node-native pipeline**

The backup flow uses no external CLI binaries on the hub — only things
reachable via `npm install`:
- `ssh2` for the SSH transport + `exec` channel
- Node's built-in `zlib` for gzip compression
- `libsodium-wrappers` for the envelope encryption
- `crypto` for sha256 manifests

Rationale: makes the hub deployable on any OS without worrying about which
version of `rsync` / `zstd` / `tar` is present. The remote side still needs
`tar` (Linux default) and `curl` (already present on every server we'd
manage). No new dependencies.

**Deliberately not in this pass**

- Other flavors: Mailcow, IPFS pins, nginx/service configs, full-node. Each
  gets its own handler — architecturally the same shape, just different tar
  sources. Follow-up passes.
- Off-site destinations (S3-compatible, SSH-to-backup-node, backup-storage-role
  node). Deferred until the user decides on a provider; the `destinations_json`
  column on backups is already wired for this.
- Scheduled / rules engine. Deferred until there's a real off-site destination
  to push to.
- Standalone `dr-tool` binary. Deferred — for now the hub itself decrypts
  (the `?decrypt=1` download option).

**Verified with**

Type-check clean. End-to-end smoke test planned against the production box
after the worker restart.

---

▸Jason49 historical entries · patch-number 0

Jason Genesis · 49 historical entries · patch-number 0

v.Chaos.Jason.0-a·v0.7.4q

Finalizes the role matrix that was nudged into existence by the real
handover flow (ancient admin sets up a hub, then hands accounts off to
modern admins who in turn manage their own clients).

### Admin console — per-link access tags

The `/admin` quick-links list now prefixes each entry with a compact
three-glyph role badge:

- `★` ancient (gold)
- `◆` modern (blue)
- `◇` client (grey)

A glyph lights up when that role can access the page; greyed out
otherwise. Unavailable links render disabled with a small
"(restricted)" tag — visible but unclickable, so modern/client admins
can see at a glance what exists but isn't theirs to touch.

### Role matrix tightened

- **Acolytes** — now ancient-only (page gate + API GET both locked to
  ancient). Modern admins were never able to mutate the roster (already
  ancient-only there), but they could browse. That's gone now — the
  public-site team roster is an ancient concern.
- **Admins & Clients page** — no longer loads for clients. The page gate
  now 404s any role below modern. (Quick-links already hid it, but a
  direct URL hit would still render a stripped view.)
- **Client management** (`/api/admin/clients/...`) — promote, revoke,
  and reset-onboarding actions now accept both ancient and modern admins
  via the new `requireFreshAdminNonClientConfirmApi` guard. Client role
  itself is still rejected.
- **Admins roster API** — GET now rejects clients explicitly
  (404, not 403, keeping the "not-admin" veneer).
- **Admins page UI** — Promote-to-Client form, Revoke, and Reset
  Onboarding buttons now render for modern admins too.
  Grant-Modern-Admin stays ancient-only.
- **Mailcow mailbox list** — modern admins now fetch it at page load so
  the Promote-to-Client picker populates for them.

### Files touched

- `lib/admin.ts` — added `requireFreshAdminNonClientConfirmApi`.
- `pages/admin/index.tsx` — new `<AccessTag>` + `<QuickLink>` helpers;
  Quick-links rewritten to use them.
- `pages/admin/acolytes.tsx` — role gate tightened to ancient.
- `pages/admin/admins.tsx` — UI gates + effect dependency for Mailcow
  fetch now include modern role.
- `pages/api/admin/acolytes/index.ts` — `requireAncientAdminApi`.
- `pages/api/admin/clients/index.ts` — POST guard swapped;
  GET rejects clients.
- `pages/api/admin/clients/[email].ts` — DELETE + PATCH guards swapped.
- `pages/api/admin/admins/index.ts` — GET rejects clients.

---

v.Chaos.Jason.0-b·v0.7.4j

Five slices bundled in one release so v0.7.4 closes cleanly. Each
addresses a gap surfaced during today's real-world VPS onboarding.
Phase code → **CR3** (Client Role 3 — onboarding end-to-end).

### v0.7.4e — Install Wizard Certificate step

Previously: fresh installs got a self-signed P-256 cert and the
operator had to manually go to the Identity tab, paste the DuckDNS
token, run Obtain-LE, restart. Long chain of clicks + context
switches with the DuckDNS dashboard.

Now: Identity step in the wizard expands when `p2pHostname` ends in
`.duckdns.org`. Extra fields: DuckDNS token (required for auto-LE)
+ email (optional). On install, after `docker compose up`, the
handler runs `certbot certonly --manual --preferred-challenges dns`
with auto/cleanup hooks that hit DuckDNS's update API. Cert files
land at `<tlsDir>/tls-{cert,key}.pem` (what compose mounts) →
container restarts → node emerges with CA-signed cert ready to peer.

Renewal deploy-hook installed at
`/etc/letsencrypt/renewal-hooks/deploy/stoa-inst.sh` — re-copies
the renewed cert to the compose-mount paths and restarts
stoa-node automatically on every certbot.timer fire.

Non-DuckDNS hostnames: LE step skipped; self-signed bootstrap
remains and operator can run Obtain-LE manually (existing flow,
unchanged).

### v0.7.4j — Seed-at-install option

Install Wizard Profile step grew a checkbox: "Install with current
hub seed (recommended)". Shows seed cut height + size + donor.
Defaults ON when a current seed exists. On apply, install handler
replaces empty chainweb boot with an inline call to the existing
`stoachainReseedHandler` (reuses v0.7.3c-f's rollback + cert-preserve
+ stream-plumbing logic). Net: install completes with chainweb at
the donor's cut-at-backup time, not cut=0. Minutes saved on stoa;
hours-to-days on Kadena-mainnet-sized chains later.

If no current seed on hub, the checkbox turns into a grey note
linking to `/admin/seeds` to produce one first.

### v0.7.4g — Already-managed detection at Add-Node

New preflight endpoint `POST /api/admin/nodes/already-managed-probe`.
SSHes with password auth to the target and runs 5 detection checks:
- `/etc/sudoers.d/ancientholdings-stoa` exists (hub-sudoers file)
- chainweb-node process running
- `stoa-node` container present
- `RunStoaNode.managed.sh` file under `/home`, `/mnt`, or `/srv`
- `ah-hub:` / `ancientholdings-hub` marker in authorized_keys

If any trigger, the Add-Node wizard's Bootstrap submit surfaces a
`window.confirm` listing detected signals before proceeding.
Operator can Cancel (safe default) or click OK to force-adopt
anyway (e.g. re-adding after accidental delete, or they've already
cleaned up another hub's leftover).

Non-destructive probe — purely read-only. Prevents the
"two hubs dueling for one server" footgun.

### v0.7.4h — Key-purge on unmanage + `/admin/orphans` page

Node DELETE endpoint rewritten. New flow:

1. SSH into the target, remove any line in `~/.ssh/authorized_keys`
   (and `/root/.ssh/authorized_keys`) containing the `ah-hub:`
   marker. Backup copy left as `*.bak.<timestamp>`.
2. Unconditionally delete vault secret + nodes row (the hub commits
   to losing its SSH access regardless of whether step 1 succeeded).
3. If step 1 failed (network partition, target offline, auth
   failure), write a row into `node_orphans` capturing what was
   attempted + the error.

New admin page `/admin/orphans` (ancient-only). Lists unresolved
orphans with clear "SSH in yourself and remove the ah-hub: line"
instruction + a "Mark resolved" button. Keeps resolved history
(last 20) with the operator's cleanup note.

### v0.7.4i — Onboarding transparency modal for clients

First time a `client`-role admin lands on `/admin`, modal appears:
- Names the hub's ancient admin (first in env list)
- States plainly: hub has full SSH access to their managed
  servers, every action is audit-logged, client retains ownership,
  unmanage removes the hub's key
- "I understand — continue" stamps `clients.accepted_transparency_at`
  (one-way). "Cancel — sign me out" redirects to home.

Modal fires **only** for role=client. Ancient/modern admins see
nothing (they already know the game). Uses new endpoints:
- `GET /api/admin/clients/me` — role + acceptance stamp
- `POST /api/admin/clients/me` — stamp acceptance (no-op if
  already accepted)

### Backlog

New `plans/BACKLOG.md` seeded with:
- **Storage-partition awareness per service** (user-requested
  today) — ability to see where each hub-hosted service lives
  (partition + path + free space) and move services between
  partitions. Matters once hub hosts multiple websites
  (caduceus subdomain + others). Live server currently has a
  480 GB partition at 29% that will eventually need management.
- A few smaller items surfaced in today's VPS-onboarding arc.

### Version bump

- `lib/version.ts` → **v0.7.4j**. Phase code **CR3** (Client Role
  3 — onboarding end-to-end). Closes the v0.7.4 phase as planned in
  `plans/v0.7.4-client-role.md`.

With this release a fresh VPS → synced stoa peer is **one form in
the Install Wizard** (DuckDNS token being the only external dance
the operator still does manually — grab it once from DuckDNS
dashboard). Original 45-min ops slog compressed to ~10 min.

---

v.Chaos.Jason.0-c·v0.7.4k

User spotted an inconsistency on `/admin/seeds`: AncientMiner's row
showed `bytales.duckdns.org` (nice DNS name) while IonosFiveVPS
showed `82.165.48.252` (raw IP) — even though IonosFive now has
`kjrkentolopon.duckdns.org` as its P2P identity.

Cause: the UI was showing `nodes.host` (SSH entry point from the
Add-Node wizard). AncientMiner happened to be added via its
DuckDNS name for SSH; IonosFive was added via raw IP. The two
don't have to match.

**Fix**:
- `ManagedNodeSeedRow` gains `p2pHostname: string | null` —
  populated from live argv's `p2p-hostname` flag (skipping the
  `0.0.0.0` placeholder).
- Seeds page prefers `p2pHostname` when displaying node identity,
  falls back to `host` (SSH) if no p2p-hostname is set yet.
- Appends `(ssh: <host>)` in muted text when the two differ, so
  the operator can still see the SSH entry point at a glance.
- Tooltip explains which is which on hover.

Behavior preserved: the backing data still uses `host` for SSH.
Only the display changed.

**Version bump**
- `lib/version.ts` → `v0.7.4k`. Phase stays `CR2`.

v.Chaos.Jason.0-d·v0.7.4f

**The real bug behind "fresh install won't sync."** User with a fresh
IonosFive VPS had:
- LE-signed cert ✓
- Real DNS hostname pointing at the box ✓
- `p2p-hostname` set to that hostname ✓
- Port 1789 reachable ✓

And still cut=0, no peers, no sync. Diagnosis:

`RECOMMENDED_PROFILE` (what the Install Wizard uses) does not
include `known-peer-info`. The `stoa` custom chainweb variant has
**no built-in bootstrap peer list** — that's a `mainnet01`-only
thing baked into upstream chainweb-node. So a fresh `stoa` node
with no `known-peer-info` has **zero peer-discovery seeds** and
sits at cut=0 forever waiting to be contacted, which can't happen
either since its hostname was just created seconds ago and nobody
in the network knows about it.

**Fix**: add `'known-peer-info': ['node1.stoachain.com:1789',
'node2.stoachain.com:1789']` to `RECOMMENDED_PROFILE`. Two entries
for redundancy — a fresh node survives one seed being
temporarily down. Once peer gossip discovers the broader graph
on first handshake, the seed entries become non-critical.

`ANCIENT_PROFILE` had one entry already; parity restored.

**For existing nodes that were affected**: add `known-peer-info`
manually via Flag Editor and Restart — takes 2 min. Or re-run
Install Wizard (cleanup auto-wipes and re-installs with the new
default).

**Version bump (CR2 continues)**
- `lib/version.ts` → `v0.7.4f`. Skipped `.e` because that slot is
  reserved for the "certificate step in Install Wizard" slice
  (still planned; this patch unblocks the current user first).

v.Chaos.Jason.0-e·v0.7.4d

Install handler's self-signed cert generator was still emitting
ECDSA P-384 / SHA-384 — a copy-paste carryover from chainweb-node's
example script that never matched any production Stoa node.
node1 / node2 / AncientMiner all use ECDSA P-256 / SHA-256 per
`RunStoaNode.sh`. v0.7.3g fixed this for `stoachain-cert-rotate`
but missed `stoachain-install` — same class of miss as the
compose-plugin one.

**Fix**: install handler's `openssl req -x509` now uses
`-newkey ec -pkeyopt ec_paramgen_curve:P-256 -sha256`.

**Important note for the operator**: the curve change does NOT
make peers accept the node. Chainweb-node validates peer certs
against the **system CA bundle**; any self-signed cert (P-256 or
P-384) is rejected as "unknown CA" — verified fact from the
node2 TLS forensics. A fresh install thus produces a
chainweb-node that peers refuse. To get peer acceptance:

1. Point DNS for your chosen `p2p-hostname` at the new VPS.
2. Chainweb tab → Identity → "Obtain Let's Encrypt certificate"
   (HTTP-01 if port 80 is free; DNS-01 for DuckDNS).
3. Restart the node.

v0.7.5 (planned) folds the certbot step into the Install Wizard
itself, so the wizard asks "enter hostname + obtain LE now?" at
install time. For now, it's a manual post-install click.

**Version bump**
- `lib/version.ts` → `v0.7.4d`. Phase stays `CR2`.

v.Chaos.Jason.0-f·v0.7.4c

Install failed on a fresh Ionos Ubuntu 24.04 VPS with empty error
"docker compose up failed:" — same class of bug that
convert-supervision had through v0.7.3r/t but the install handler
never got the fix. Two root causes hit simultaneously:

1. Ionos's default docker CLI is from `docker.io` apt package, which
   **does not include the compose plugin**. `docker compose up -d`
   parses `-d` as a top-level docker flag and chokes.
2. The install handler's `dockerComposeUp` used `2>&1` to merge
   stderr into stdout but then only read `r.stderr` on failure —
   always empty. The operator saw "docker compose up failed:" with
   nothing after the colon.

**Fixes (`lib/handlers/stoachain-install.ts`)**
- New preflight step 5b: `ensureDockerComposePlugin(target)` runs
  between `docker pull` and `docker compose up`. If `docker compose
  version` fails, fetches the v2.29.1 compose plugin binary from
  GitHub releases and drops it into
  `/usr/libexec/docker/cli-plugins/docker-compose`. Architecture-
  aware (x86_64 / aarch64 / armv7). Uses sudo + tee + chmod (all
  already in the canonical sudoers list).
- `dockerComposeUp` now:
  - streams compose output live via `onChunk` (you see pull/create/
    start progress in the job log in real time)
  - captures the merged output and includes it in the error message
    (tail -600 chars, with explicit exit code)
  - adds a defensive `docker rm -f stoa-node` before compose up to
    survive stale containers from prior failed installs
  - bumped timeout 60s → 5min for cold image pulls

**Version bump**
- `lib/version.ts` → `v0.7.4c`. Phase code stays `CR2`.

v.Chaos.Jason.0-g·v0.7.4b

Second slice of v0.7.4. Pure-plumbing v0.7.4a now has a visible
surface — you can promote mailcow mailboxes to `client`, assign nodes
to those clients, and clients will see only their own nodes.

**Phase code**: v0.7.4b ships as `CR2` (Client Role 2 — promotion +
ownership UI).

**Admin page — Clients section (`/admin/admins`)**
- New "Clients" roster section (parallel to the Ancient+Modern
  roster). Shows each client's email, promote-date, promoter, and a
  "pending onboarding" badge when `accepted_transparency_at` is null
  (wired in v0.7.4e).
- New "Promote to Client" form (ancient-only) — dropdown lists
  Mailcow mailboxes that aren't already admins / clients.
- Revoke button on each client row (ancient-only; warns that nodes
  owned by revoked client become stranded until reassigned).
- Page re-titled "Admins & Clients" with updated 3-tier intro copy.

**New API routes**
- `GET /api/admin/clients` — list clients.
- `POST /api/admin/clients` — promote an email (ancient + fresh-confirm).
  Refuses if email is already ancient/modern (upgrade path has to go
  through explicit tier removal first).
- `DELETE /api/admin/clients/[email]` — revoke (ancient + fresh-confirm).

**Nodes list (`/admin/nodes`)**
- Shows `owner:` line per node (email or "unowned · ancient-only").
- SSR filters the list by ownership — modern/client admins see only
  nodes they own. Ancient sees all, including unowned.

**Node detail (`/admin/nodes/[id]`)**
- SSR returns 404 if caller can't `canAccessNode` (same behavior as
  the API layer — no leak between "doesn't exist" and "not yours").
- New `OwnerRow` component under the SSH line. Shows the owner email
  or "unowned · ancient-only". Ancient admins get a "change" link
  that inline-edits the field, fresh-confirms via password modal,
  PATCHes `/api/admin/nodes/[id]/owner`, reloads.
- New `PATCH /api/admin/nodes/[id]/owner` API route (ancient + fresh).

**Add-Node wizard (`/admin/nodes/new`)**
- New "Owner email" field at the bottom of the shared form. Defaults
  to the admin doing the adding.
- Ancient admins can type any email. Modern/client admins see the
  field but it's locked to their own email (the API also refuses
  mismatched ownership for non-ancient callers).
- Both `POST /api/admin/nodes` (paste-key) and
  `POST /api/admin/nodes/bootstrap` (password bootstrap) accept
  `ownerEmail`, default to caller, validate.

**Schema changes**
- `CreateNodeInput` and `BootstrapInput` gain `ownerEmail?: string | null`.
- `NodeRow` and `PublicNode` gain `owner_email: string | null`.
- `bootstrapNode` persists `owner_email` at INSERT time; defaults to
  `issuedBy` if caller didn't specify.
- `createNode` persists `owner_email` at INSERT time (lowercased).

**How to test after dev reload**
1. Log in as ancient admin → `/admin/admins` → Clients section
   empty. Pick a non-admin mailbox → "Promote to Client". Confirm
   it appears in the Clients roster.
2. `/admin/nodes/[id]` → click "change" next to Owner. Assign it
   to the client you just promoted. Save.
3. Sign out. Sign in as the client's email. You land at `/admin`
   with their node visible at `/admin/nodes`. No other admin pages
   accessible (they'd 404).

**Version bump**
- `lib/version.ts` → `v0.7.4b`, phase `CR2`.

v.Chaos.Jason.0-h·v0.7.4a

Starts the v0.7.4 phase (client role + ownership) per
`plans/v0.7.4-client-role.md`. This slice is **pure plumbing** — no
user-visible changes yet. Subsequent slices (b–e) add the UI for
promotion, owner assignment, already-managed detection, key purge on
unmanage, and the onboarding transparency modal.

**Phase code**: v0.7.4a ships with phase code `CR1` (Client Role 1 —
Ownership plumbing), replacing SC5.

**Migration 016**
- `nodes.owner_email TEXT` — nullable column. Pre-v0.7.4a rows keep
  NULL = "unowned, ancient-only". Fresh Add-Node flows in v0.7.4b
  will populate it explicitly.
- New `clients` table — mirrors `modern_admins` shape. Email +
  created_at + created_by + accepted_transparency_at (null until
  v0.7.4e's modal consent).
- New `node_orphans` table — audit trail for unmanage attempts where
  the hub couldn't remove its SSH key from the target. v0.7.4d
  populates it.

**`AdminRole` extended**
- Added `'client'` to the union. Priority: `ancient > modern > client`.
- `getAdminRole()` checks `clients` table when neither ancient-env nor
  `modern_admins` matches.

**New helpers (`lib/admin.ts`)**
- `canAccessNode(caller, node)` — ancient always; modern/client only if
  owner_email matches their email; null owner = false for non-ancient.
- `requireOwnedNodeApi(req, res, opts?)` — route guard combining
  `requireAdminApi` + node lookup + ownership check. Returns
  `{ email, role, session, nodeId, ownerEmail }`. Pass `{ fresh: true }`
  for fresh-confirm routes. 404s uniformly on unauthorized or
  not-found (no surface leak).

**Node-route wiring (13 files updated, 8 skipped)**
- Updated (now ownership-scoped):
  `[id].ts`, `apt-upgrade`, `backup`, `metrics/[...netdataPath]`,
  `netdata-install`, `probe`, `stoachain/control`, `stoachain/docker-logs`,
  `stoachain/flags` (GET only — PATCH stays ancient), `stoachain/logs`,
  `stoachain/peer-activity`, `stoachain/preflight`, `stoachain/status`,
  `test`.
- Skipped (ancient-only by design, bypass ownership):
  `drive-benchmark`, `stoachain/cert-rotate`, `stoachain/certbot-obtain`,
  `stoachain/convert-supervision`, `stoachain/install`,
  `stoachain/peer-trust-reset`, `stoachain/reseed`, `sudoers-repair`.

**Master plan updated**
- `plans/control-hub.md` §16 Progress log: added the 2026-04-18 → 2026-04-21
  SC-series build-out summary + the v0.7.4a entry.

**Next**: v0.7.4b — client-role promotion UI in `/admin/acolytes` +
owner-assignment UI on node detail.

v.Chaos.Jason.0-i·v0.7.3af

v0.7.3ae's resolver fixed node2-hardcoding but had a gap: adopted
docker nodes (never went through the Install wizard) have
`stoachain_runner_path = NULL` in the DB, so the resolver fell
through to "use live argv's `--database-directory`". For docker
nodes that value is the **container-internal** path (`/data`) because
chainweb-node runs inside the container. Resolver would have
returned `/data/backups` and tar would have failed again.

Hit on live for AncientMiner.

**Fix**: when supervision is docker AND we have no captured runner
path, run `docker inspect stoa-node` to read the host source of the
`/data` bind mount. That's the authoritative host data dir.

**Resolution flow (now)**
1. Hub-installed docker (runner_path ends compose.yml) → derive
   from stoa-root convention
2. Adopted docker (runner_path NULL, supervision=docker) →
   `docker inspect` the `/data` mount source
3. screen/systemd → live argv's `--database-directory` (host path)
4. Fallback → stored flags' database-directory
5. Throw with actionable error if nothing resolves

The `/data !== db` sanity guard now also prevents accidentally
treating a container-internal path as a host path in the later
fallbacks.

**Version bump**
- `lib/version.ts` → `v0.7.3af`.

v.Chaos.Jason.0-j·v0.7.3ae

Two bugs surfaced once v0.7.3ad stopped auto-promoting junk seeds
and forced the real failure into visibility:

**Bug 1: backup handler had the remote backup dir hardcoded**
(`/mnt/nvmedrive/StoaNodeData/backups`). Worked for node2 by
coincidence; every other node's tar ran against a non-existent
path and produced an empty archive. Seen in the wild on live's
AncientMiner attempt:

```
tar: /mnt/nvmedrive/StoaNodeData/backups/1776731165148056: Cannot open
tar: Error is not recoverable: exiting now
```

**Bug 2: donor eligibility threshold was 95% of the tallest
*candidate***. If only one node had `enable-backup-api` on, it was
always ≥95% of itself and passed — even when another managed node
(without backup-api) showed the network was miles ahead.

**Fixes**

- `lib/handlers/backup-stoachain.ts`:
  - New `resolveHostBackupDir(node, nodeId, log)` helper. For docker
    nodes: derive from `stoachain_runner_path` (compose dir →
    `<stoaRoot>/data/backups`). For screen/systemd: use live
    argv's `--database-directory` (host path directly) →
    `<db-dir>/backups`. Falls back to stored flags; throws with a
    clear operator message if neither source resolves.
  - The `du` baseline measurement also uses the derived data dir
    (not the hardcoded path).
- `lib/seeds.ts`:
  - Max cut is now tracked across ALL reachable managed nodes, not
    just backup-api-enabled candidates.
  - Eligibility threshold raised 95% → **999‰ (99.9%)**. Matches
    the "sync progress" green-zone threshold in the per-node Status
    card, so what the admin sees as "synced" is exactly what the
    donor picker accepts.
  - `cut-too-low` reason text now shows permille: "sync progress
    823.1‰ is below the 999‰ donor threshold".

Belt-and-suspenders with v0.7.3ad's 1 GiB archive-size check:
size check catches empty archives at write time; sync check
catches partially-synced donors at pick time.

**Version bump**
- `lib/version.ts` → `v0.7.3ae`.

v.Chaos.Jason.0-k·v0.7.3ad

Hit on live: the auto-refresh job promoted a **714-byte archive**
from AncientMiner (manifest: `innerBytes: 20, remoteSizeBytes: 0`)
as the hub's current seed. Happens when the donor's chainweb backup
API returns a near-empty archive — most likely because the donor
wasn't ready (recent restart, still syncing, internal backup worker
uninitialized).

Without a guard, a reseed from this "seed" would replace target
nodes with an empty data dir. Real footgun — seed-refresh must
refuse to promote junk.

**Fix (`lib/handlers/seed-refresh.ts`)**
- After the backup sub-handler returns, cross-check `size_bytes`.
- If below `MIN_SEED_SIZE_BYTES = 1 GiB`, throw with a clear
  operator message. The backup row is preserved (operator can
  inspect or delete via `/admin/backups`); the existing current
  seed (if any) is untouched.
- Threshold chosen to be generous enough that any healthy chainweb
  donor clears it, strict enough that an empty-archive failure gets
  caught (real stoa-chain data is ~50 GB by now).

**Cleanup on live**
- Deleted the bad seed_archives row + 714-byte .ahbk file on the
  production hub (one-off SSH). Next scheduled seed-refresh will
  produce a real seed once a healthy donor is available.

**Version bump**
- `lib/version.ts` → `v0.7.3ad`.

v.Chaos.Jason.0-l·v0.7.3ac

Follow-up to v0.7.3ab: seeds and client backups have different
semantics (hub infrastructure vs client-facing archives) and mixing
them in the Backups UI is confusing. Splits them cleanly.

**Changes**
- `listBackups(opts)` gains `excludeSeeds?: boolean`. The Backups
  page + API both pass it to exclude seed-referenced rows.
- `/admin/backups` no longer shows seeds. Header paragraph now
  points operators at `/admin/seeds` for hub-infrastructure archives.
- New endpoint `GET /api/admin/seeds/[id]/download`:
  - **Ancient admin + fresh-confirm required**
  - Serves the `.ahbk` file (HTTP Range supported, resumable)
  - **No auto-delete** — hub keeps its copy, operator gets a copy
  - Filename baked with seed status + promote date for cold-storage
    clarity (`stoa-seed-current-2026-04-21-<id8>.ahbk`)
- `/admin/seeds` History table gains a `Download` column with a
  `↓ .ahbk` button per row. Button triggers the password modal
  (stamps fresh-confirm on the session) then navigates to the
  download URL.
- History section has an explanatory paragraph: seeds are
  infrastructure, download is out-of-band only, no auto-delete.

**Use cases for the download**
- Cold/offline archive of the reseed baseline (disaster recovery)
- Manual reseed on a firewalled node that can't SSH to the hub
- Inspection / diagnostics of the archive content

**Version bump**
- `lib/version.ts` → `v0.7.3ac`.

v.Chaos.Jason.0-m·v0.7.3ab

User caught a real footgun: the hub's seed archive (the `.ahbk` used
for new-node installs + reseeds) shares the same `data/backups/`
directory and `backups` table as client-facing backups. Downloading
it via the normal backups page auto-deleted the file on completion
(standard behavior for client backups), which would orphan the seed
and break future reseeds.

**Fix — seed-referenced backups are now protected**

- New helpers in `lib/backups.ts`:
  - `getBackupSeedStatus(id)` → `'current' | 'archived' | null`
  - `listBackupSeedStatuses(ids)` → batch map for list endpoints
- `deleteBackup(id, opts)` now throws `BackupIsSeedError` if the
  backup is seed-referenced. Pass `{ force: true }` only from
  internal seed-management code (none currently; reserved for
  future demotion flows).
- `DELETE /api/admin/backups/[id]` catches the new error, returns
  **409 Conflict** with the seedStatus, and logs the refusal.
- `GET /api/admin/backups/[id]/download` auto-delete-on-completion
  logic now skips seed-referenced backups. Staged `.tar.gz.ready`
  is still cleaned up (it's disposable); only the `.ahbk` is the
  seed archive and stays on disk.
- `GET /api/admin/backups` and `GET /api/admin/backups/[id]` now
  include `seedStatus` in the response.
- `/admin/backups` UI surfaces this:
  - `HUB SEED · current` (orange) or `HUB SEED · archived` (grey)
    badge next to the label
  - Tooltip on Download explains auto-delete is skipped for seeds
  - Header paragraph mentions the HUB SEED exemption

Downloads of seeds now behave as: admin gets a copy of the file,
hub keeps the file, reseed remains possible. No more one-shot
"download → lose the seed" accident.

**Version bump**
- `lib/version.ts` → `v0.7.3ab`.

v.Chaos.Jason.0-n·v0.7.3aa

Node2 conversion succeeded (chainweb now runs inside
`stoa-node` container), but the Control tab still showed
`supervision=screen`. Two cooperating bugs:

1. Priority was `screen > docker > systemd`. Any screen session
   present made detection short-circuit to screen.
2. Screen detection regex matched **any** session name: `\d+\.\w+`.
   Node2 has unrelated screens on the box — `StoaMiner` (kadena
   ASIC miner) and `cronoton` — both matched. First one picked →
   mis-reported.

**Authoritative fix**: use the **cgroup of the chainweb-node PID**.
A docker-supervised process lives in `/system.slice/docker-<hash>.scope`;
a systemd unit lives in `/system.slice/<unit>.service`. That's the
truth regardless of which other services happen to be on the box.

**Changes (lib/stoachain-live.ts)**
- Bash probe now captures `/proc/$PID/cgroup` in a new `---CGROUP---`
  section.
- Supervision picker checks cgroup first (docker / systemd), falls
  back to screen/docker/systemd blocks only if cgroup didn't resolve.
- Screen session detection regex tightened: `[0-9]+\.StoaNode` only
  — unrelated screens no longer trigger false positives.

**Version bump**
- `lib/version.ts` → `v0.7.3aa`.

v.Chaos.Jason.0-o·v0.7.3z

Node2 screen → docker conversion failed:

```
error mounting ".../StoaNodeData.stoa/tls/tls-cert.pem" to rootfs at "/data/tls-cert.pem":
...not a directory: Are you trying to mount a directory onto a file (or vice-versa)?
```

Two distinct bugs:

**Bug 1 (root cause): cert path not translated after data-dir move.**
On nodes where the TLS cert lives inside the data dir (e.g.
`/mnt/nvmedrive/StoaNodeData/tls-cert.pem`), the `mv` of the data dir
moves the cert along with it. The flags loaded from live argv still
point at the pre-move path, so the `sudo cp` to copy cert+key into
the new `tls/` subdir silently fails. The handler didn't check cp's
exit code — it logged "copied cert+key" regardless. Docker's
bind-mount then auto-created the missing source path as a directory,
and `runc` rejected the mount because you can't bind-mount a dir
onto a file.

**Bug 2: dead-but-existing container poisons supervision detection.**
A compose-up that creates a container but fails to start it leaves
that container in "Created" state. `detectSupervisionLive` was using
`docker ps -a` (all containers), so a stopped stoa-node was reported
as docker-supervised even after rollback restarted screen/systemd.

**Fixes (lib/handlers/stoachain-convert-supervision.ts)**
- Before cp: if the cert/key paths were inside the old data dir,
  translate them to the new (post-mv) location. Logs the translation
  so it's visible.
- cp: check exit code and throw on failure. Also `test -f` the
  resulting `tls-cert.pem` to make sure it's actually a regular file.
- detection: `docker ps` (running only), not `docker ps -a`.
- New rollback step pushed right after compose.yml is written:
  `docker compose down` + `docker rm -f stoa-node`. LIFO order puts
  this first on rollback (while compose.yml still exists), then
  remove-intermediate / mv-back / restart-old. Prevents orphaned
  container from blocking clean retry.

**Scripts**
- `scripts/recover-node2-post-fail.ts` — one-off to clean up node2's
  dead container + leftover .stoa dir after the v0.7.3y attempt.

**Version bump**
- `lib/version.ts` → `v0.7.3z`.

v.Chaos.Jason.0-p·v0.7.3y

Node2 benchmark: write succeeded at 210 MB/s, then cache-drop step
timed out at 10s. Root cause: `sync` blocks until RocksDB dirty
pages are flushed — on a busy chainweb node that's easily >10s.
Timeout killed the whole benchmark even though cache-drop is
strictly a "read test accuracy" nice-to-have.

**Fixes**
- Dropped the `sync` preamble. We care about clearing the page cache,
  not durability; `drop_caches` handles what we need.
- Bumped timeout 10s → 30s for the drop itself.
- Wrapped the call in try/catch — if it still times out or fails for
  any reason, log a warning and continue. The read test may show
  inflated cached throughput in that case, but the write number is
  the authoritative one anyway (RocksDB's bottleneck is writes).

Net effect: no more benchmark deaths from a busy node, and the
worst degradation is "read test optimistic".

**Version bump**
- `lib/version.ts` → `v0.7.3y`.

v.Chaos.Jason.0-q·v0.7.3x

v0.7.3w's auto-sudoers-repair worked (AncientLinux benchmark got past
the dd write step, 345 MB/s). Next failure was the **read** parse:

```
536870912 bytes (537 MB, 512 MiB) copied, 0,445005 s, 1,2 GB/s
```

Two issues packed into one line:
- Comma decimal separator (`0,445005`, `1,2`) — AncientLinux is in a
  German/Romanian locale
- GB/s (not MB/s) — fast NVMe reads report in GB/s

The old regex `/,\s*([\d.]+)\s*MB\/s/` expected dot-decimals AND
MB/s. Missed both on this line.

**Fix**: new `parseDdThroughput(output)` helper that accepts
MB/s, GB/s, KB/s (with GB→MB and KB→MB normalization) and both
`.` and `,` as decimal separator. Returns null if unparseable so
the caller can throw an honest error.

Used for both write and read parsing in `drive-benchmark.ts`.

**Version bump**
- `lib/version.ts` → `v0.7.3x`.

v.Chaos.Jason.0-r·v0.7.3w

v0.7.3v's probe correctly identified AncientLinux's `/home/StoaNode/data`
as root-owned (docker runs chainweb as root), triggering the
`sudo -n dd` path. That path then failed because AncientLinux's sudoers
is from the pre-v0.7.3m template and doesn't include `/bin/dd`,
`/bin/sh`, or `/bin/sync`.

Rather than tell the operator "go click Sudoers Repair and retry",
the handler now auto-repairs sudoers on sudo-refusal. Every manual
fix becomes a UI feature.

**Changes**
- New `lib/sudoers.ts` — single source of truth for the canonical
  NOPASSWD command list, with `repairSudoers(target, username)` and
  `ensureSudoers(target, username, log)` helpers.
- `lib/handlers/drive-benchmark.ts` — on sudo refusal during dd write,
  calls `ensureSudoers()` to refresh `/etc/sudoers.d/ancientholdings-stoa`
  to the canonical list, then retries the dd once. If it still fails,
  returns an actionable error ("check the sudoers file manually").
- Refactored `pages/api/admin/nodes/[id]/sudoers-repair.ts` to use
  the shared primitive — previously the canonical list was duplicated
  across three files.
- Also dropped the last remaining fake MB/s fallback: if dd exits 0
  but output lacks the `MB/s` line, throw an error instead of
  inventing a reading from wall-clock time.
- Added `/usr/bin/curl` to the canonical sudoers list (needed by
  v0.7.3t's compose-plugin install and v0.7.3u's docker install).

**Version bump**
- `lib/version.ts` → `v0.7.3w`.

v.Chaos.Jason.0-s·v0.7.3v

Two bugs in one:

1. **Drive benchmark always used `sudo -n dd`** — fine for docker
   installs (root-owned data dir) but failed on user-owned data dirs
   (screen/systemd installs, e.g. AncientLinux's `/home/StoaNode/data`)
   whose sudoers didn't have a `/bin/dd` entry. The dd never actually
   ran; sudo refused with "a password is required".

2. **The handler fabricated a fake MB/s reading on failure.** Because
   the error-handling ran AFTER the mbps calculation, and the
   calculation fell back to `sizeMb / wall-clock` when the dd output
   had no "MB/s" line, the job log showed a plausible-looking number
   (the ssh round-trip time, e.g. "1802.8 MB/s") before throwing
   the actual error. Misleading.

**Fixes (both in `lib/handlers/drive-benchmark.ts`)**
- Probe `benchDir` perms first via `[ -w ... ]`. If the ssh user can
  write, skip `sudo` entirely. Only use sudo when the dir is
  root-owned (docker case).
- Check dd exit code BEFORE parsing MB/s. If exit != 0 and stderr
  indicates sudo refusal, return a clear "run Sudoers Repair" error
  instead of trying to plot a fake reading.
- Same pattern for the read-test dd and the rm cleanup.
- Cache-drop (`/proc/sys/vm/drop_caches`) still needs sudo — left
  best-effort with `|| true`. If cache-drop fails, the read number
  is just inflated (cached), but the write number is still accurate.

**Version bump**
- `lib/version.ts` → `v0.7.3v`.

v.Chaos.Jason.0-t·v0.7.3u

Closes the "you want to convert to docker but docker isn't installed"
gap. v0.7.3t handled the compose plugin; v0.7.3u handles the whole
docker engine.

**Fix**: convert-supervision's docker preflight now runs Docker's
official `get.docker.com` convenience script if `command -v docker`
fails. That sets up the apt repo, installs `docker-ce` +
`docker-compose-plugin` + dependencies, and enables + starts
`docker.service`. After install, preflight re-verifies `docker --version`
and proceeds to the compose-plugin check (which should now pass since
get.docker.com includes the plugin).

Rationale: "if you're converting TO docker and docker is missing,
install it" is the obvious operator expectation. Failing with
"go run the install-wizard bootstrap step yourself" made the Upgrade
button lying. The converter is now genuinely self-healing for the
docker-as-target case.

Every manual fix becomes a UI feature — in line with the operator
principle that production users won't have Claude to SSH in for them.

**Install flow (docker path)**
1. `command -v docker` → if missing, run `get.docker.com` (10-min timeout)
2. `docker --version` → sanity check after install
3. `docker compose version` → if missing, fetch v2 plugin binary from
   GitHub (v0.7.3t code)
4. Proceed with conversion

Streamed output: the `[docker-install]` and `[compose]` lines show
pull/install progress in real time.

**Version bump**
- `lib/version.ts` → `v0.7.3u`.

v.Chaos.Jason.0-u·v0.7.3t

Real error surfaced by v0.7.3s's error-visibility + rollback: node1
had docker CLI 29.1.3 but **no compose plugin**. Ubuntu 22.04's
`docker.io` package ships the CLI without the plugin. Running
`docker compose up -d` then fails with
`unknown shorthand flag: 'd' in -d` because docker treats `compose`
as a positional arg and `-d` as a top-level docker flag.

**Fix**: convert-supervision's docker preflight now checks for
`docker compose version` and — if missing — downloads the official
v2 plugin binary (v2.29.1) from GitHub releases directly into
`/usr/libexec/docker/cli-plugins/docker-compose`. Single-binary
install; no apt repo, no GPG key, no Docker repo setup needed.
Architecture-aware (x86_64 / aarch64 / armv7). Uses sudo + tee
(already in sudoers).

This lifts off the operator's plate the "why doesn't my upgrade
work" confusion when their distro's docker package is incomplete.
Can later be factored into a shared `ensureDockerCompose()` primitive
used by the install-wizard too.

**Rollback proven end-to-end**
- Last failed attempt from v0.7.3s logs showed: `[compose] unknown
  shorthand flag: 'd' in -d` → `[rollback] ✓ restored to systemd`.
  Node never needed manual SSH recovery. That's the target state.

**Version bump**
- `lib/version.ts` → `v0.7.3t`.

v.Chaos.Jason.0-v·v0.7.3s

The big one. Previously "the old supervision never comes back up on
failure" was left to operators to fix manually (or a one-off recovery
script). v0.7.3s bakes **full rollback** into every conversion.

**How it works**
- Before any destructive step, `captureOldStartInfo()` records how to
  restart the current mode:
  - systemd: resolves the active unit name
  - screen: captures the runner path from live argv or stored profile
  - docker: captures the compose working dir via `docker inspect`
- The destructive section builds a `rollbackStack` of labelled undo
  callbacks as it goes:
  - after stop → "restart old mode" (registered first, runs last)
  - if data dir was moved → "mv data back" (using `[ -d src ] && [ ! -e dest ]` guards)
  - if data dir was newly created as part of layout → "remove intermediate"
  - before writing systemd unit/wrapper → snapshots originals to
    `.TS.bak`, records "restore systemd unit + wrapper" (stops + disables
    + restores backups + daemon-reload)
  - before writing screen runner → snapshots to `.TS.bak`, records
    "restore screen runner"
- On any failure in steps 4-7: run the stack in reverse (LIFO). Each
  undo is wrapped in try/catch so one failing undo doesn't block the
  rest. After rollback, re-runs supervision detection; logs whether
  the old mode came back successfully.

**Verify is now inside the rollback scope** — if chainweb-node doesn't
come up within 3 min under the new mode, we revert to the known-good
old mode instead of leaving the node silent. (Previously the handler
explicitly skipped rollback for verify failures; that was exactly the
kind of half-broken state operators had to SSH in to fix.)

**Error visibility (from v0.7.3r, restated)**
- compose output is now streamed live to the job log via `onChunk`
  (pull/create/start progress visible in real time)
- Combined (stderr + streamed stdout) is included in the error
  message, tail -600 chars, with explicit exit code
- Same treatment for systemctl + screen start
- Defensive `docker rm -f stoa-node` before compose up (survives
  a stale container collision from a prior failed attempt)

**Scripts**
- New `scripts/recover-node1-systemd.ts` — one-off recovery used to
  restore node1 to systemd after the v0.7.3o/p/q chain of failed
  conversions left it half-converted. Useful as a reference for
  similar recoveries; not intended to be part of the regular ops path
  now that rollback is built in.

**Version bump**
- `lib/version.ts` → `v0.7.3s`.

v.Chaos.Jason.0-w·v0.7.3r

Bugfix chain continuing from v0.7.3p/q. v0.7.3p unlocked the upgrade
for adopted nodes (node1); the actual `docker compose up` then failed
with only "docker compose up failed:" (empty stderr). Root cause: the
handler merged stderr into stdout via `2>&1` but then only reported
`r.stderr` on failure — dropping the real error on the floor.

**Fixes**
- Live-stream compose output to the job log (`onChunk`) — you see
  the pull/create/start progress in real time.
- Error message now includes the combined captured output, tail
  -600 chars, with explicit exit code.
- Defensive cleanup: `docker rm -f stoa-node` before compose up, so
  a stale `stoa-node` container from a prior failed attempt doesn't
  cause a name-conflict error on the next try.
- systemd-start + screen-start error paths: same treatment
  (stdout + exit code surfaced).
- Docker-compose timeout raised from 3 min → 5 min to cover cold
  image pulls on slow connections.

**Version bump**
- `lib/version.ts` → `v0.7.3r`.

v.Chaos.Jason.0-x·v0.7.3q

Bugfix: after running a drive benchmark that classified a drive as SSD,
the "Drive (sysfs)" row still rendered the red "HDD (discouraged)"
badge because the badge was hardcoded to sysfs — the empirical class
was only being applied to the Storage card's tone and the HDD-
discouragement warning.

Fix: new `effectiveClassBadge` that renders the benchmark class when
available, sysfs as the fallback. The row is renamed from "Drive
(sysfs)" to "Drive class", with a source note: "from empirical
benchmark — sysfs heuristic said hdd" when they disagree, or "from
/sys/block — heuristic (run benchmark below for empirical)" when
only sysfs is available. Drive model moved to its own KV row.

Also drops the now-dead `driveBadge` helper.

**Version bump**
- `lib/version.ts` → `v0.7.3q`.

v.Chaos.Jason.0-y·v0.7.3p

Bugfix: v0.7.3o's convert-supervision failed on adopted nodes (like
node1) because it required a pre-captured `stoachain_flags_json` in
the DB. Adopted systemd/screen nodes that never went through the
Install wizard never had stored flags; the handler blew up at step
2/8 with "no stored flag profile — trigger a Restart first".

Fix: source flags from **live argv first** (via `fetchLiveFlags`, which
SSHes + parses `ps` output), fall back to stored only if live parsing
fails. Since the handler already confirms the node is running in
step 1/8 (supervision detection), live always works in practice.

Affected path: `lib/handlers/stoachain-convert-supervision.ts` step 2/8.
No other behavior change; the hierarchy lock + UI + API route from
v0.7.3o are untouched.

**Version bump**
- `lib/version.ts` → `v0.7.3p`.

v.Chaos.Jason.0-z·v0.7.3o

Turns the v0.7.3n any↔any converter into an **upgrade-only** ladder
along the hierarchy `docker > systemd > screen`. Screen is the worst
supervision mode for a production daemon — no restart policy, no boot
recovery, session death = node death — and the UI now surfaces that
so operators can't accidentally miss it.

**Hierarchy (correct ordering)**
- `docker`  ★★★ — image-pinned, isolated, reboot-safe via
  `restart: unless-stopped`. Best.
- `systemd` ★★  — proper lifecycle (`Restart=on-failure`), boot recovery
  (`WantedBy=multi-user.target`), but binary lives on host. Upgrade
  recommended.
- `screen`  ★    — no restart, no boot recovery. Upgrade highly
  recommended.

**Hub-enforced upgrade-only conversions (3)**
- `screen  → systemd`
- `screen  → docker`
- `systemd → docker`

**Refused downgrades (3)** — reinstall under the lower mode instead:
- `systemd → screen`
- `docker  → screen`
- `docker  → systemd`

**Changes**
- New `lib/supervision.ts` — single source of truth for ranks,
  star counts, labels, taglines, and reboot survivability. Exports
  `canUpgradeTo(from, to)`, `upgradeTargetsFrom(from)`.
- `lib/handlers/stoachain-convert-supervision.ts` — enforces
  `canUpgradeTo` at job start. Downgrade requests fail with a clear
  message before any state changes.
- `pages/api/admin/nodes/[id]/stoachain/convert-supervision.ts` —
  fetches live supervision, validates upgrade, rejects downgrades at
  the API layer so a downgrade never even hits the worker.
- `components/admin/NodeTabs.tsx` — replaces `SupervisionConverterCard`
  with `SupervisionCard`. Shows current mode with star rating, tagline
  ("Best — no upgrade needed" / "Upgrade recommended" / "Upgrade highly
  recommended"), and an explicit "Survives hardware reboot: yes / no"
  indicator. When the node isn't at the top, an Upgrade button with
  dropdown of valid targets. Placed at the top of the Control sub-tab.
- Tone: docker green, systemd amber, screen red.

**Auto-restart verification**
- Docker: `renderDockerCompose` already emits `restart: unless-stopped`
  (verified `lib/stoachain-layout.ts:160`).
- Systemd: unit template already has `Restart=on-failure` +
  `WantedBy=multi-user.target` + `systemctl enable` (verified
  `stoachain-convert-supervision.ts:430-445`).
- Screen: no auto-restart (intentional; reinforces 1-star rating).

**Deferred to later**
- Install wizard 3-mode selector + binary-extract-from-image primitive.
  Today only docker installs are wired; systemd/screen exist through
  adoption of legacy nodes or manual bootstrap. Fresh systemd/screen
  installs are a future slice; every node currently in the network can
  already be upgraded along the hierarchy via this converter.

**Version bump**
- `lib/version.ts` → `v0.7.3o`.

v.Chaos.Jason.0-aa·v0.7.3n

Closes gaps in supervision handling so every node-op works regardless of
whether the node runs under screen / systemd / docker, and adds a
first-class migration path between the three.

**Seeds page: live backup-api detection**
- `listManagedNodeStatus` (lib/seeds.ts) now fetches live flags alongside
  `/info`. The "Backup API" column in `/admin/seeds` no longer falls back
  to stored flags when the node's running argv has been edited out-of-band
  (node1 symptom before this fix).
- Stored flags remain the fallback when the node is unreachable.

**Unified logs endpoint**
- New `GET /api/admin/nodes/[id]/stoachain/logs?lines=N` dispatches on
  detected supervision:
  - `docker`   → `docker logs --tail N stoa-node`
  - `systemd`  → `journalctl -u stoa-node.service --lines=N`
  - `screen`   → `tail -n N` of common runner log files (`/var/log/stoa-node.log`,
                `/mnt/nvmedrive/StoaNodeData/chainweb.log`, etc.), or a
                friendly "attach to the screen session" note when no log
                file exists
- Old `/docker-logs` route kept as a back-compat alias.
- Peer-activity route (`/stoachain/peer-activity`) now uses the same
  supervision-aware source — "Peer Activity" works for systemd + screen
  nodes too, not just docker.
- New `NodeLogsCard` in NodeTabs replaces the docker-only
  `ContainerLogsCard` in the Control sub-tab. Title/source adapts:
  "Container logs" / "Service logs (journalctl)" / "Screen logs".

**Flag Editor Apply+Restart: systemd support**
- `stoachain-control` handler gained `rewriteSystemdWrapper`. When the
  user Applies flag changes on a systemd-supervised node, the handler
  inspects `systemctl cat stoa-node.service`, finds the wrapper script
  referenced by `ExecStart=`, and overwrites it with the output of
  `toRunnerScript(flags)` (base64 + tee, chmod 755). Then `daemon-reload`
  + `systemctl restart`.
- Matches the existing docker-compose rewrite and screen runner-script
  rewrite paths — all three supervision modes now behave identically in
  the Flag Editor.

**Any↔any supervision converter (NEW)**
- New `lib/handlers/stoachain-convert-supervision.ts` migrates a node
  between any two supervision modes without losing chain data. Six
  conversions covered:
  - screen ↔ docker
  - screen ↔ systemd
  - docker ↔ systemd
- 8-step pipeline: detect current → load flags → preflight target
  prerequisites → stop current → prepare new mode layout → start under
  new mode → verify live `/info` → update stored state.
- Docker target: rearranges into canonical `<stoaRoot>/{chainweb, data, tls}`
  layout, renders compose.yml via `renderDockerCompose`, mounts through
  to container-internal paths (`/data`, `/data/tls-cert.pem`).
- Systemd target: writes `/usr/local/bin/run-stoa.sh` wrapper +
  `/etc/systemd/system/stoa-node.service` unit, daemon-reload + enable.
- Screen target: writes `RunStoaNode.managed.sh` next to the data dir.
- Auto-rollback attempts to restart under the old mode if the
  conversion fails after stop (not guaranteed — old-mode artifacts may
  already be overwritten when rearranging into docker layout).
- New API `POST /api/admin/nodes/[id]/stoachain/convert-supervision`
  with body `{toMode: 'docker' | 'systemd' | 'screen'}`. Ancient admin +
  fresh-confirm required.
- **UI** new `SupervisionConverterCard` on Chainweb → Control sub-tab:
  dropdown of available target modes, destructive confirmation dialog,
  redirects to job log on submit.

**Registry**
- `lib/handlers/registry.ts` now registers 14 handler kinds (added
  `stoachain-convert-supervision`).

**Version bump**
- `lib/version.ts` → `v0.7.3n`.

v.Chaos.Jason.0-ab·v0.7.3m

Three items operator-requested.

**Drive benchmark (empirical classification)**
- New `lib/handlers/drive-benchmark.ts` — `dd`-based sequential write + read
  test against the node's data-dir filesystem. 512 MB default, `conv=fdatasync`
  + `oflag=dsync` to bypass cache. Caches dropped before read (via
  `/proc/sys/vm/drop_caches`).
- Classifies by measured write throughput:
  - ≥ 500 MB/s → `nvme`
  - ≥ 150 MB/s → `ssd`
  - ≥ 50 MB/s → `hdd`
  - < 50 MB/s → `slow` (red warning; check for virtualized/network storage)
- Persists via inline ALTER TABLE (additive cols `drive_bench_*` on `nodes`).
- New API `POST /api/admin/nodes/[id]/drive-benchmark` — ancient admin +
  fresh-confirm.
- Sudoers template updated: `/bin/dd`, `/bin/sh`, `/bin/sync` added.
- **UI** on Chainweb → Status → Storage card: "Empirical benchmark"
  section alongside sysfs class. "Re-run benchmark" button. Shows
  write + read MB/s, measured timestamp, highlights mismatch between
  sysfs-heuristic and empirical-measured class.

**`backup-directory` locked in Flag Editor**
- Added to `IMMUTABLE_FLAGS` client + server side. Chainweb auto-derives
  it to `<database-directory>/backups` when omitted; setting it elsewhere
  breaks RocksDB hardlink checkpointing.
- Operators now only toggle `enable-backup-api`; the dir is always correct
  by default. Matches how node2 / node1 / AncientLinux all work in
  practice.

**Node1 `--enable-backup-api` enabled (manual fix)**
- Out-of-band: SSH'd into node1, edited `/usr/local/bin/run-stoa.sh` to
  add `--enable-backup-api`, reloaded via `systemctl restart
  stoa-node.service`. Verified with POST to `/make-backup` → returned
  backup id successfully.
- Old runner script archived with `.TS.old` suffix.
- Note: systemd-supervised nodes don't yet support Flag Editor's
  Apply+Restart path — that's v0.7.4+ work (unit-file rewriting).
  Manual fix for now; v0.7.4 ships proper support.

**Version bump**
- `lib/version.ts` → `v0.7.3m`.

v.Chaos.Jason.0-ac·v0.7.3l

Filling in three automation gaps the user called out:

**Certbot now detected in system-probe**
- `lib/handlers/system-probe.ts` — new `SVC_CERTBOT` section captures:
  binary version, `certbot.timer` enabled/active state, next scheduled
  run, list of installed deploy-hooks.
- `SystemProbe.services.certbot` — surfaces in the probe output so the
  admin UI can show certbot alongside docker, nginx, etc. Install wizard's
  certbot install (added in v0.7.3i) is now visibly confirmed by probe.

**Cert renewal deploy-hook**
- `stoachain-certbot-obtain` now installs a per-node deploy-hook at
  `/etc/letsencrypt/renewal-hooks/deploy/stoa-<nodeId8>.sh`.
- When `certbot.timer` renews the cert (~60 days from now, automated),
  the hook:
  1. Copies the renewed cert files into chainweb's TLS paths
  2. Fixes ownership + permissions
  3. Detects supervision (docker / systemd / screen) at hook run-time
     and restarts accordingly (docker compose up -d --force-recreate,
     systemctl restart, or bail with instructions for screen)
- Previously: certbot renewed fine but the new cert never reached
  chainweb's in-memory copy — would have been a silent time-bomb ~60
  days out.

**Daily seed auto-refresh (scheduled)**
- `worker/index.ts` — `maybeScheduleSeedRefresh()` runs on every main
  loop iteration (throttled to 15 min between checks). If:
  - auto-refresh isn't disabled (system_state flag)
  - current seed is >23h old (or missing entirely)
  - no seed-refresh job is already queued/running
  - there's an eligible donor
  → enqueues a `seed-refresh` job automatically. Runs under actor email
  `system:seed-auto-refresh` in the audit trail.
- `pages/api/admin/seeds/auto-refresh.ts` — POST endpoint to toggle
  the scheduler on/off (fresh-confirm + ancient-admin).
- Admin UI on `/admin/seeds` gets a new "Auto-refresh schedule" panel:
  green/gray Enabled/Disabled toggle, next ETA (based on current seed
  age + 23h), last enqueue timestamp + job id, last skip reason (e.g.
  "no donor available").

**Version bump**
- `lib/version.ts` → `v0.7.3l`.

v.Chaos.Jason.0-ad·v0.7.3k

Follow-on cleanup after v0.7.3j proved the LE flow works end-to-end.

**Cert-doctor logic inverted (critical bugfix)**
- `lib/cert-doctor.ts`: the old v0.7.3h logic said "CA-signed is bad,
  self-signed is good, certbot auto-renew breaks the network". Every
  claim was wrong — verified today by restoring node2's LE cert and
  watching all three nodes resume syncing.
- New logic:
  - `severity='healthy'` (green) — CA-signed + certbot auto-renew active
  - `severity='warn'` (amber) — CA-signed without auto-renewal configured
  - `severity='error'` (red) — self-signed (broken on public Stoa P2P)
  - `severity='unknown'` — cert unreadable / ephemeral / missing
- Messages rewritten to match reality.

**Identity card: positive confirmation when healthy**
- Green banner appears when TLS is set up right (LE + certbot timer).
  *"Peer trust is unaffected by renewals because chainweb validates
  via CA chain, not fingerprint pinning."*
- Amber banner when LE cert but no auto-renew.
- Red banner stays only for self-signed — the actually-broken case.

**Sync progress indicator on Status card**
- New KVs: **Target (tallest peer)** + **Sync progress**.
- Target = max cut height across all managed nodes the hub has
  live-probed (parallel SSH, O(slowest node)). Null if this is the
  tallest.
- Progress shown as permil with 3 decimals (e.g. `998.234 ‰`), colored:
  - green `≥ 999 ‰`
  - gold `≥ 950 ‰`
  - amber `< 950 ‰`
- Shows "N blocks behind" when delta > 0; "at tip" when caught up.

**certbot handler: auto-resolves docker host paths**
- `stoachain-certbot-obtain`: if the node's stored runner_path is a
  `docker-compose.yml`, the handler derives `<stoaRoot>/tls/...` host
  paths from it automatically. No more manual `certPath` + `keyPath`
  in the API payload for docker-supervised nodes.

**Add Node UI: docker-default signaling**
- "Easy setup (with password)" → **"Easy setup · docker"** + green
  "recommended" badge.
- "Advanced (paste private key)" → **"Advanced · existing install"**.
- Explanatory line under the tabs: *"Docker supervision gives each node
  a self-contained environment … what the hub recommends for any new
  install."*

**Version bump**
- `lib/version.ts` → `v0.7.3k`.

v.Chaos.Jason.0-ae·v0.7.3j

Found three bugs in the v0.7.3i certbot handler while actually running
it against AncientLinux:

**1. Silent apt install "success"**
- `sudo -n apt-get install -y certbot 2>&1 | tail -5` — tail's exit code
  masks apt-get's failure. Handler thought certbot was installed; it
  wasn't.
- Fixed: wrap in `set -o pipefail`, then verify post-install with
  `command -v certbot`.

**2. DEBIAN_FRONTEND=noninteractive rejected by sudo**
- sudo's `env_reset` strips non-whitelisted env vars. Setting
  `DEBIAN_FRONTEND` in the sudo command failed: *"you are not allowed
  to set the following environment variables"*.
- Fixed: dropped the env var. apt-get install is fine without it.

**3. DNS-01 hook scripts didn't land on disk**
- `echo ${JSON.stringify(script)} | tee ...` corrupted escape sequences.
  The `\n` in the script source became literal `\n`, not a newline. The
  file ended up as one giant first line that `sh` couldn't parse →
  certbot reported `/bin/sh: 1: /etc/letsencrypt/duckdns-hooks/auth.sh:
  not found`, even though the file existed.
- Fixed: base64-encode the script content, decode on the remote side
  via `base64 -d | tee`. Same pattern used by `writeManagedRunner` etc.
- Added sanity check: `test -x && test -s` on the written file before
  invoking certbot.

**UX cleanup**
- **Deleted** `CertRotateButton` / the self-signed rotate UI entirely.
  Per feedback: *"just remove it all together... since its only noise
  now."* The old `stoachain-cert-rotate` handler stays registered for
  API-level compatibility but has no UI surface anymore.
- **Renamed** "Obtain Let's Encrypt cert (recommended)" → simply
  **"Install TLS cert"**. No "recommended" hedge — LE is the only way
  chainweb P2P works on public Stoa.
- **Auto-detected challenge** from the hostname: `.duckdns.org` →
  DNS-01, everything else → HTTP-01. Removed the challenge dropdown.
  Operator still needs to provide a DuckDNS token for NAT'd nodes; the
  field auto-reveals only when DNS-01 is the auto-choice.

**On "ancientholdings as its own CA" (future work)**
- Honest answer logged: technically possible (~weeks of work), but
  creates network-splitting effect with operators who already trust LE.
  Deferred to Phase 3+ as a consortium-CA option; main Stoa network
  stays on LE.

**Version bump**
- `lib/version.ts` → `v0.7.3j`.

v.Chaos.Jason.0-af·v0.7.3i

**Root cause finally verified**: chainweb-node's P2P TLS validates
against the standard system CA bundle. Self-signed certs are rejected
with `HandshakeFailed "certificate has unknown CA"`. Let's Encrypt
certs (CA-signed) work fine — the original node1 + node2 setup used LE
for exactly this reason.

**The hub's old cert-rotate generated self-signed certs → broken for
real chainweb use.** Rotating node2 twice today (P-384 and P-256, both
self-signed) broke peer sync each time. **Restoring node2's original LE
cert from `/etc/letsencrypt/live/` immediately fixed sync network-wide**
— confirmed by: node2 1,622,533 → 1,624,038 in minutes; node1 unstuck
from 1,621,032 → 1,621,330; AncientLinux → 1,623,530.

**New handler: `stoachain-certbot-obtain`**
- `lib/handlers/stoachain-certbot-obtain.ts`:
  1. Installs certbot via apt if missing.
  2. Archives existing cert+key with `.TS.old` suffix.
  3. Runs certbot:
     - HTTP-01 (`--standalone`): certbot binds :80; nginx is briefly
       stopped if active; chainweb keeps running.
     - DNS-01 via DuckDNS: writes a small auth-hook script that updates
       a TXT record via DuckDNS's API (works for NAT'd nodes like
       AncientLinux on `bytales.duckdns.org`).
  4. Copies `fullchain.pem` + `privkey.pem` from
     `/etc/letsencrypt/live/<domain>/` to chainweb's configured paths.
  5. chown to the chainweb user, chmod 600 on the key.
  6. Optionally auto-restarts chainweb-node to load the new cert.

**Bootstrap: certbot now installed alongside docker**
- `lib/nodes.ts` — `prepareTarget` script now installs certbot via apt/dnf/yum.
- Canonical sudoers list gains `/usr/bin/certbot`, `/usr/bin/apt-get`,
  `/usr/bin/cp`, `/bin/cp`. `sudoers-repair` endpoint updated to match.

**UI: Identity card**
- Primary action is now **"Obtain Let's Encrypt cert"** with challenge
  method dropdown (HTTP-01 / DNS-01-DuckDNS), ACME email field, DuckDNS
  token field (revealed when DNS-01 selected), auto-restart checkbox.
- The old **"Rotate (self-signed)"** button is tucked behind an
  `Advanced` expand arrow with a warning that self-signed certs are
  rejected by chainweb P2P on public networks.

**API**
- `POST /api/admin/nodes/[id]/stoachain/certbot-obtain` — fresh-confirm +
  ancient-admin. Body accepts `domain`, `email`, `challenge`,
  `duckdnsToken`, `restart`.

**Node2 cert restored out-of-band** via SSH — see above for heights
proving the fix. Next: user can obtain LE certs for AncientLinux (DNS-01
via DuckDNS) through the new UI action.

**Version bump**
- `lib/version.ts` → `v0.7.3i`.

v.Chaos.Jason.0-ag·v0.7.3h

**Systemd supervision in stoachain-control**
- `lib/handlers/stoachain-control.ts` — new `systemd` branch alongside
  existing docker + screen paths. Resolves the unit name (prefers
  `stoa-node.service`, falls back to any active `stoa*` / `chainweb*`
  unit) and dispatches `systemctl start|stop|restart <unit>`.
- `waitForChainweb` reused for post-start liveness check.
- `detectSupervisionLive` now recognizes systemd (between docker and
  screen in priority).
- Limitation documented: flag edits in the Flag Editor don't yet apply
  to systemd-supervised nodes because the handler doesn't rewrite the
  unit file's `ExecStart` line. Restart/Start/Stop work. Flag-driven
  recomposes will land in v0.7.3i or later.

**Cert-doctor**
- New `lib/cert-doctor.ts` — inspects a node's TLS setup beyond just
  "cert file exists":
  - Issuer CN vs Subject CN → classifies `self-signed` / `ca-signed` /
    `unknown`
  - Scans for `certbot.timer` systemd unit + cron entries mentioning
    `certbot` / `letsencrypt`
  - Extracts last-run / next-run timestamps from the timer
- Status endpoint `/api/admin/nodes/[id]/stoachain/status` now returns
  a `certDoctor` section.

**Identity card UI**
- New red warning banner at the top of the Identity card when:
  - Cert is CA-signed (issuer ≠ subject)
  - Certbot auto-renew is active (timer or cron)
- Warning explicitly lists the class of problem and suggests rotation
  to self-signed ECDSA.
- New `Issuer` KV row shows the issuer CN + cert-kind tag (`self-signed`
  / `ca-signed`). Red styling when ca-signed.

**Why this lands as one feature**
- StoaNodeOne was just added to the hub: systemd-supervised, Let's
  Encrypt cert with active certbot timer. Surfacing both problems
  (can't control via hub; cert will periodically rotate) in one release
  so operators see the full picture.

**Version bump**
- `lib/version.ts` → `v0.7.3h`.

v.Chaos.Jason.0-ah·v0.7.3g

Response to the feedback *"all of these manual help-ups, in production
you are not there to fix shiet"* — every manual fix I've done during
this session is now exposed as a UI action the operator can trigger
themselves.

**Peer activity card + auto-detection banner**
- New `lib/peer-activity.ts` — parses chainweb-node's docker logs into
  per-peer summaries (error count, last success/error, dominant failure
  tag: `unknown-ca`, `timeout`, `conn-refused`, etc.).
- New API `GET /api/admin/nodes/[id]/stoachain/peer-activity?minutes=N`
  — SSHes to target, pulls last N minutes of container logs, returns
  events + summaries + **auto-detected issues** with suggested actions.
- New `PeerActivityCard` on Chainweb → Status sub-tab. Polls every 15s.
  Shows per-peer table. If the node isn't syncing AND a dominant tag
  of `unknown-ca` is detected → **red banner with "Reset peer trust"
  one-click button**.

**Reset peer trust (self-service)**
- New handler `peer-trust-reset` (`lib/handlers/peer-trust-reset.ts`):
  composes `seed-refresh` + `stoachain-reseed` in one job. Refreshes
  seed from a healthy donor (excludes the target), then reseeds the
  target. As a side effect, the target's peer-DB is replaced with the
  donor's current view — stale fingerprints cleared.
- Honest caveat documented in the handler: **this is a pragmatic proxy**
  for a surgical peer-DB wipe. True surgical would be a rocksdb key
  prefix delete; building that requires chainweb source reading we
  haven't done.
- New API `POST /api/admin/nodes/[id]/stoachain/peer-trust-reset`.
  Fresh-confirm + ancient admin.

**Cert-rotate now generates ECDSA P-256** (was P-384)
- `lib/handlers/stoachain-cert-rotate.ts` — switched curve to P-256 with
  SHA-384 signatures. Matches the original working Stoa cert (the one
  AncientLinux trusted pre-incident); evidence suggests P-384 may be
  rejected by some chainweb-node builds. 128-bit security is still
  ample for P2P identity; cert generation is faster.

**Force-fail stuck job**
- New API `POST /api/admin/jobs/[id]/force-fail` + button on
  `/admin/jobs/[id]`. Marks a `running` or `queued` job as failed in
  the DB immediately. Operator escape hatch for jobs that never complete
  due to worker bugs or external issues (ssh2 half-open channels, dead
  remote processes). Warns that side effects the handler was in the
  middle of may persist.

**Sudoers repair**
- New API `POST /api/admin/nodes/[id]/sudoers-repair` + new
  `SudoersRepairCard` on Chainweb → Control sub-tab. One-click rewrites
  `/etc/sudoers.d/ancientholdings-stoa` with the current canonical
  NOPASSWD command list. Idempotent. Uses existing `tee` NOPASSWD grant
  so no password prompt.
- Fixes the pre-v0.7.3d installs that didn't include `tar`/`df`/`du`/
  `find` in sudoers (AncientLinux, Node2).

**Version bump**
- `lib/version.ts` → `v0.7.3g`.

**To test**: if AncientLinux still blocked by TLS, click Rotate on node2
again (will get P-256 this time) — if sync resumes, curve was the
issue. If not, click "Reset peer trust" on AncientLinux — will take
~20 min but should fully clear any stale-fingerprint pinning.

v.Chaos.Jason.0-ai·v0.7.3f

v0.7.3e proved the full reseed pipeline works end-to-end — AncientLinux
jumped from cut height ~79,000 to ~1,621,032 (StoaNodeTwo's height at
seed-capture time) in minutes, as designed. But the handler parked in
`running` state at 90% afterward because of an ssh2 quirk.

**Root cause**: when the remote `tar -xz` exits cleanly after consuming
all stdin, ssh2 sometimes emits only the `exit` event and NEVER the
`close` event. The handler was waiting on `close` to settle the
promise, so it parked indefinitely even though tar finished + data was
correct.

**Fix**: settle on whichever of `exit`, `close`, or a post-EOF 60s
timer fires first. `plaintextTarGz.pipe(stream).on('end')` triggers
the timer as a belt-and-suspenders. Either:
- `exit` fires with the tar exit code → settle immediately based on it
- `close` fires without exit → settle based on stderr (empty = success)
- post-EOF 60s passes without either → settle based on stderr

All three paths guarantee deterministic resolution. No more 20-min
wait-on-timeout after successful reseeds.

**Recovery of the stuck job from v0.7.3e test**:
- Job `efabc16d-…` was stuck at 90% running. I killed the worker,
  manually ran the remaining handler steps (mv staging → data, rm
  data.old, docker compose up -d), and marked the job succeeded in
  the DB so the UI reflects reality.
- AncientLinux verified running off the seed at height 1,621,032.

**Version bump**
- `lib/version.ts` → `v0.7.3f`.

v.Chaos.Jason.0-aj·v0.7.3e

v0.7.3d passed the sudoers preflight but failed during extraction with
`gzip: stdin: not in gzip format` — and the node got stranded again
(container stopped, data moved aside, extract dead). Two bugs:

**1. First chunks of the decrypted stream disappeared**
- `lib/handlers/stoachain-reseed.ts` — the progress-tracking `data`
  listener was attached to `plaintextTarGz` BEFORE `.pipe()` was set up.
  Adding a data listener puts a Node Readable into flowing mode
  immediately; during the `await` gap before the SSH pipe attached, the
  first chunks flowed into only the counter (no pipe yet) and vanished.
  The remote tar received bytes starting mid-gzip → "not in gzip format".
- Fix: replace the separate `data` listener with an inline `Transform`
  in the pipe chain. Every byte passes through counter → pipe → SSH
  stdin, no losses.

**2. Failed reseed stranded the node**
- When extract fails, the handler had already stopped the node + moved
  data aside. No rollback meant the operator had to SSH in and move
  things back manually.
- New `rollbackAfterExtractFailure()` helper fires on any extract throw:
  remove the (partial) staging dir, `mv data.old.<ts>` back to live, and
  `docker compose up -d` (for docker supervision). Best-effort — each
  step is try/catch, any rollback failure is logged but doesn't mask the
  original extract error.
- Tight-disk mode (no data.old kept) skips the restore step with a clear
  log line — operator must reseed or sync from genesis.

**3. Broader stderr pattern matching**
- `streamIntoTarExtract` now also settles immediately on:
  - `not in gzip format` / `unexpected end of file` → stream plumbing / corrupt archive
  - `error is not recoverable` / `child died with signal` → tar internal fatal
  - `no space left on device` → disk full during extract
- Previously only sudo-denial patterns triggered early-settle; everything
  else waited for the close event that ssh2 sometimes doesn't emit.

**Version bump**
- `lib/version.ts` → `v0.7.3e`.

v.Chaos.Jason.0-ak·v0.7.3d

First reseed on AncientLinux hung: the target's sudoers (written by the
install wizard) didn't include `tar`, so `sudo -n tar -xz` immediately
hit "a password is required" and the SSH stream closed in a way the
handler didn't catch. Job stuck at 22% "running" forever, and worse —
by the time we noticed, the node was already stopped with its data moved
aside.

Three fixes:

**1. Pre-flight `sudo -n tar` BEFORE stopping the node**
- `lib/handlers/stoachain-reseed.ts` — `preflightSudoTar()` runs a
  harmless `sudo -n tar --version` as the first destructive-safe check.
  If sudo denies it, fail immediately with the exact sudoers line the
  operator needs. Node stays running, data stays put — recoverable state.

**2. Hang-safe stream plumbing**
- Handler's `streamIntoTarExtract` promise now has a single `settle()`
  gate and hooks error / exit / close / stderr-pattern triggers all of
  which settle deterministically. Sudo denial patterns in stderr settle
  immediately instead of waiting for the SSH `close` event that ssh2
  sometimes doesn't emit when the remote process dies before receiving
  any stdin bytes.
- 20-minute belt-and-suspenders timeout — any state where the SSH
  channel goes half-open without firing events still fails cleanly.

**3. Install-template sudoers now includes tar + df + du + find**
- `lib/nodes.ts` — bootstrap writes `tar`, `df`, `du`, `find` into the
  NOPASSWD list. Every NEW install gets the right sudoers.
- **Existing installs (AncientLinux, Node2)** need a one-time sudoers
  patch — the handler's new preflight surfaces this as a clear error
  with the exact fix.

**Recovery on the stuck install**
- Stuck job `86285db2-…` marked failed in the DB manually (it was
  never going to complete on its own).
- AncientLinux's moved-aside data dir restored to `/home/StoaNode/data`;
  container brought back up; syncing resumed.
- AncientLinux's sudoers patched directly via SSH to include the new
  entries.

**Version bump**
- `lib/version.ts` → `v0.7.3d`.

v.Chaos.Jason.0-al·v0.7.3c

Consumer half of SC5 lands: a running-but-unsynced node can now jump
near head by pulling the hub's current seed instead of waiting days to
sync naturally. End-to-end streaming: hub decrypts the .ahbk in-memory,
pipes plaintext tar.gz over SSH into a `tar -xz` on the target. No
intermediate files anywhere — peak memory ≈ SSH channel buffer.

**New handler: `stoachain-reseed`**
- `lib/handlers/stoachain-reseed.ts` — 8-step pipeline:
  1. Preflight (current seed exists, target reachable, disk-space check)
  2. Detect supervision (docker / screen), resolve host data dir via
     docker inspect OR stored flags' `database-directory`
  3. Stop node (`docker compose down` OR `screen quit` + pkill)
  4. Move existing data aside: `mv data/ → data.old.<ts>/`
     (or rm up front in tight-disk mode)
  5. Stream decrypt from hub's `openArchiveStream()` → pipe into SSH
     `sudo tar -xz -C <data.staging>`
  6. Structural verify — `CURRENT` file present in staging
  7. Atomic `mv data.staging/ → data/`
  8. Restart node + delete `data.old/`
- Handles the 700 GB case cleanly: decrypt + extract happen as one
  streaming pass; disk preflight requires 1.1× seed size free (or
  ~seed+existing in deleteOldFirst mode).

**Disk-space UX**
- Default mode keeps `data.old/` aside during extract, deletes on success
  → peak disk ~2× for minutes only, rollback-safe.
- Tight-disk mode (operator-selected checkbox) deletes existing data
  BEFORE extract → peak disk ~1×, zero rollback. UI warns explicitly:
  "if extraction fails, you will have NO chain data."

**UI: Reseed card on Chainweb → Control sub-tab**
- `components/admin/NodeTabs.tsx` — new `ReseedCard` alongside
  ControlCard + RunnerCard. Loads current seed via
  `/api/admin/seeds`, shows donor / seed cut height / node cut height /
  blocks-skipped-forward preview.
- Tight-disk checkbox. Confirm dialog explains destruction before
  enqueue. Fresh-confirm required.
- Inline rollback/rewind warnings if seed height is behind node height.

**API**
- `POST /api/admin/nodes/[id]/stoachain/reseed` — fresh-confirm +
  ancient-admin. Body `{deleteOldFirst?: boolean}`. Rejects with 409 if
  no current seed exists on the hub.

**Not in this release** (deferred to v0.7.3d):
- Install wizard "seeded install" mode (for brand-new nodes, not
  reseed). Needs install-handler extension + wizard UI — distinct code
  path.
- Chainweb-node boot test of staging dir before swap. Adds ~60s per
  reseed and hasn't been needed for the common docker case; revisit if
  post-swap failures become a pattern.

**Version bump**
- `lib/version.ts` → `v0.7.3c` · `SC5 Seeded install — reseed pipeline`.

v.Chaos.Jason.0-am·v0.7.3b

End-to-end test of v0.7.3a on localhost surfaced three UX gaps:

1. **"No eligible donors"** even though Node2 was running with
   `--enable-backup-api`. The filter was reading `probe.chainweb.backupEnabled`
   from `system_probe_json`, which the probe doesn't populate — the field's
   under `probe.chainwebFlags` (and currently empty). Authoritative source
   for backup-api flag is actually `nodes.stoachain_flags_json` (set at
   install + every Start/Restart via the hub).

2. **Only eligible donors shown.** When nothing was eligible, the admin had
   no way to see WHY each managed node was excluded.

3. **No indication of in-flight refresh.** If the admin bounced away from
   the job log page, there was no way to tell from /admin/seeds that a
   new seed was being built.

All three fixed.

**Fix: donor detection from stored flags + live cut height**
- `lib/seeds.ts` — `storedBackupApiEnabled()` reads the DB-stored flag
  profile (authoritative) instead of the probe.
- New `listManagedNodeStatus()` surveys every managed node in parallel,
  SSHing each via `fetchLiveStatus` for cut height + reachability. O(slowest
  node) not O(sum). Returns eligibility status per node:
  - `eligible` / `eligible-rotation` — can donate (latter skipped by
    auto-pick; admin can still pick manually)
  - `no-backup-api`, `not-reachable`, `not-running`, `cut-too-low`,
    `unknown` — with a human reason
- `listDonorCandidates` + `pickDonor` converted to async; rebuilt on top of
  `listManagedNodeStatus`.

**UI: full managed-nodes table + active refresh banner + download queue placeholder**
- `/admin/seeds` gets three new sections:
  - **Active refresh banner** (top): shows when a `seed-refresh` job is
    `queued` or `running`, with live progress % + step label. Link to full
    job log. Polls every 5s.
  - **Managed nodes table**: every node the hub is managing, eligibility
    badge (✓ / ◷ / ◐ / ✗ / —), cut height, last-donated date + relative
    age. The "why not eligible" reason shows inline beneath the badge.
  - **Active downloads table**: structure in place, populated as [] until
    v0.7.3c ships the consumer streaming pipeline.
- Refresh controls updated: auto-pick works even when only
  `eligible-rotation` nodes exist (explicit "recent donor — override"
  label). "⏳ refresh already in flight" notice blocks starting a second.

**APIs**
- `GET /api/admin/seeds` now returns `managedNodes`, `activeRefresh`, and
  `activeDownloads` in addition to `current` + `archives`.

**Version bump**
- `lib/version.ts` → `v0.7.3b`.

v.Chaos.Jason.0-an·v0.7.3a

Start of phase SC5: the hub can now produce "seeds" — promoted backup
archives intended for serving to new / unsynced nodes so they skip
weeks of syncing-from-genesis. This release only covers the PRODUCER
side (hub making + promoting seeds); consumer side (new installs +
reseed consuming a seed) lands in v0.7.3b.

**Schema**
- `db/migrations/015_seed_archives.sql` — adds `seed_archives` (metadata
  for promoted backups; one 'current' row at a time) and
  `seed_downloads` (future queue/progress for clients streaming the
  seed down; consumed in 0.7.3b).
- A seed row references a `backups.id` — the archive file itself lives
  where the backups system put it (`data/backups/<id>.ahbk`). No
  duplicated bytes on the hub.

**New library**
- `lib/seeds.ts` — CRUD for seed_archives, atomic `promoteBackupToSeed`
  (previous current → archived in one SQLite tx, new insert), and
  `pickDonor` / `listDonorCandidates` with health filters:
  - node must have chainweb-node running per latest probe
  - `--enable-backup-api` enabled
  - cut height within 5% of the tallest candidate (proxy for "synced")
  - not donor in the last 3 days (rotation; skipped when only one
    candidate is available)

**New handler**
- `lib/handlers/seed-refresh.ts` — full 4-step flow:
  1. Pick donor (explicit or auto-rotated)
  2. Capture donor live status (cut height + chainweb-node version for
     the seed manifest)
  3. Run `stoachainBackupHandler` as a direct function call (same code
     path as manual customer backups, same encryption)
  4. Promote the resulting `backups` row as the new current seed;
     previous current → archived
- Registered as kind `seed-refresh`.

**Admin panel**
- New page `/admin/seeds` — current seed card (donor, cut height, size,
  sha256, age), eligible-donors picker, one-click "Refresh seed now"
  button (fresh-confirm + ancient-admin), and a history table of past
  promotions.
- Link added from `/admin/`.

**APIs**
- `GET  /api/admin/seeds` — read-only (plain admin auth); returns
  current + archives + donor candidates.
- `POST /api/admin/seeds/refresh` — enqueues a `seed-refresh` job;
  fresh-confirm + ancient-admin; body `{donorNodeId?: string}`.

**Not in this release** (lands in 0.7.3b):
- Install wizard "seeded install" mode
- `stoachain-reseed` handler for existing nodes (stop → download →
  verify → swap → start)
- Streaming download + extract pipeline (tar.zst + secretstream → staging
  dir → boot test → atomic rename)
- Disk-space preflight with keep-old vs delete-old UX

**Version bump**
- `lib/version.ts` → `v0.7.3a` · `SC5 Seeded install — producer side`.

v.Chaos.Jason.0-ao·v0.7.2d

Restart on AncientLinux actually worked — chainweb-node came up inside
the recreated container with the new `p2p-hostname=bytales.duckdns.org`
and `cluster-id=AncientMiner`, syncing cuts from node1 at height 70490.
But the hub's job reported "failed" because `waitForContainerChainweb`
couldn't detect the process.

Root cause: `docker top stoa-node -eo comm` fails on this Docker /
kernel combo with `"Couldn't find PID field in ps output"`. The custom
`-eo comm` ps-options syntax isn't universally supported.

Fix: drop the custom ps format. Use the default `docker top` output
(which includes the full CMD with argv in the rightmost column) and
grep for the substring `chainweb-node`. Works whether the entrypoint
execs `/chainweb/chainweb-node` (current image) or any wrapper, and
doesn't rely on a specific ps format.

Also: backfilled the AncientLinux node row's
`stoachain_last_action=restart` + `stoachain_runner_path` to
`/home/StoaNode/chainweb/docker-compose.yml` so the UI reflects the
de-facto successful restart from the previous job attempt.

Known quirk surfaced by the logs: home node can't sync peers from
node2.stoachain.com due to `certificate has unknown CA` — backlog item,
not new. node1 syncing works fine.

**Version bump**
- `lib/version.ts` → `v0.7.2d`.

v.Chaos.Jason.0-ap·v0.7.2c

v0.7.2b's docker branch was technically correct but didn't run: the worker
had booted before the code change and kept running the old screen-only
handler. When the user hit Apply + Restart on AncientLinux (docker-
supervised), the stale handler ran `screen -X quit` (no-op), then `pkill
-TERM chainweb-node` — which killed the chainweb-node process INSIDE the
container (PID-visible on the host), then tried to write the runner to
`/data/RunStoaNode.managed.sh` (the container-internal data dir path,
which doesn't exist on the host). Job failed. Container's restart policy
(`restart: unless-stopped`) brought chainweb-node back up.

Three preventive changes so this can't recur.

**Worker logs its VERSION on boot**
- `worker/index.ts` — import `VERSION`, `PHASE_CODE`, `PHASE_NAME` from
  `lib/version.ts` and banner them on startup. Operators can now tell at
  a glance (tmux scrollback, PM2 logs) whether the worker process is
  running current code after a patch. Every suffix-bump (`v0.7.2b → c`)
  changes the banner text.

**Screen-path stopNode refuses to pkill when a stoa-node container is running**
- `lib/handlers/stoachain-control.ts` — before the screen quit + SIGTERM
  sequence, the handler checks `docker ps --filter name=stoa-node
  --filter status=running`. If a live stoa-node container is found, it
  throws a clear error asking the operator to restart the worker. The
  supervision branch at the top of the handler should have caught this
  earlier, so the only way the screen path ever reaches a docker node is
  stale worker code.

**CLAUDE.md documents `npm run worker:watch`**
- The `package.json` already had `worker:watch` using `tsx watch`, which
  auto-reloads on every `.ts` change. `CLAUDE.md` now recommends it as
  the dev default; the plain `npm run worker` only makes sense when
  you're debugging the worker itself and don't want auto-restart.

**Version bump**
- `lib/version.ts` → `v0.7.2c`. Suffix-ticks on every patch from now on
  (`a`, `b`, `c`, …), as requested — the live badge shows it, the
  worker banner shows it, the changelog cross-references it.

v.Chaos.Jason.0-aq·v0.7.2b

Follow-on from v0.7.2a after the user pointed out that the RunnerCard told
the same screen-based story regardless of supervision mode — and noticed
the description was wrong for the docker-supervised node (AncientLinux).
Turned out that wasn't just bad copy: the `stoachain-control` handler was
entirely screen-only. Clicking Apply + Restart on a docker-supervised node
would have tried `screen -dmS StoaNode` against a container — nonsense.

**`stoachain-control` is now supervision-aware**
- `lib/handlers/stoachain-control.ts` — live supervision detection at the
  start of every run (`docker ps -a --filter name=stoa-node`, then
  `screen -ls`). The handler dispatches to a docker branch or the
  original screen branch.
- Docker Restart: `docker inspect stoa-node` to find
  `com.docker.compose.project.working_dir` + current image tag →
  `computeLayout()` from that dir → `renderDockerCompose(layout, imageTag,
  flags)` → write `docker-compose.yml` over SSH → `docker compose up -d
  --force-recreate`. `--force-recreate` ensures new env vars take effect
  even when the image tag hasn't changed. Waits for `chainweb-node` to
  appear in `docker top` output, up to 4 minutes.
- Docker Stop: `cd <composeDir> && docker compose down`. The container
  is removed; next Start recreates from the compose file.
- Docker Start: same as Restart, but only runs the up/wait phase.
- `stoachain_runner_path` for docker nodes stores the compose file path
  (that's the "hub-rewritten-on-every-Restart" thing for docker), so the
  status endpoint continues to identify docker-managed nodes the same
  way it did before.
- Compose dir not found (container `docker rm`'d manually) → clear
  error message asking the operator to re-run Install.

**`RunnerCard` — honest per-supervision copy**
- `components/admin/NodeTabs.tsx` — split into `ScreenRunnerCard` and
  `DockerRunnerCard` with correct fields + accurate procedure writeups.
  Docker version surfaces: container name (`stoa-node`), the inside-
  container binary path (`/chainweb/chainweb-node`), the host-side
  compose path, and the three bind-mount pairs (data, cert, key). The
  "How Start / Restart works" steps describe the actual flow — inspect →
  renderDockerCompose → tee → up --force-recreate → poll docker top.
- Screen version keeps its original writeup but re-titled "Runner +
  binary (screen)" for clarity, and clarifies that the legacy-runner
  rollback path is screen-specific.

**Version bump**
- `lib/version.ts` → `v0.7.2b`.

**Known caveat**
- If Apply + Restart on a docker node is the first time the hub has
  operated on it, the stored flags profile is already there (install
  wizard wrote it). But if someone imported a docker node without
  running the wizard (rare — no UI path), stoachain_flags_json is
  empty, and the first Restart will fall through the live-capture
  branch. The capture logic parses `ps -eo args` which on a docker
  host includes the container's chainweb-node argv — should work.

v.Chaos.Jason.0-ar·v0.7.2a

End-to-end test on the live site with the home node surfaced several
usability issues addressed here.

**Chainweb tab: sub-tab navigation**
- `components/admin/NodeTabs.tsx` — the Chainweb tab's cards were stacked
  vertically and scrolled 3+ screens deep. Split into sub-tabs:
  `Status | Control | Flags | Identity | Backup`. URL hash format is
  `#chainweb/<sub>` so links are bookmarkable.
- The Flags sub-tab has an inner toggle: `Edit config` (default) vs
  `Current (live)`. Read-only live-parsed view is always one click away.

**Editor prefilled with live ("ghost") values**
- Every input is now pre-filled with the value chainweb-node is actually
  running. The operator changes only what they want (e.g. `p2p-hostname`
  from `ancientminer.home` to `bytales.duckdns.org`) and the rest stays
  put.
- Previously the editor seeded from the stored profile JSON, which for
  newly-added nodes was empty — every input looked unset even though the
  node had 35+ live flags. Moot for nodes that had been Restart-ed through
  the hub once; fatal UX for nodes that hadn't.
- On Apply, the editor sends a SNAPSHOT of the full live profile + pending
  edits as the new stored profile. "Save what's running, plus my changes,
  into the DB." No more slowly-growing stored profile that lags behind
  live.

**Flag validation tightened**
- `lib/stoachain-flags-catalog.ts` — `block-gas-limit` min bumped from 0
  to 1_600_000 (the Stoa network production min). Clearing the field
  still falls back to chainweb-node's compiled-in default (1.6M), which
  now matches.

**GET /flags no longer requires fresh-confirm**
- `pages/api/admin/nodes/[id]/stoachain/flags.ts` — read-only GET
  downgraded from `requireFreshAdminConfirmApi` → `requireAdminApi`.
  Matches the other read-only endpoints (`status`, `docker-logs`,
  `preflight`). Opening the Flags sub-tab without typing your password
  in the last 5 minutes no longer 401s.
- PATCH still requires fresh-confirm + ancient-admin (restarting
  chainweb-node interrupts P2P gossip — destructive).

**Row metadata swap**
- FlagRow now surfaces "stored differs from live" instead of the other
  way around. Since the editor prefills from live, the interesting case
  is "you hand-edited the runner without restarting through the hub,
  so stored lags behind what's actually running." Apply still
  snapshots-live-then-persists, which is the desired rebaseline.

**Version bump**
- `lib/version.ts` → `v0.7.2a`.

v.Chaos.Jason.0-as·v0.7.2

First live edit path for chainweb-node flags. Previously the hub could only
start / stop / restart the node with whatever profile was captured at install
time; to change a flag the operator had to SSH in, hand-edit the runner, and
hope they didn't mistype. Now every catalog-known flag has an input control
in the UI with validation, a pending-diff counter, and Apply + Restart.

**New API endpoint**
- `GET  /api/admin/nodes/[id]/stoachain/flags` — returns the stored profile
  JSON from the `nodes.stoachain_flags_json` column. Empty `{}` if the node
  was never restarted via the hub (first Apply seeds it).
- `PATCH /api/admin/nodes/[id]/stoachain/flags` — accepts
  `{ flags: Partial<ChainwebFlags>, restart?: boolean }`. Validates every
  incoming key against `FLAGS_CATALOG` (type + range + enum + hex), rejects
  immutable flags (`chainweb-version`, `database-directory`, cert paths),
  merges into the stored profile, persists, optionally enqueues a
  `stoachain-control` restart job. Value `null` for any key means "revert
  to chainweb-node default" (key is dropped from stored JSON).
- Both routes require `requireFreshAdminConfirmApi` and ancient-admin; PATCH
  requires both because restarting chainweb-node interrupts P2P gossip.

**New UI card — Flag editor** (below the read-only Flags card on the Chainweb tab)
- `components/admin/NodeTabs.tsx` — `<FlagEditorCard>` with per-flag inputs
  grouped by category (Core / Data / TLS / P2P / Consensus / Mempool /
  Service / Mining / Backup / Logging / Runtime / Debug).
- Input type per flag is driven by `FlagMeta`: switch-pair → checkbox,
  enum → select, number → numeric input with `[min, max]` hint, repeatable
  → textarea (one entry per line), hex / string / path → text input.
- "Show all flags" toggle surfaces the full catalog (~40 flags); default
  view shows only what's currently set in the stored profile plus the
  always-visible immutable (locked) rows.
- Each row shows a `pending` badge when changed, `locked` badge on
  immutable flags, `debug` badge on debug-only flags, the inline
  description, the relevant catalog warning when the value deviates from
  default, and the live value if it differs from stored.
- Per-row `revert` discards the pending change; `clear` sets to null
  (chainweb-node falls back to its compiled-in default).
- Footer: pending count, `discard all`, `Save (no restart)`, and
  `Apply + Restart`. Save-only persists to the DB so the next Restart
  picks up the new flags; Apply + Restart does both, enqueues the
  `stoachain-control` job, and polls inline (same pattern as
  CertRotateButton — no nav-away required to watch progress).
- GHC runtime (`+RTS`) gets its own row at the bottom; it's not a flag
  per se but the handler emits it in the runner.

**Integration**
- The flag-editor writes the same `stoachain_flags_json` column that
  `stoachain-control restart` already reads when rebuilding the runner
  script (for screen-supervised nodes) or the compose file (for
  docker-supervised nodes). No new wiring needed — "edit flags → Apply
  + Restart" just happens.
- `FuturePhase` banner on the Chainweb tab updated: flag editor is no
  longer future work. Listed forward: v0.7.3 seeded install, v0.8.x web
  terminal / hub registry.

**Version bump**
- `lib/version.ts` → `v0.7.2` · `SC4 StoaChain flag editor`.

v.Chaos.Jason.0-at·v0.7.1b

Two quality-of-life additions on top of v0.7.1a's install flow, observed
during end-to-end test on a home Linux machine.

**Container logs card** (new)
- `components/admin/NodeTabs.tsx` — ContainerLogsCard shown on Chainweb
  tab when the node is supervision=docker. Live tail of `docker logs
  --tail N stoa-node` via a new API route.
- Configurable line count (100 / 200 / 500 / 1000 / 2000); auto-refresh
  every 5s (toggle); Copy-all button for pasting into support convos.
- New endpoint: `GET /api/admin/nodes/[id]/stoachain/docker-logs?lines=N&container=X`.
  Any-admin access; read-only; safe to poll.

**Auto-reprobe after mutating jobs** (new)
- `stoachain-install`, `stoachain-control` (start/stop/restart), and
  `stoachain-cert-rotate` handlers now enqueue a `system-probe` job as
  their last successful step. The probe runs within seconds of the
  mutating job finishing, so the Docker tab's container listing +
  Overview "Services detected" rows reflect the new state without the
  operator needing to click Reprobe.
- Also wired on the UI side for CertRotate + Install — they POST a
  probe from the browser on job completion as belt-and-braces (fires
  even if somehow the handler-side enqueue fails).
- Fixes the UX mismatch we hit during testing where preflight showed
  the target clean but Docker tab still showed a zombie stoa-node.

**Wizard UX fix: P2P hostname is OPTIONAL**
- Tested install on a home machine with no DNS name — the wizard
  required a P2P hostname input, which when filled with a placeholder
  (`ancientminer.home`) advertised an unresolvable name to peers.
  node1 rejected sync with HTTP 400 and node2's TLS failed.
- Fix: P2P hostname field marked optional. Blank = the install sends
  `0.0.0.0` (chainweb's auto-detect-via-peer-gossip sentinel). Peers
  use the NAT-translated source address instead of the advertised
  hostname. Home operators now work without DNS.
- Regex validation still applies when the field IS filled — for
  operators with real DNS names.
- Updated helper text explicitly tells home operators to leave it
  blank and tells validators/bootstraps to fill it in.

**End-to-end test result (v0.7.1a + b combined)**
- Fresh home Linux machine (AncientMiner, 32 GB RAM, NVMe): preflight
  green, install wizard succeeds, container healthy, chainweb-node
  begins receiving cut data from node1.stoachain.com. Cut height
  climbing — node actively syncing.
- Two real chainweb quirks observed during test (tracked as separate
  backlog items, NOT install-flow bugs):
  - TLS handshake with node2.stoachain.com returns
    "certificate has unknown CA" despite `_disablePeerValidation=True`
    in Stoa version config.
  - node1.stoachain.com returns HTTP 429 during aggressive initial
    cut polling; self-resolves as peer relationship stabilizes.

---

v.Chaos.Jason.0-au·v0.7.1a

Incremental patch on v0.7.1. Extends the existing Easy-setup bootstrap
flow in `/admin/nodes/new` so that, in addition to installing the hub's
SSH key, it also prepares the target for container-based chainweb
management. Fills the gap v0.7.1's install wizard revealed: the
wizard assumed a "prepared" server, but the bootstrap flow wasn't
actually preparing anything beyond SSH auth.

**What Easy setup now does** (was: only steps 1-5)

1. Password-auth SSH into the target (password used in-memory, never stored)
2. Generate ed25519 SSH keypair
3. Install the public key in `~/.ssh/authorized_keys`
4. Reconnect with the new key to verify it works
5. **(new)** Install `docker.io` if missing (apt / dnf / yum auto-detected)
6. **(new)** Enable + start the docker daemon via systemctl
7. **(new)** Add SSH user to the `docker` group (skipped when user is root)
8. **(new)** Write `/etc/sudoers.d/ancientholdings-stoa` with NOPASSWD for
   `docker, mkdir, chmod, chown, openssl, tee, systemctl, screen, pkill, mv`
   (skipped when user is root — root doesn't need sudo)
9. **(new)** Verify `sudo -n docker --version` works — proves sudoers took
   effect before declaring bootstrap successful
10. Seal the private key in the vault
11. Show the private key to the operator once for external backup

All steps idempotent — safe to re-run against an already-prepared box
without breaking anything.

**UI changes** (`pages/admin/nodes/new.tsx`)

- Expanded "Easy setup" helper block into a dropdown listing the full
  11-step sequence with inline docs about password handling, distro
  detection, and root-user semantics. Labeled as recommended.
- Added equivalent dropdown on "Advanced" explaining when to use it
  (rare — only for pre-existing SSH key auth + manual prep) and what
  it skips vs Easy setup.
- Clarified the root-SSH caveat (modern Linux distros block root
  password login; use a non-root sudo user instead).

**What this enables**

A truly fresh Linux box (whether a home machine, a Hetzner VPS, or a
DigitalOcean droplet) can go from "ssh user + password" to "fully
hub-managed chainweb-node capable" in one click, with zero manual
prep. The install wizard from v0.7.1 now works end-to-end on any
blank Linux target.

**Code changes**
- `lib/nodes.ts` — new `prepareTarget()` function runs after SSH key
  install. Constructs a distro-aware setup script executed in one
  SSH round-trip under the existing password auth. Idempotent.
- `pages/admin/nodes/new.tsx` — copy expansion only; no logic change.
- Version bumped to `v0.7.1a`.

---

v.Chaos.Jason.0-av·v0.7.1

Hub can now provision a fresh chainweb-node container on any registered
server via a UI wizard — no SSH, no Haskell toolchain, no manual docker
commands. Uses the published `ghcr.io/stoachain/stoa-node:latest` image
(v2.32.0-stoa.1 or later).

**Pre-work shipped outside the hub repo** — StoaChain repo now has
first-class container support:
- `docker/entrypoint.sh` expanded to cover the full production flag surface
  (all ~30 flags, correct mining-coordination vs node-mining semantics,
  ECDSA P-384 cert auto-detection).
- `cabal.project` pins crypton 1.0.4 / memory 0.18.0 / merkle-log 0.2.0
  to survive Hackage drift from post-Kadena-shutdown dep churn.
- Published image at [ghcr.io/stoachain/stoa-node](https://github.com/StoaChain/stoa-chain/pkgs/container/stoa-node),
  GitHub Release [v2.32.0-stoa.1](https://github.com/StoaChain/stoa-chain/releases/tag/v2.32.0-stoa.1)
  with the raw binary attached for operators who don't use docker.

**On the hub side:**

- `lib/stoachain-layout.ts` — canonical `StoaNode/` layout generator.
  Every install creates `<root>/StoaNode/{chainweb,data/backups,tls}/`
  with docker-compose.yml + nginx.conf.example. Backups stay inside
  data/ (hardlink-friendly for RocksDB checkpointing). Service API
  defaults to `127.0.0.1` binding unless operator opts into public.
- `lib/stoachain-install-preflight.ts` — single-SSH-roundtrip env audit:
  docker installed + running, RAM ≥ 4 GB, drive class (NVMe/SSD/HDD),
  sudoers `NOPASSWD`, port 1789 / 1848 availability, whether any
  chainweb-node is already running. Returns structured report; HDD
  targets flagged red, warnings on low RAM, exact sudoers line included
  when sudo is denied.
- `lib/handlers/stoachain-install.ts` — orchestrator job handler:
    1. Create canonical layout (sudo mkdir + chown)
    2. Generate ECDSA P-384 cert + key at `tls/`, chmod 600 on key
    3. Render hub-managed docker-compose.yml from the chosen profile
    4. Write nginx.conf.example as reference (not applied)
    5. `docker pull ghcr.io/stoachain/stoa-node:latest`
    6. `docker compose up -d` in `chainweb/`
    7. Wait up to 120s for container's `/info` to respond
    8. Persist stoachain_flags_json + runner_path + last_action in DB
  Failure at any step aborts without auto-rollback; partial state
  visible on target for manual inspection.
- API routes:
    - `POST /api/admin/nodes/[id]/stoachain/preflight` — any admin; runs
      the checks and returns the structured report.
    - `POST /api/admin/nodes/[id]/stoachain/install` — Ancient-admin-only,
      fresh-confirm required; validates body (root path, hostname format,
      pubkey hex) then enqueues `stoachain-install` job.
- `components/admin/InstallWizard.tsx` — 6-step wizard:
    1. Preflight (auto-runs; advanced-override checkbox available if any
       fail and operator knows better)
    2. Storage (drive picker with size/class badges; auto-selects best)
    3. Identity (P2P hostname + optional cluster-id)
    4. Profile (Recommended vs Mining coordinator; pubkey field shown
       for mining; backup-API + public-service toggles)
    5. Review (shows full install plan + Apply button with
       fresh-confirm modal)
    6. Running (live progress bar + log tail, success banner with
       post-install manual steps checklist, or failure with recovery
       guidance)
  Shown on Chainweb tab only when the node has no chainweb-node running
  and no hub-managed runner path recorded — i.e. truly fresh targets.

**What you can now do**
- Register a fresh Linux box in Nodes → click Install chainweb-node →
  walk through 5 steps → have a syncing StoaChain node in ~2-5 minutes.
- Legacy node2 (screen-managed, pre-hub) is unaffected — the install
  wizard only appears on nodes without an existing chainweb-node.

**Explicitly deferred** (next phases)
- Flag editor UI to change flags post-install — v0.7.2
- Seeded install from donor .ahbk → faster bootstrap — v0.7.3
- Migrate existing screen-managed node2 to container mode — v0.7.4
- Hub-hosted container registry as backup to GHCR — v0.8.x
- Vendor deps into StoaChain repo for supply-chain resilience — v0.8.x

---

v.Chaos.Jason.0-aw·v0.7.0

First phase of the v0.7.x arc. The hub now understands chainweb-node
at the flag level, can read it live over SSH, display its identity /
storage / peer state, and Start/Stop/Restart it against the existing
screen-managed production node.

**What shipped**

- `docs/chainweb-reference.md` — ~7,900-word living reference covering
  every chainweb-node flag we care about (defaults, roles, warnings,
  ranges, citations into StoaChain Haskell source), the TLS certificate
  system, the P2P discovery cascade, service-API endpoint catalog, and
  an audit of the production runner script (14 of 35 flags are default
  no-ops, some like `--mining-update-stream-limit 50` are below default
  — candidates for cleanup).
- `lib/stoachain-flags-catalog.ts` — 40-flag catalog with role /
  category / type / default / recommended / warning / doc-anchor
  metadata, plus two named profiles:
    - **Ancient** — byte-for-byte reproduction of the production script
      (~35 flags).
    - **Recommended** — minimal-equivalent (~20 flags, same behavior).
- `lib/stoachain-flags.ts` — `fromPsArgs`, `fromScript`,
  `toRunnerScript`, `toDockerEnv`, `diffFlags`. One model, two
  materializations (bash runner for screen mode today, docker env
  for v0.7.1 container mode).
- `lib/stoachain-live.ts` extended:
    - `fetchLiveFlags` — parses the live `ps -eo args` into structured
      `ChainwebFlags`, detects parent runner script, classifies which
      named profile matches (or `custom`).
    - `fetchLiveCert` — reads TLS cert + key over SSH, runs `openssl`
      for SHA-256 fingerprint, subject, validity dates, key-file perms.
      Distinguishes persistent / ephemeral / missing modes.
    - `fetchLiveDrive` — walks data-directory → mount → block device →
      `/sys/block/.../queue/rotational`. Classifies as NVMe / SSD /
      HDD; used by the UI to flash a red warning when chainweb-node
      lives on a rotational drive.
- `lib/handlers/stoachain-control.ts` — Start / Stop / Restart job
  handler. On first touch, parses live argv into flags and saves them
  as the stored profile. Thereafter renders a hub-managed runner
  script (`<data-dir>/RunStoaNode.managed.sh`) from that profile on
  every Start/Restart; the operator's legacy runner is never
  overwritten. Stop sequence: `screen -X quit` → 10s grace → TERM →
  10s → KILL. Missing `sudo -n` permissions surface with the exact
  sudoers line to add.
- `lib/handlers/stoachain-cert-rotate.ts` — openssl-over-SSH cert
  generation + rotation. Refuses to run while chainweb-node is
  active. Two modes: `upgrade` (first-time cert from ephemeral) and
  `rotate` (archive existing to `*.TS.old`). Handler wired; UI
  button deferred to v0.7.1.
- Migration 014 — 8 new columns on `nodes`:
  `stoachain_flags_json`, `stoachain_flags_at`, `stoachain_profile`,
  `stoachain_binary_path`, `stoachain_runner_path`,
  `stoachain_last_action`, `stoachain_last_action_at`,
  `stoachain_last_action_by`. Trimmed from an earlier design —
  anything trivially queryable live over SSH (cert expiry, drive
  usage, live flags) is NOT persisted, to keep the DB narrow and
  avoid stale display.
- API endpoints:
    - `GET /api/admin/nodes/[id]/stoachain/status` — single round-trip
      payload: live status + cert + drive + flags + audit. Polled
      every 10 s from the UI.
    - `POST /api/admin/nodes/[id]/stoachain/control` — enqueues a
      `stoachain-control` job. Requires fresh admin confirm.
- `ChainwebTab` UI rebuild — seven cards:
    - **Status** — live tone badge, profile badge, per-chain height
      grid (10 cells), peer count, auto-refresh timestamp.
    - **Control** — Start / Stop / Restart buttons with
      confirm-password modal; disabled states wire to the live
      running flag; last-action audit line.
    - **Peer identity (TLS)** — color-coded badge (Certified /
      Ephemeral / Missing), fingerprint, subject, validity with
      days-until-expiry (red <30d, amber <90d), key-perm warning.
    - **Storage** — drive-class badge (NVMe / SSD / HDD), mount, fs
      type, capacity + used %, red warning on HDD.
    - **Flags** — grouped read-only table of every parsed flag,
      matching-profile badge, catalog-gap collapsible for unknown
      flags.
    - **Runner + binary** — paths for both; the hub-generated runner
      path is shown even before first Start so operators know where
      the managed script will land.
    - Existing **Backup** card unchanged.
- Profile classification: a simple equality check against Ancient and
  Recommended. Everything else classifies as `custom`.

**Research inputs**
- StoaChain Haskell source at `d:\_Claude\StoaChain\` (branch
  `AncientStoa`) — ground truth for flag defaults + semantics.
- `chainweb-node --help` output captured from production binary
  `StoaChain_2.32.0` (276 lines, `docs/research/chainweb-node-help.txt`).
- User's two runner scripts (`RunStoaNode.sh` and `.backupapi.sh`),
  captured over SSH.
- Kadena upstream docs (archived since Kadena Inc. shutdown
  2025-10-21; mainnet last block 2025-11-15).

**Explicitly deferred (come later in v0.7.x)**
- Flag editor UI (change profile / individual flags) — v0.7.2.
- Cert rotation UI (handler exists; button deferred) — v0.7.1.
- Container-mode detection (screen vs docker) — v0.7.1.
- Hub-driven install on fresh host — v0.7.2 (includes drive
  auto-select + canonical `StoaNode/{chainweb,data,tls}/` layout).
- Seeded install (donor .ahbk → new node) — v0.7.3.
- Screen → container migration button — v0.7.4.
- GHCR image publish workflow — v0.7.2 (StoaChain repo addition).

**Known gaps from research (future-Claude TODOs)**
- `--enable-local-timeout` semantics — Bool or µs? Flagged in doc §8.
- Backup-API on port 1848 is unauthenticated — documented in
  research §4, production workaround is SSH+localhost; long-term plan
  is firewall-to-loopback or nginx auth.
- `--bootstrap-reachability 0` silently masks a firewalled P2P port;
  v0.7.0 shows an amber warning in flags view when this is set.

---

▸Theseus2 historical entries · patch-number 0

Theseus Genesis · 2 historical entries · patch-number 0

v.Chaos.Theseus.0-a·v0.7.5g

Feedback round on v0.7.5:

- **Dropped** the `/admin` landing per-node gauge row. Doesn't scale past
  a handful of nodes — at 100 nodes it becomes a wall of jittering bars.
  The per-node live view already lives on each node's Monitoring tab,
  and the placeholder quick-link now reads "live · open any node →
  Monitoring tab" instead of the old "MON1 — coming soon".
- **Bootstrap now installs netdata + docker-compose-plugin**, not just
  docker + certbot. New nodes land with monitoring working
  out-of-the-box. Total prepare timeout bumped to 10 min (kickstart can
  be slow on fresh VPSes).
- **Bootstrap ticker + log reveal**. The synchronous POST is unchanged
  server-side (background-job rework is a later slice), but the UI now
  shows an 8-stage ticker while it runs (SSH check → key install → key
  verify → docker → certbot → sudoers → netdata → sealing). On success,
  a collapsible "what was installed on the target" panel shows the full
  server log of the prepare script.
- **Dynamic service paths in the probe.** StoaChain data dir is now
  resolved from the running chainweb-node argv (`--database-directory`),
  translated via `docker inspect` for containerised nodes, and only
  falls back to `/mnt/nvmedrive/StoaNodeData` when nothing else yields.
  IPFS repo path comes from `ipfs config Datastore.Path`, then searches
  common fallbacks (`/root/.ipfs`, `/home/*/.ipfs`, `/var/lib/ipfs`).
  The probe emits new `servicePaths` fields; `NodeProvisioning` uses
  them so tiles show the real path on each node instead of a universal
  default that only ever worked for one owner's home layout.
- **Trim system logs button** on the System-logs provisioning tile.
  Runs `journalctl --vacuum-time=7d` + `find /var/log -type f ( -name
  '*.gz' | *.1 | *.2 | *.3 | *.old | *.N ) -mtime +1 -delete`. Active
  logs never touched. Fresh-confirm required; returns before/after
  sizes so the UI shows how much was freed.

---

v.Chaos.Theseus.0-b·v0.7.5

Phase code → **MON1**. Closes the biggest gap left from §8 of the original
control-hub plan: monitoring + capacity awareness. Netdata install + live
CPU/RAM/load/net charts were already shipped; v0.7.5 adds everything built
on top of them.

### v0.7.5a — Per-mount disk space tiles

New [NodeDiskSpace.tsx](components/admin/NodeDiskSpace.tsx) component on every
node's Monitoring tab. Queries netdata's `/api/v1/charts` once to discover
every `disk.space` chart on the box, then samples each for current avail /
used / reserved. One tile per mount point — bar coloured green / amber / red
at 75 % / 90 % thresholds. Refreshes every 30 s.

### v0.7.5b — Per-node live gauges on `/admin` landing

New [AdminHomeGauges.tsx](components/admin/AdminHomeGauges.tsx) + batched
[/api/admin/nodes/[id]/summary](pages/api/admin/nodes/[id]/summary.ts)
endpoint. Each visible node gets a compact tile showing CPU %, RAM used %,
and worst-mount disk %. Tiles refresh every 15 s and link through to the
node detail page. Ownership-filtered — modern/client admins only see their
own nodes; ancient sees all.

### v0.7.5c — Service provisioning view

New [NodeProvisioning.tsx](components/admin/NodeProvisioning.tsx) on the
Monitoring tab. For each service detected in the system probe (StoaChain
data / StoaChain backups / IPFS repo / Mailcow / Docker / system logs), shows
its path + used bytes, resolves it to the mount it lives on (longest-prefix
match against `df` output), and renders the mount's headroom bar alongside.
Pure client-side derivation from existing probe data — no new SSH calls.

### v0.7.5d — `capacity_snapshots` table + daily worker capture

Migration 018 adds `capacity_snapshots(node_id, service_key, day,
used_bytes, mount_point, mount_total_bytes, mount_avail_bytes)` with
`UNIQUE(node_id, service_key, day)`. New
[lib/capacity-snapshots.ts](lib/capacity-snapshots.ts) module: `du -sb` +
`df -B1` over SSH for every tracked service path, idempotent per UTC day.
Hooked into the worker's hourly reap tick — rows only insert once per day,
so it's cheap to call hourly and survives worker restarts gracefully.

Bonus: the module exposes `getRecentSnapshots()` + `projectFillDay()`
(simple linear regression to project when the mount fills at current growth)
which power the "full in ~N d" hint on the provisioning tiles via a new
[/api/admin/nodes/[id]/capacity](pages/api/admin/nodes/[id]/capacity.ts)
endpoint.

### v0.7.5e — Active alerts surface

New [NodeAlerts.tsx](components/admin/NodeAlerts.tsx) at the top of the
Monitoring tab. Consumes netdata's `/api/v1/alarms` (already whitelisted in
the metrics proxy), filters to `WARNING` / `CRITICAL` only, shows name +
chart + current value + human-readable info. Refreshes every 30 s. Green
"no active alarms" state when everything's quiet.

### v0.7.5f — Backup pre-flight disk headroom gate

New [lib/disk-preflight.ts](lib/disk-preflight.ts) module with
`backupPreflight()`: probes the remote node's stage + source dirs and the
hub's landing dir, estimates backup size at `source × 0.8 × 1.2` (chainweb
snapshots land at ~0.8x the source RocksDB; 20 % safety margin), and aborts
with an actionable error before any remote work starts if headroom is
tight. Wired into [backup-stoachain.ts](lib/handlers/backup-stoachain.ts).
Example error:

> Remote node needs ~32.4 GiB free on /mnt/nvmedrive/StoaNodeData/backups but only 12.1 GiB available. Free space or increase the mount.

Previously this failed mid-stream after minutes of tar'ing, leaving the
remote backup dir half-full to reap manually. Now it's a clean "don't
start" with a clear remedy.

---

▸Midas1 historical entry · patch-number 0

Midas Genesis · 1 historical entry · patch-number 0

v.Chaos.Midas.0-a·v0.7.6

Concatenated summary of v0.7.6a through v0.7.6z. This series introduced the
off-chain Stoicism reward layer — the hub's flagship feature.

- **Migration `019_stoic_power.sql` + `021_stoic_power_ledger.sql`** — full ledger
  schema: `stoic_power_accounts` (keyed by `ouronet_account`, NOT email),
  `stoic_power_events`, `stoic_power_daily`. Later renamed to `stoicism_*` in
  migration 029 for user-facing consistency; internal code still uses
  `stoic-power.ts` filenames.
- **Eligibility engine.** Seven gates for accrual: benchmark stamped, Ouronet
  account set, commitment ≥ fleet-min, warmup complete (shadow-override
  settable), cut within tip-tolerance of fleet peak, peer count (later
  retired in v0.7.7r), not in breach (flag-violation gate documented, enforcement
  lands v0.7.8z+). Node fails any gate → pending pool, no mint.
- **Shadow vs live mode.** Ancient-admin-only ScoringModeCard:
  SHADOW = accumulating for validation, not published on-chain; LIVE = authoritative
  accrual. Flip-to-live zeroes shadow points (warmup preserved); revert-to-shadow
  preserves ledger. Schedule-auto-flip writes `system_state.scoring_mode_flip_at`
  consumed by the scoring worker at the top of every tick.
- **Per-second accrual rate.** `Stoicism/sec = ServerScore × 0.001`; live display
  on the scoring card. Pending pool during warmup; flips to current on warmup
  completion.
- **Warmup model.** 24 h of cut-within-tip-tolerance before points move from
  PENDING to CURRENT. Shadow mode lets admins override (1–1440 min) for testing;
  live mode locks at 24 h.
- **Ouronet account override per-node.** Operators earn into their profile-set
  Ouronet by default; ancient admin can pin a per-node override (e.g. treasury
  node). Lock flag prevents operator from clearing the override.
- **Rich-list page (v0.7.6m).** Top accounts by lifetime Stoicism; ancient-only
  initially; later (v0.7.8) backed by an hourly materialised view for scale.
- **Earnings page (v0.7.6j).** Per-operator ledger view: current balance, pending
  points, daily accrual breakdown, event log filterable by node.
- **Scoring worker (v0.7.6x).** Tick every ~10 s; for each node in the eligibility-
  passing set, compute accrual delta, insert `stoic_power_events`, upsert
  `stoic_power_accounts.current_balance`. Batched in transactions.
- **Mint model (design).** Daily 06:00 UTC register-aggregation mint — one batched
  `update-registers` Pact tx per chain. Documented for v0.8.x implementation;
  shadow-mode today accumulates without on-chain publication.

---

Current build: vH.1.23 · Chronos — The H.1.x benchmark/scoring rehaul arc