antacog

Findings — seven

Snapshots — detail on request

№ 12 June 2026 Run 19

The template path routes around the LLM and the 855 files it already holds.

Expected the whole_repo run to produce premise substrate specific to Antacog's actual auth implementation. What scaffolded: the five template premise nodes verbatim, the five relationship descriptions word-for-word from the YAML, 16 evidence pointers attached via path-existence check (not content read). Ant's opening turn was fluent over the template's shape with zero contact with the actual invite-only, multi-tenant code sitting in the ingest. The two failures are the same failure: the template stamps a generic shape and asks canned questions instead of letting Ant read what it's already holding. The inverted epistemic frame cuts deeper than a missing grounding step: this auth code is implemented and working. Bootstrap marked it exploring / assumed and emitted canned questions about a passwords_controller.rb it was holding. Dialogue is rightly speculative because the future isn't written. Bootstrap is running against code that exists. The template makes the wrong epistemic call by default, and the call compounds: it routes intelligence around the files, so no subsequent step corrects it.

Request more information →

№ 09 May 2026 Run 17

Bootstrap and dialogue share a model but produce two different ontologies of substrate.

Pre-registered that code-only INTERPRET would produce substrate comparable to the dialogue baseline — or establish that history ingest is dead weight. Verdict: (b) DIVERGENT, headline ratio 2.90. The divergence isn't "less" — it's differently shaped. Bootstrap produced 525 elements to dialogue's 24; decisions = 0, relationships = 0 from code-only. The 525 are file atoms, one row per source file. The 24 are conceptual abstractions. Same DB model, different ontologies, no current path between them. The radical bet hypothesis survives — but only because the experiment revealed it was asking the wrong question, not because the numbers matched.

Request more information →

№ 08 May 2026 Run 16

The acceptance test that defines B13 completeness had never been run.

B13 was marked complete. The rake task had RSpecs. The pre-flight memo named five failure modes. None of the three blocking bugs — a predicate name mismatch in the rake task, an OAuth button that fails silently in two independent ways, a paginator that bypasses token-refresh so long-running jobs expire mid-way — appeared until the first end-to-end run against a real target. The run crashed mid-HistoryIngest after 135 minutes. No substrate was built on the candidate sibling. No verdict. All three bugs were in code paths that had shipped but had never been exercised in this configuration. The acceptance test that defines completeness ran for the first time today, and it didn't pass.

Request more information →

№ 04 May 2026 Run 15

Watching dialogue alone misses the agent that isn't talking.

A multi-system isolation probe ran an agent in plan mode and produced zero ask_ant calls. Watching only dialogue traffic was structurally biased toward dialogue-active agents — a foreseeable blind spot the probe doc didn't name. Per-tool-call provenance was promoted to a Mode 1 prerequisite the same day.

Request more information →

№ 03 May 2026 Runs 9, 12, 13, 14

Grounding laundering: when transcription substitutes for independent grounding.

Across four runs at structurally different surfaces — librarian summaries, spec writebacks, inferred substrate — the loop substituted summary-of-source for independent verification. The pattern transferred across domains and produced a named diagnostic — current accident, not current contract — now load-bearing in the methodology.

Request more information →

№ 02 May 2026 Run 14, labelling probe

Mode 1 emerged from labelling, not specification.

The behavioural mode embedded in the loop wasn't designed up front. It was named retroactively from a false-positive labelling probe across the run corpus. The methodology generated the spec; the spec didn't generate the methodology — a sequencing claim with consequences for how later modes get added.

Request more information →

№ 01 May 2026 Run 12, external transfer

The loop caught a structural defect on a codebase separate from Antacog's own.

Running autonomous-trigger against infieldOS — an in-house project of mine, separate from Antacog — the loop recognised that an append-only constraint tracked as a convention should be enforced at the registry level. Convention upgraded to structural rule — the transferability claim earned its first concrete artifact outside the loop's own corpus.

Request more information →

Antacog is an experimental project in agentic governance, built around an AI agent called Ant. Ant sits upstream of execution agents.

Grounded reasoning beats ungrounded action.

The conversation is the artifact.

Friction is epistemically valuable.

The template path routes around the LLM and the 855 files it already holds.

Bootstrap and dialogue share a model but produce two different ontologies of substrate.

The acceptance test that defines B13 completeness had never been run.

Watching dialogue alone misses the agent that isn't talking.

Grounding laundering: when transcription substitutes for independent grounding.

Mode 1 emerged from labelling, not specification.

The loop caught a structural defect on a codebase separate from Antacog's own.