Findings funnel¶
This page is the public ledger of every defect surfaced by the v0.6.3.1 A2A campaign and where each one is in the Patch 2 (v0.6.3.2) funnel. Findings are emitted by Phase 4 meta-analysis and committed in Phase 5 per docs/governance.md — that document is authoritative; this page is the rendered surface.
The funnel rolls up to umbrella tracking issue alphaonedev/ai-memory-mcp#511.
How findings flow (5-phase view)¶
Phase 4 emits findings; Phase 5 hands them off to the issue tracker. The narrow funnel:
- Detection (Phase 1 or Phase 3). A scenario fails or a Phase 3 metric crosses a threshold. The runner writes a structured record — substrate failures land in
runs/<run-id>/scenario-*.json; Phase 3 NHI logs land inruns/<run-id>/phase3-<scenario>-<arm>-run<n>.jsonper the §7 schema. - Classification (Phase 4). The third Claude meta-analyst (no namespace access) reads the logs, computes the §8.2 metrics, and assigns each finding a class from the §8.4 taxonomy below. The result lands in
phase4-analysis.json. - Sync (Phase 5).
findings-sync.ymlopens or updates a child issue onalphaonedev/ai-memory-mcpfor each finding taggedcarry_forward_patch2orcarry_forward_v0_6_4. Each issue is parent-linked to umbrella#511and labelledbug+v0.6.3.2-candidate(orv0.6.4-candidate). - Closure. A
v0.6.3.2-candidateissue closes when its fix lands on therelease/v0.6.3.2branch and the corresponding scenario flips green on the successorai-memory-a2a-v0.6.3.2campaign's first run.
A finding only enters Patch 2 if it has a scenario that reproduces it, an issue thread on ai-memory-mcp, the v0.6.3.2-candidate label, a parent-link to #511, and a milestone. Anything missing one of those does not count.
Finding classes (per governance §8.4)¶
The Phase 4 meta-analyst classifies each finding into exactly one of:
| Class | Meaning | Where it goes |
|---|---|---|
carry_forward_patch2 |
Real defect in ai-memory-mcp; fix scheduled for v0.6.3.2. |
Child issue under #511, label v0.6.3.2-candidate. |
carry_forward_v0_6_4 |
Real defect, but out of Patch 2 scope. Tracked for the next minor release. | Child issue under #511, label v0.6.4-candidate. |
harness_defect |
The test, not the product, is wrong. | Issue on the harness repo (ai-memory-a2a-v0.6.3.1 or ai-memory-ai2ai-gate). |
documentation_defect |
Product is correct but its documented behavior is wrong. | Doc-fix PR against ai-memory-mcp. |
wont_fix |
Real finding, accepted as-is. | Recorded in phase4-analysis.json with rationale; no child issue opened. |
needs_review |
Default. The meta-analyst could not classify with confidence. | Escalates to the human maintainer. |
Each finding's class is recorded verbatim in phase4-analysis.json. Phase 5 reads that file and routes accordingly.
Pre-campaign reds (known-open on v0.6.3.1)¶
These two defects the campaign knows about going in. They are encoded as the expected-red scenarios S23 and S24 so the harness can prove on every run that it can detect them. Both have a defaulted Phase 4 classification of carry_forward_patch2.
| Scenario | Issue | Severity | Title | Expected disposition | Role |
|---|---|---|---|---|---|
S23 |
#507 |
medium | config.toml ~ expansion |
carry_forward_patch2 in v0.6.3.2 |
Seed defect for Patch 2. Anchors umbrella tracking issue #511. |
S24 |
#318 |
high | MCP stdio writes bypass federation fanout | carry_forward_patch2 in v0.6.3.2 |
Patch 2 candidate. Asymmetric: reads via stdio fine, writes silently diverge. NHI correlate is Phase 3 Scenario D. |
If either ever returns GREEN on this v0.6.3.1 campaign, the harness is broken — the Orchestrator halts and files a harness_defect rather than letting the run complete with a misleading verdict (per governance Principle 2).
Why these two specifically¶
#507is a one-character-class bug — the loader does not expand~before passing the path to SQLite. Clean integrity check: failure mode is binary (open succeeds or it does not), no flake surface, and the fix is small enough to plausibly land in Patch 2 alone.#318is the more serious of the two — a silent correctness break in the federation fanout layer for one specific transport (MCP stdio). It exercises the harness's ability to detect divergence between mesh nodes rather than just per-node failures, and it has a behavioral correlate at the NHI layer (Phase 3 Scenario D — Federation honesty).
Together they cover both ends of the harness's detection range: a deterministic per-node failure and a multi-node divergence whose NHI correlate feeds the cross-layer consistency table (governance §8.3).
Both flip to expected-green in ai-memory-a2a-v0.6.3.2 once Patch 2 ships.
Cross-layer consistency findings¶
Findings derived from the governance §8.3 consistency table — i.e. rows where the substrate-layer outcome and the NHI-layer correlate disagree — get a separate, higher-priority class because they mean either the substrate test or the NHI test is wrong, and either answer is structurally important.
| Row state | Disposition |
|---|---|
| Both layers RED (e.g. S24 RED + Scenario D context-loss observed on v0.6.3.1) | Consistent. Recorded but no new finding. |
| Both layers GREEN (e.g. S24 GREEN + Scenario D context-propagation observed on Patch 2) | Consistent. Recorded as the regression baseline. |
| Substrate RED, NHI GREEN | Inconsistent — harness_defect candidate. The NHI test is not exercising the bypass path; substrate is the source of truth and the playbook is broken. |
| Substrate GREEN, NHI RED | Inconsistent — carry_forward_patch2 candidate. The substrate test missed a real failure mode the NHI playbook surfaced; substrate scenario needs tightening and the underlying defect routes to Patch 2. |
Campaign-discovered findings¶
Empty until first campaign run completes.
This section is populated by findings-sync.yml from the issue tracker on ai-memory-mcp, filtering for v0.6.3.2-candidate (or v0.6.4-candidate) issues parent-linked to umbrella #511 and not already listed in Pre-campaign reds.
The expected schema, once it lands:
| Run | Source phase | Scenario / metric | Cell or arm | Issue | Severity | Class | Status |
|---|---|---|---|---|---|---|---|
r2 |
Phase 1 / Phase 4 | (e.g. S15 / Phase 4 grounding-rate gap) |
ironclaw / mTLS or arm-T / scenario-B |
#NNN |
high / medium / low | carry_forward_patch2 |
open / fix-merged / verified |
Findings stay listed here even after their issues close — they are part of the immutable cert artifact for the v0.6.3.1 release.
Patch 2 funnel — operator hand-off¶
The end-to-end funnel for an operator:
- Phase 5 commits
releases/v0.6.3.1/summary.jsonwith the substrate + NHI verdicts andphase4-analysis.json. - Operator runs
findings-sync.yml(workflow_dispatch). The workflow opens / updates each finding's child issue under #511. - The Patch 2 candidate list under #511 is the canonical view of "everything Patch 2 needs to fix." It rolls up to the
v0.6.3.2milestone onai-memory-mcp. - When every parent-linked issue's fix has merged and the corresponding scenarios are green on the successor
ai-memory-a2a-v0.6.3.2campaign's first run, #511 closes.
See the runbook for the operator's exact button-pushing.
Cross-links¶
- Governance §8 — authoritative finding classification
- Scope — verdict criteria
- Matrix — substrate cells + Phase 3 cells + cross-layer consistency table
- Runbook — Phase 5 hand-off mechanics
- Subject under test:
alphaonedev/ai-memory-mcp - Umbrella tracking issue:
alphaonedev/ai-memory-mcp#511 - Patch 2 milestone:
v0.6.3.2onalphaonedev/ai-memory-mcp/milestones - Successor repo:
alphaonedev/ai-memory-a2a-v0.6.3.2(will exist when Patch 2 tags)