17 agents at peak, each owning one phase or batch. Coordinated through ai-memory itself — the system being tested is the system that holds the test campaign's state. Every phase write becomes a memory; every completion becomes an A2A notification; the orchestrator drains its inbox and updates the evidence page in real time.
Every test agent's progress is a memory. Every completion is an A2A notification. The orchestrator memory_subscribes to release/v0.6.3/* and memory_inbox drains the queue every 30s. This is not a clever stunt — it's the strongest possible integration test. If hierarchical namespaces work, the campaign succeeds. If A2A messaging works, the orchestrator hears the workers. If governance gates work, the campaign's writes pass through them. The test campaign exercises the whole product surface under real workload at a scale we'd otherwise need to simulate.
release/v0.6.3/* via memory_subscribe. Drains memory_inbox every 30s. Aggregates results into the test-hub evidence page. Detects timeouts (worker silent for > expected wall × 2) and surfaces stuck workers.phaseN_*.sh script, captures artifacts in the gate repo's runs/ dir, and notifies the orchestrator on completion. Independent — phases don't share state, so a Phase 4 chaos failure doesn't block the Phase 1-3 gates from being marked green.install → --version assertion → start daemon → assert memory_capabilities returns schema_version=2. Independent — no shared state.release/v0.6.3/<phase>), governance gates (campaign writes pass through them), federation (orchestrator on node-1, workers reporting from node-2/3), A2A messaging (notify/inbox), capabilities introspection.memory_kg_timeline on the campaign-root memory returns the entire run as a chronological event sequence. Better than ad-hoc log files.collect_reports.sh). Workers still complete their work; orchestrator just sees results late. No loss of fidelity.memory_inbox polling loop catches everything. The webhook is a fast-path optimization, not a correctness requirement.| Component | Status | Source |
|---|---|---|
| Ship-gate phase scripts | ▸ exists | ai-memory-ship-gate |
| A2A scenarios 1-42 | ▸ exists | ai-memory-ai2ai-gate |
| A2A scenarios 43-48 (v0.6.3-only) | ▸ to author | ai-memory-ai2ai-gate (~45m) |
| memory_notify / memory_subscribe / memory_inbox | ▸ exists (v0.6.0+) | ai-memory daemon |
| Terraform per-phase targets | ▸ exists | both gate repos |
| Orchestrator script | ▸ to build (~3h) | this test-hub repo |
| Webhook endpoint (test-hub) | ▸ to build (~30m) | this test-hub repo |
| Evidence-page auto-update | ▸ to build (~1h) | this test-hub repo |
| Channel-smoke GitHub Actions workflow | ▸ to build (~30m) | ai-memory-mcp |
~5h total scaffolding for the orchestrator + webhook + auto-update + workflow. After that, the orchestrator is reusable for every future release. See the parallel-testing page for the full time math.