../ runs index

Campaign a2a-hermes-v3r15-mtls-develop FAIL

Agent group
hermes (homogeneous)
ai-memory ref
develop
Completed at
2026-04-22T19:28:39Z
Overall pass
false
Skipped reports
0

Infrastructure

Provider
digitalocean
Region
nyc3
Droplet size
s-2vcpu-4gb
Topology
4-node federation mesh (W=2/N=4)
Scenarios started
Scenarios ended
Dispatched by
alphaonedev
Harness SHA
006cdf4a787b
Workflow run
https://github.com/alphaonedev/ai-memory-ai2ai-gate/actions/runs/24797477930

Node roster

#RoleAgent IDPublic IPPrivate IP
1agentai:alice104.131.117.4710.11.2.3
2agentai:bob104.236.18.12110.11.2.2
3agentai:charlie45.55.240.21910.11.2.4
4memory-only159.89.189.23610.11.2.5

Baseline attestation BASELINE VIOLATION

Per the authoritative baseline spec, every agent node must emit a self-attestation before any scenario is permitted to run. This run's attestation:

Spec version: 1.0.0 — see authoritative baseline.

NodeAgentFrameworkAuthenticMCP ai-memoryxAI cfgxAI defaultAgent IDFederationUFW offiptablesdead-manF1 xAIF2a substrateF2b agent (non-gating)Config SHAPass
a2a-baseline.json
{
	"baseline_pass": false,
	"per_node": [],
	"failure_mode": "baseline-absent"
}

raw file

Run focus

Campaign failed: no scenario reports recovered.

What this campaign tested: No scenarios were exercised, despite requesting 35 scenarios across various transport, framework, and primitive axes, due to report recovery failure.

What it demonstrated: The run proved a critical failure in the testing infrastructure or harness, as no scenario results were generated or recovered.

AI NHI analysis · Claude Opus 4.7

Campaign failed: no scenario reports recovered.

FAIL — no scenario reports recovered

For three audiences

Non-technical end users

This test run didn't produce any results because no reports were collected from the scenarios. We can't tell if agents reliably share memories or not. It seems like there was a problem with the setup or data collection.

C-level decision makers

The campaign completely failed to yield results, elevating risk posture due to untested changes in the develop branch. Production readiness remains unassessed, and no customer-facing claims can be validated. No progress or changes detected versus prior runs since nothing was tested.

Engineers & architects

No scenario reports were recovered for any of the 35 requested scenarios (e.g., S1, S2, S4 through S42), indicating a harness failure at sha 006cdf4a787bfec7cfc5007fd40ae990e22e5860. All coverage axes (transport, framework, primitives) were untested, effectively skipping everything. Probable root cause: error in infrastructure provisioning, scenario execution, or report aggregation in the CI workflow.

What changes going into the next campaign

Debug and resolve report recovery issues in the CI harness to ensure scenario results are captured.

All artifacts