../ runs index

Campaign a2a-hermes-v0.6.2-patch2-r23-tls FAIL

Agent group
hermes (homogeneous)
ai-memory ref
release/v0.6.2
Completed at
2026-04-23T16:50:55Z
Overall pass
false
Skipped reports
0

Infrastructure

Provider
digitalocean
Region
nyc3
Droplet size
s-2vcpu-4gb
Topology
4-node federation mesh (W=2/N=4)
Scenarios started
Scenarios ended
Dispatched by
alphaonedev
Harness SHA
1bd0ee3aa288
Workflow run
https://github.com/alphaonedev/ai-memory-ai2ai-gate/actions/runs/24847546654

Node roster

#RoleAgent IDPublic IPPrivate IP
1agentai:alice
2agentai:bob
3agentai:charlie
4memory-only

Run focus

Campaign failed: no scenario reports recovered.

What this campaign tested: The campaign requested 35 scenarios to exercise agent-to-agent memory sharing across various transports, frameworks, and primitives under TLS configuration.

What it demonstrated: The run demonstrated a complete failure in report recovery, proving nothing about the system's functionality as no scenario outcomes were captured.

AI NHI analysis · Claude Opus 4.7

Campaign failed: no scenario reports recovered.

FAIL — no results due to missing reports.

For three audiences

Non-technical end users

This test was meant to check if AI agents can reliably share memories with each other, but it didn't work at all. No results came back because the system couldn't collect any data from the tests. This means we don't know if the agents share memories properly in this setup.

C-level decision makers

This failed run indicates high risk in the current harness and infra stability, delaying production readiness assessments. No customer-facing claims on TLS-enabled memory sharing can be validated yet. Compared to prior runs, this represents a regression in test execution reliability.

Engineers & architects

The artifact shows overall_pass false with reason 'no scenario reports recovered', affecting all 35 requested scenarios (e.g., S1, S1b, S2 through S42). Probable root cause is a harness failure in report aggregation, possibly tied to CI workflow (harness_sha: 1bd0ee3aa288f24b2de73f13157d9ee6a0b59114) or infra provisioning on DigitalOcean nyc3. No primitives or failure modes observable due to total absence of per-scenario data.

What changes going into the next campaign

Add debug logging and retries to the report recovery step in the CI harness before the next campaign.

All artifacts