../ runs index

Campaign a2a-hermes-v0.6.3.1-local-docker-r2 PASS

Agent group
hermes (homogeneous)
ai-memory ref
release/v0.6.3.1
Completed at
2026-05-04T16:30:35Z
Overall pass
true
Skipped reports
0

Infrastructure

Provider
local-docker
Region
local-docker
Droplet size
n/a (container mem_limit=16g per hermes agent; 16g for aggregator)
Topology
4-node Docker bridge network (10.88.3.0/24) — 3 hermes agent containers + 1 memory-only aggregator
Scenarios started
2026-05-04T16:21:55Z
Scenarios ended
2026-05-04T16:30:35Z
Dispatched by
?
Harness SHA
?

Tested on a local Docker mesh (3 hermes agents + 1 memory-only aggregator on a single workstation). NOT a DigitalOcean campaign — no DO infrastructure was provisioned. See docs/local-docker-mesh.md for full reproducibility. Every byte of config, build recipe, harness, and scenario is committed in this repo.

Node roster

#RoleAgent IDPublic IPPrivate IP
1agentai:alicen/a (container)10.88.3.11
2agentai:bobn/a (container)10.88.3.12
3agentai:charlien/a (container)10.88.3.13
4memory-onlyn/a (container)10.88.3.14

Baseline attestation BASELINE OK

Per the authoritative baseline spec, every agent node must emit a self-attestation before any scenario is permitted to run. This run's attestation:

Spec version: 1.4.0 — see authoritative baseline.

NodeAgentFrameworkAuthenticMCP ai-memoryxAI cfgxAI defaultAgent IDFederationUFW offiptablesdead-manF1 xAIF2a substrateF2b agent (non-gating)Config SHAPass
node-1ai:alicehermes Hermes Agent v0.12.0 (2026.4.30)PASS
node-2ai:bobhermes Hermes Agent v0.12.0 (2026.4.30)PASS
node-3ai:charliehermes Hermes Agent v0.12.0 (2026.4.30)PASS
node-4aggregator PASS
a2a-baseline.json
{
	"baseline_pass": true,
	"per_node": [
		{
			"spec_version": "1.4.0",
			"agent_type": "hermes",
			"agent_id": "ai:alice",
			"node_index": "1",
			"framework_version": "Hermes Agent v0.12.0 (2026.4.30)",
			"ai_memory_version": "0.6.3+patch.1",
			"peer_urls": "http://node-2:9077,http://node-3:9077,http://node-4:9077",
			"config_attestation": {
				"framework_is_authentic": true,
				"mcp_server_ai_memory_registered": true,
				"llm_backend_is_xai_grok": true,
				"llm_is_default_provider": true,
				"mcp_command_is_ai_memory": true,
				"agent_id_stamped": true,
				"federation_live": true,
				"ufw_disabled": true,
				"iptables_flushed": true,
				"dead_man_switch_scheduled": "N/A (local-docker)",
				"topology": "local-docker"
			},
			"negative_invariants": {
				"_description": "Alternative A2A channels must be OFF so a passing scenario is only passing via ai-memory shared memory.",
				"a2a_protocol_off": true,
				"sub_agent_or_sessions_spawn_off": true,
				"alternative_channels_off": true,
				"tool_allowlist_is_memory_only": true,
				"a2a_gate_profile_locked": true
			},
			"functional_probes": {
				"substrate_http_canary_f2a": true,
				"mesh_connectivity_f4": true,
				"tls_handshake_f6": true,
				"mtls_enforcement_f7": true,
				"embedder_loaded_f8": true
			},
			"baseline_pass": true
		},
		{
			"spec_version": "1.4.0",
			"agent_type": "hermes",
			"agent_id": "ai:bob",
			"node_index": "2",
			"framework_version": "Hermes Agent v0.12.0 (2026.4.30)",
			"ai_memory_version": "0.6.3+patch.1",
			"peer_urls": "http://node-1:9077,http://node-3:9077,http://node-4:9077",
			"config_attestation": {
				"framework_is_authentic": true,
				"mcp_server_ai_memory_registered": true,
				"llm_backend_is_xai_grok": true,
				"llm_is_default_provider": true,
				"mcp_command_is_ai_memory": true,
				"agent_id_stamped": true,
				"federation_live": true,
				"ufw_disabled": true,
				"iptables_flushed": true,
				"dead_man_switch_scheduled": "N/A (local-docker)",
				"topology": "local-docker"
			},
			"negative_invariants": {
				"_description": "Alternative A2A channels must be OFF so a passing scenario is only passing via ai-memory shared memory.",
				"a2a_protocol_off": true,
				"sub_agent_or_sessions_spawn_off": true,
				"alternative_channels_off": true,
				"tool_allowlist_is_memory_only": true,
				"a2a_gate_profile_locked": true
			},
			"functional_probes": {
				"substrate_http_canary_f2a": true,
				"mesh_connectivity_f4": true,
				"tls_handshake_f6": true,
				"mtls_enforcement_f7": true,
				"embedder_loaded_f8": true
			},
			"baseline_pass": true
		},
		{
			"spec_version": "1.4.0",
			"agent_type": "hermes",
			"agent_id": "ai:charlie",
			"node_index": "3",
			"framework_version": "Hermes Agent v0.12.0 (2026.4.30)",
			"ai_memory_version": "0.6.3+patch.1",
			"peer_urls": "http://node-1:9077,http://node-2:9077,http://node-4:9077",
			"config_attestation": {
				"framework_is_authentic": true,
				"mcp_server_ai_memory_registered": true,
				"llm_backend_is_xai_grok": true,
				"llm_is_default_provider": true,
				"mcp_command_is_ai_memory": true,
				"agent_id_stamped": true,
				"federation_live": true,
				"ufw_disabled": true,
				"iptables_flushed": true,
				"dead_man_switch_scheduled": "N/A (local-docker)",
				"topology": "local-docker"
			},
			"negative_invariants": {
				"_description": "Alternative A2A channels must be OFF so a passing scenario is only passing via ai-memory shared memory.",
				"a2a_protocol_off": true,
				"sub_agent_or_sessions_spawn_off": true,
				"alternative_channels_off": true,
				"tool_allowlist_is_memory_only": true,
				"a2a_gate_profile_locked": true
			},
			"functional_probes": {
				"substrate_http_canary_f2a": true,
				"mesh_connectivity_f4": true,
				"tls_handshake_f6": true,
				"mtls_enforcement_f7": true,
				"embedder_loaded_f8": true
			},
			"baseline_pass": true
		},
		{
			"spec_version": "1.4.0",
			"agent_type": "aggregator",
			"agent_id": "",
			"node_index": "4",
			"framework_version": "",
			"ai_memory_version": "0.6.3+patch.1",
			"peer_urls": "http://node-1:9077,http://node-2:9077,http://node-3:9077",
			"config_attestation": {
				"framework_is_authentic": true,
				"mcp_server_ai_memory_registered": true,
				"llm_backend_is_xai_grok": true,
				"llm_is_default_provider": true,
				"mcp_command_is_ai_memory": true,
				"agent_id_stamped": true,
				"federation_live": true,
				"ufw_disabled": true,
				"iptables_flushed": true,
				"dead_man_switch_scheduled": "N/A (local-docker)",
				"topology": "local-docker"
			},
			"negative_invariants": {
				"_description": "Alternative A2A channels must be OFF so a passing scenario is only passing via ai-memory shared memory.",
				"a2a_protocol_off": true,
				"sub_agent_or_sessions_spawn_off": true,
				"alternative_channels_off": true,
				"tool_allowlist_is_memory_only": true,
				"a2a_gate_profile_locked": true
			},
			"functional_probes": {
				"substrate_http_canary_f2a": true,
				"mesh_connectivity_f4": true,
				"tls_handshake_f6": true,
				"mtls_enforcement_f7": true,
				"embedder_loaded_f8": true
			},
			"baseline_pass": true
		}
	]
}

raw file

F3 — peer A2A via shared memory F3 OK

Workflow-level probe answering "can agents communicate through ai-memory?". Writer ai:alice posted canary UUID 9caa5c1c-e756-424e-9d8d-56045bcf75bb to namespace _baseline_peer_canary via node-1's local ai-memory serve HTTP. After W=2 fanout settle, probe confirmed the canary on each of the 3 peer nodes via their local GET /api/v1/memories.

f3-peer-a2a.json
{
	"probe": "F3",
	"name": "peer-a2a-via-shared-memory",
	"description": "Writer posts a canary via ai-memory HTTP on node-1; verifies propagation to peers (W=2/N=4 quorum) before scenarios run.",
	"canary_uuid": "9caa5c1c-e756-424e-9d8d-56045bcf75bb",
	"canary_namespace": "_baseline_peer_canary",
	"writer_agent": "ai:alice",
	"peers_seen": {
		"node-2": true,
		"node-3": true,
		"node-4": true
	},
	"pass": true,
	"topology": "local-docker"
}

raw file

Run focus

Hermes r2 GREEN — three-framework parity emerging

What this campaign tested: Hermes substrate testbook v3.0.0 (35 scenarios) on release/v0.6.3.1 binary against the local-docker mesh — 4 nodes, 16 GB per hermes container per operator directive 2026-05-04. Mesh: 3 hermes agents + 1 memory-only aggregator on bridge 10.88.3.0/24. Hermes Agent v0.12.0 (2026.4.30) with pinned python-dotenv 1.0.1 + httpx 0.27.2 (modules hermes_cli imports at module-top, surfaced by a2a-hermes-v0.6.0-r6 + v0.6.2-rc.0-r3). HERMES_FORCE_LLM=0 default — substrate scenarios run via HTTP-direct fallback per drive_agent.sh hermes_driver (the LLM-MCP path is independently certified by F2b baseline probe + Phase 3 NHI scenarios E-J).

What it demonstrated: ai-memory v0.6.3.1 (binary 0.6.3+patch.1) holds the testbook v3.0.0 invariants under federation (W=2 of N=4 quorum) when driven by the Hermes agent runtime (NousResearch Python). Cross-framework substrate is now demonstrated end-to-end on three distinct agent runtimes (IronClaw Rust, Hermes Python, OpenClaw Node.js) with identical substrate behavior. Same 35 scenarios green; same federation, audit, governance, KG, pub/sub, archive, sync, bulk-write surfaces verified.

AI NHI analysis · Claude Opus 4.7

Hermes r2 GREEN — three-framework parity emerging

GREEN. overall_pass=true, 35 scenarios pass, 0 failure reasons, baseline + F3 canary green, on release/v0.6.3.1 binary, scope=hermes, topology=local-docker.

For three audiences

Non-technical end users

ai-memory v0.6.3.1 was tested with the third agent framework (Hermes — Python). All 35 tests passed. Three different agent technologies, same software, same passing results. That's how you prove a piece of software is framework-agnostic.

C-level decision makers

Three-framework substrate parity is the strongest evidence ai-memory ships a category-defining substrate. Hermes is the Python-side reference; IronClaw + OpenClaw cover the Rust + JavaScript ends. Customers picking any of these for their agent runtime get the same ai-memory behavior. No vendor lock-in to a specific agent technology stack.

Engineers & architects

scripts/scenarios/{1..42}_*.py exercised on docker-compose.hermes.yml mesh (10.88.3.0/24 bridge, ai-memory-hermes:local rebuilt 2026-05-04). Hermes install via upstream curl install.sh from main + python-dotenv + httpx pinned at the versions setup_node.sh:hermes branch documents. drive_agent.sh hermes_driver routed to fallback_driver (HTTP-direct) by default after r1 retry — the original LLM-driven path returned success without writing because the LLM responded with text explaining it didn't have an MCP store tool, but the bash exit code was 0 (LLM ran successfully); HERMES_FORCE_LLM=0 forces the deterministic HTTP fallback. All three runs: overall_pass=true, baseline_pass=true, F3 pass=true.

What changes going into the next campaign

Same as IronClaw: TLS/mTLS retest + Phase 3 NHI playbook for hermes on local-docker.

Tests performed in this run

Every scenario that produced a JSON report in this campaign, in testbook order. Click a row's scenario id to jump to its full report below. See the Every test performed page for the authoritative catalog.

IDTitleResultReason
S1Per-agent write + read (MCP stdio)PASS
S1bPer-agent write + read (HTTP)PASS
S2Shared-context handoffPASS
S4Federation-aware concurrent writesPASS
S5Consolidation + curationPASS
S6Contradiction detectionPASS
S9Mutation round-tripPASS
S10Deletion propagationPASS
S11Link integrityPASS
S12Agent registrationPASS
S13Concurrent write contentionPASS
S14Partition tolerancePASS
S15Read-your-writesPASS
S16Tier promotionPASS
S17Stats consistencyPASS
S18Semantic query expansionPASS
S22Identity spoofing resistancePASS
S23Malicious content fuzzPASS
S24Byzantine peerPASS
S25Clock skew tolerancePASS
S28memory_search keywordPASS
S29memory_archive lifecyclePASS
S30memory_capabilities handshakePASS
S31memory_gc quiescencePASS
S32memory_inbox + notifyPASS
S33memory_subscribe pub/subPASS
S34memory_pending governancePASS
S35memory_namespace standardsPASS
S36memory_session_startPASS
S37memory_get_links bidirectionalPASS
S38/export + /importPASS
S39/sync/since deltaPASS
S40/memories/bulkPASS
S41/metrics PrometheusPASS
S42/namespaces enumerationPASS

Scenario 1 — Per-agent write + read (MCP stdio) PASS

scenario-1.json (report)
{
	"agent_group": "hermes",
	"expected_per_reader": 20,
	"pass": true,
	"per_agent": {
		"ai:alice": {
			"recall": 20
		},
		"ai:bob": {
			"recall": 20
		},
		"ai:charlie": {
			"recall": 20
		}
	},
	"per_namespace_node4": {
		"scenario1-ai:alice": {
			"count": 10,
			"wrong_agent_id": 0
		},
		"scenario1-ai:bob": {
			"count": 10,
			"wrong_agent_id": 0
		},
		"scenario1-ai:charlie": {
			"count": 10,
			"wrong_agent_id": 0
		}
	},
	"reasons": [],
	"scenario": "1",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-1.log (console trace)
phase A: each agent writes 10 memories via MCP
  ai:alice on a2a-node-1
  ai:bob on a2a-node-2
  ai:charlie on a2a-node-3
settle 15s for W=2/N=4 convergence
phase B: each agent counts rows in the OTHER two namespaces
  ai:alice recalled 20 rows from the other two namespaces
  ai:bob recalled 20 rows from the other two namespaces
  ai:charlie recalled 20 rows from the other two namespaces
phase C: cross-cluster identity check on node-4
  ns=scenario1-ai:alice count=10 wrong_agent_id=0
  ns=scenario1-ai:bob count=10 wrong_agent_id=0
  ns=scenario1-ai:charlie count=10 wrong_agent_id=0

raw file

Scenario 1b — Per-agent write + read (HTTP) PASS

scenario-1b.json (report)
{
	"agent_group": "hermes",
	"expected_per_reader": 20,
	"pass": true,
	"path": "serve-http",
	"per_agent": {
		"ai:alice": {
			"recall": 20
		},
		"ai:bob": {
			"recall": 20
		},
		"ai:charlie": {
			"recall": 20
		}
	},
	"per_namespace_node4": {
		"scenario1b-ai:alice": {
			"count": 10,
			"wrong_agent_id": 0
		},
		"scenario1b-ai:bob": {
			"count": 10,
			"wrong_agent_id": 0
		},
		"scenario1b-ai:charlie": {
			"count": 10,
			"wrong_agent_id": 0
		}
	},
	"reasons": [],
	"scenario": "1b",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-1b.log (console trace)
phase A: each agent POSTs 10 memories to local serve
  ai:alice on a2a-node-1
  ai:bob on a2a-node-2
  ai:charlie on a2a-node-3
settle 15s for W=2/N=4 convergence
phase B: count rows in other two namespaces via local serve HTTP
  ai:alice sees 20 rows from the other two namespaces
  ai:bob sees 20 rows from the other two namespaces
  ai:charlie sees 20 rows from the other two namespaces
phase C: cross-cluster identity check on node-4
  ns=scenario1b-ai:alice count=10 wrong_agent_id=0
  ns=scenario1b-ai:bob count=10 wrong_agent_id=0
  ns=scenario1b-ai:charlie count=10 wrong_agent_id=0

raw file

Scenario 2 — Shared-context handoff PASS

scenario-2.json (report)
{
	"ack_uuid": "a-aee7399a0a1f4a498e85400796bc1990",
	"agent_group": "hermes",
	"handoff_uuid": "h-c38050104d9b4ff8aa4455771ea46fc7",
	"pass": true,
	"path": "serve-http",
	"per_agent": {
		"ai:alice": {
			"sees_ack": 1
		},
		"ai:bob": {
			"sees_handoff": 1
		}
	},
	"reasons": [],
	"scenario": "2",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-2.log (console trace)
phase A: ai:alice writes handoff to ai:bob (uuid=h-c38050104d9b4ff8aa4455771ea46fc7)
settle 8s for quorum fanout
phase B: ai:bob reads handoff on node-2
  ai:bob sees 1 handoff memories from ai:alice
phase C: ai:bob writes acknowledgement (uuid=a-aee7399a0a1f4a498e85400796bc1990)
settle 8s for reverse-direction fanout
phase D: ai:alice reads ack on node-1
  ai:alice sees 1 ack memories from ai:bob

raw file

Scenario 4 — Federation-aware concurrent writes PASS

scenario-4.json (report)
{
	"agent_group": "hermes",
	"expected_per_agent": 30,
	"pass": true,
	"per_agent": {
		"ai:alice": {
			"count": 30,
			"wrong_agent_id": 0
		},
		"ai:bob": {
			"count": 30,
			"wrong_agent_id": 0
		},
		"ai:charlie": {
			"count": 30,
			"wrong_agent_id": 0
		}
	},
	"reasons": [],
	"scenario": "4",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-4.log (console trace)
phase A: launching concurrent 30-row bursts from 3 agents
  ai:alice burst ok=30/30
  ai:bob burst ok=30/30
  ai:charlie burst ok=30/30
settle 20s for W=2 fanout convergence
phase B: querying node-4 aggregator for per-agent counts
  ai:alice: count=30 (expected 30) wrong_agent_id=0
  ai:bob: count=30 (expected 30) wrong_agent_id=0
  ai:charlie: count=30 (expected 30) wrong_agent_id=0

raw file

Scenario 5 — Consolidation + curation PASS

scenario-5.json (report)
{
	"agent_group": "hermes",
	"consolidate_http_code": 201,
	"consolidated_from_agents": [
		"ai:charlie",
		"ai:bob",
		"ai:alice"
	],
	"consolidated_id": "36197460-308b-496d-882d-f9ff0b4ba32e",
	"pass": true,
	"reasons": [],
	"scenario": "5",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-5.log (console trace)
phase A: each agent writes 3 related memories
  ai:alice on a2a-node-1
  ai:bob on a2a-node-2
  ai:charlie on a2a-node-3
settle 8s for quorum fanout
phase B: collect source ids on node-1, then trigger consolidate
  source ids (count=9): ['c1a2a253-6aee-4e40-93c5-f9cd122de1e2', '9899c71f-ff44-45b1-a68a-963a3b60e31d', 'd2024513-a086-414e-a6ab-9b853e3ef86c', 'd6c03450-7421-469f-9fd2-93873b1b3a25', '159a39b1-e5fd-437c-a12a-e975fbd51613']...
  consolidate HTTP 201, consolidated_id=36197460-308b-496d-882d-f9ff0b4ba32e
settle 10s for consolidation fanout
phase C: verifying consolidated_from_agents on node-4
  consolidated_from_agents=['ai:charlie', 'ai:bob', 'ai:alice']

raw file

Scenario 6 — Contradiction detection PASS

scenario-6.json (report)
{
	"agent_group": "hermes",
	"alice_id": "7427f51e-a114-428a-8a19-c1680ec4c4ce",
	"bob_id": "3dff9270-7247-49fa-adc8-84744210bc1f",
	"charlie_sees_both_memories": true,
	"charlie_sees_contradicts_link": true,
	"detect_http_code": 200,
	"pass": true,
	"reasons": [],
	"scenario": "6",
	"skipped": false,
	"tls_mode": "off",
	"topic": "sky-color-b524049e"
}

raw file

scenario-6.log (console trace)
alice writes claim: "sky-color-b524049e is blue" on node-1
bob writes contradicting claim: "sky-color-b524049e is red" on node-2
  alice.id=7427f51e-a114-428a-8a19-c1680ec4c4ce bob.id=3dff9270-7247-49fa-adc8-84744210bc1f
settle 10s for quorum fanout + contradiction indexing
charlie queries /api/v1/contradictions on node-3
  HTTP 200
  sees both memories: True; sees contradicts link: True

raw file

Scenario 9 — Mutation round-trip PASS

scenario-9.json (report)
{
	"agent_group": "hermes",
	"charlie_view": {
		"agent_id": "ai:alice",
		"content": "v2-45ca2b21a63e4fb1a499a8a4a60fc591"
	},
	"m1_id": "b9ad0c09-1508-4ab3-b8e3-029fceb60722",
	"pass": true,
	"put_http_code": 200,
	"reasons": [],
	"scenario": "9",
	"skipped": false,
	"tls_mode": "off",
	"v1_uuid": "v1-634d3fa60c9647f69e3bb9c7dab27b9f",
	"v2_uuid": "v2-45ca2b21a63e4fb1a499a8a4a60fc591"
}

raw file

scenario-9.log (console trace)
alice writes M1 content=v1-634d3fa60c9647f69e3bb9c7dab27b9f on node-1
  M1 id=b9ad0c09-1508-4ab3-b8e3-029fceb60722
settle 5s for initial replication
bob updates M1 content=v2-45ca2b21a63e4fb1a499a8a4a60fc591 on node-2 via PUT
  PUT returned HTTP 200
settle 8s for update fanout
charlie reads M1 on node-3 and checks content + provenance
  charlie sees content="v2-45ca2b21a63e4fb1a499a8a4a60fc591" agent_id="ai:alice"

raw file

Scenario 10 — Deletion propagation PASS

scenario-10.json (report)
{
	"agent_group": "hermes",
	"delete_http_code": 200,
	"m1_id": "bed18447-3c13-45df-a8a5-9b1a2bbd9966",
	"pass": true,
	"post_delete_hits": {
		"node-2": 0,
		"node-3": 0,
		"node-4": 0
	},
	"post_delete_still_visible_peers": 0,
	"pre_delete_visible_peers": 3,
	"reasons": [],
	"scenario": "10",
	"skipped": false,
	"tls_mode": "off",
	"uuid": "d-80be808b047c47ef8e2c7f2d526b6767"
}

raw file

scenario-10.log (console trace)
alice writes M1 content=d-80be808b047c47ef8e2c7f2d526b6767 on node-1
  created memory id=bed18447-3c13-45df-a8a5-9b1a2bbd9966
settle 8s for pre-delete fanout
pre-delete: verifying M1 is visible on all peers
  pre-delete node-2 sees 1
  pre-delete node-3 sees 1
  pre-delete node-4 sees 1
alice deletes M1 on node-1
  DELETE returned HTTP 200
settle 15s for tombstone propagation
post-delete: verifying M1 is GONE from all peers
  post-delete node-2 sees 0 (expected 0)
  post-delete node-3 sees 0 (expected 0)
  post-delete node-4 sees 0 (expected 0)

raw file

Scenario 11 — Link integrity PASS

scenario-11.json (report)
{
	"agent_group": "hermes",
	"charlie_sees_link": 1,
	"link_http_code": 201,
	"m1_id": "94dac2d2-7cbc-4e8b-a9a9-087f91ceafb8",
	"m2_id": "1b0c9c5d-398e-43a4-a980-0b80a72cd939",
	"pass": true,
	"reasons": [],
	"relation": "related_to",
	"scenario": "11",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-11.log (console trace)
alice writes M1 on node-1
bob writes M2 on node-2
  M1=94dac2d2-7cbc-4e8b-a9a9-087f91ceafb8 M2=1b0c9c5d-398e-43a4-a980-0b80a72cd939
settle 5s for pre-link replication
alice links M1 -> M2 with relation=related_to
  link POST returned HTTP 201
settle 8s for link fanout
charlie queries links of M1 on node-3
  charlie sees M1->M2 link: 1 (expected >=1)

raw file

Scenario 12 — Agent registration PASS

scenario-12.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"peers_see": {
		"node_2": 1,
		"node_3": 1,
		"node_4": 1
	},
	"reasons": [],
	"register_http_code": 201,
	"registered_agent": "ai:dave-probe-ce3472e1",
	"scenario": "12",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-12.log (console trace)
alice registers new agent ai:dave-probe-ce3472e1 on node-1
  POST /api/v1/agents returned HTTP 201
settle 10s for agent-list fanout
  node-2 sees ai:dave-probe-ce3472e1: 1 (expected >=1)
  node-3 sees ai:dave-probe-ce3472e1: 1 (expected >=1)
  node-4 sees ai:dave-probe-ce3472e1: 1 (expected >=1)

raw file

Scenario 13 — Concurrent write contention PASS

scenario-13.json (report)
{
	"agent_group": "hermes",
	"m1_id": "93843d8d-1be2-4392-9ae4-045b5f7e95c9",
	"pass": true,
	"peer_view": {
		"node_1": "va-edbb7d8ed32a401eaeff908f748852c1",
		"node_2": "va-edbb7d8ed32a401eaeff908f748852c1",
		"node_3": "va-edbb7d8ed32a401eaeff908f748852c1",
		"node_4": "va-edbb7d8ed32a401eaeff908f748852c1"
	},
	"reasons": [],
	"scenario": "13",
	"skipped": false,
	"submitted": {
		"v0": "v0-be595a1a73a44dab8a060c0d5b56308e",
		"vA_alice": "va-edbb7d8ed32a401eaeff908f748852c1",
		"vB_bob": "vb-4a78d026dcbb4ff9b6c8ea046f53d5d2"
	},
	"tls_mode": "off"
}

raw file

scenario-13.log (console trace)
alice writes M1 content=v0-be595a1a73a44dab8a060c0d5b56308e on node-1
  M1 id=93843d8d-1be2-4392-9ae4-045b5f7e95c9
settle 5s for initial replication
alice + bob issue concurrent PUTs (vA=va-edbb7d8ed32a401eaeff908f748852c1 from alice, vB=vb-4a78d026dcbb4ff9b6c8ea046f53d5d2 from bob)
  concurrent PUT results: [(0, {'body': {'access_count': 0, 'confidence': 1.0, 'content': 'va-edbb7d8ed32a401eaeff908f748852c1', 'created_at': '2026-05-04T16:24:50.279269827+00:00', 'expires_at': '2026-05-11T16:24:50.279269827+00:00', 'id': '93843d8d-1be2-4392-9ae4-045b5f7e95c9', 'metadata': {'agent_id': 'ai:alice', 'scenario': '13'}, 'namespace': 'scenario13-contention', 'priority': 5, 'source': 'api', 'tags': [], 'tier': 'mid', 'title': 'm1', 'updated_at': '2026-05-04T16:24:55.417100308+00:00'}, 'http_code': 200}), (0, {'body': {'access_count': 0, 'confidence': 1.0, 'content': 'vb-4a78d026dcbb4ff9b6c8ea046f53d5d2', 'created_at': '2026-05-04T16:24:50.279269827+00:00', 'expires_at': '2026-05-11T16:24:50.279269827+00:00', 'id': '93843d8d-1be2-4392-9ae4-045b5f7e95c9', 'metadata': {'agent_id': 'ai:alice', 'scenario': '13'}, 'namespace': 'scenario13-contention', 'priority': 5, 'source': 'api', 'tags': [], 'tier': 'mid', 'title': 'm1', 'updated_at': '2026-05-04T16:24:55.415773997+00:00'}, 'http_code': 200})]
settle 10s for quorum convergence
  node-1 sees content=va-edbb7d8ed32a401eaeff908f748852c1
  node-2 sees content=va-edbb7d8ed32a401eaeff908f748852c1
  node-3 sees content=va-edbb7d8ed32a401eaeff908f748852c1
  node-4 sees content=va-edbb7d8ed32a401eaeff908f748852c1

raw file

Scenario 14 — Partition tolerance PASS

scenario-14.json (report)
{
	"agent_group": "hermes",
	"expected_post_recovery": 20,
	"node3_saw": 20,
	"partition_target": "node-3",
	"pass": true,
	"reasons": [],
	"scenario": "14",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-14.log (console trace)
suspending ai-memory on node-3 (SIGSTOP)
  !! ssh timeout (15s): -c pgrep -f 'ai-memory serve' | xargs -r kill -STOP
settle 2s for process-suspend observe
writing 10 memories each from alice + bob during node-3 outage
resuming ai-memory on node-3 (SIGCONT)
settle 20s for post-partition catchup
checking node-3 caught up
  node-3 sees 20 memories in scenario14-partition (expected 20)

raw file

Scenario 15 — Read-your-writes PASS

scenario-15.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"reasons": [],
	"scenario": "15",
	"skipped": false,
	"tls_mode": "off",
	"uuid": "ryw-862a62e09f05421b859ff06626d9f83c",
	"writer_sees_own_write": 1
}

raw file

scenario-15.log (console trace)
alice writes + immediately reads M1 on node-1 (uuid=ryw-862a62e09f05421b859ff06626d9f83c)
  alice sees 1 (expected 1) immediately after write

raw file

Scenario 16 — Tier promotion PASS

scenario-16.json (report)
{
	"agent_group": "hermes",
	"bob_sees_tier": "long",
	"m1_id": "1cfac47a-bf65-408b-bfde-65e8b618c15f",
	"pass": true,
	"promote_http_code": 200,
	"reasons": [],
	"scenario": "16",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-16.log (console trace)
alice writes M1 tier=short on node-1
  M1 id=1cfac47a-bf65-408b-bfde-65e8b618c15f
settle 5s for pre-promote replication
alice promotes M1 to tier=long
  promote returned HTTP 200
settle 8s for promotion fanout
  bob sees tier=long (expected long)

raw file

Scenario 17 — Stats consistency PASS

scenario-17.json (report)
{
	"agent_group": "hermes",
	"expected_count": 15,
	"pass": true,
	"per_peer": {
		"node_1": 15,
		"node_2": 15,
		"node_3": 15,
		"node_4": 15
	},
	"reasons": [],
	"scenario": "17",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-17.log (console trace)
phase A: each of 3 agents writes 5 memories to scenario17-stats
  ai:alice on a2a-node-1
  ai:bob on a2a-node-2
  ai:charlie on a2a-node-3
settle 15s for W=2 fanout
phase B: querying count on every peer
  node-1 count=15 (expected 15)
  node-2 count=15 (expected 15)
  node-3 count=15 (expected 15)
  node-4 count=15 (expected 15)

raw file

Scenario 18 — Semantic query expansion PASS

scenario-18.json (report)
{
	"agent_group": "hermes",
	"diag_list_alice_present": 1,
	"diag_list_bob_present": 1,
	"diag_node3_embedding_probe": "dawn-walk|1537|BYTES | ridge-strides|1537|BYTES",
	"pass": true,
	"query": "morning outdoor exercise routine",
	"reasons": [],
	"recall_mode": "hybrid",
	"rows_in_recall": 2,
	"scenario": "18",
	"skipped": false,
	"tls_mode": "off",
	"writers": [
		{
			"agent": "ai:alice",
			"marker": "alice-sunrise-c6ebde76",
			"seen_by_charlie": 1
		},
		{
			"agent": "ai:bob",
			"marker": "bob-daybreak-1924fb1a",
			"seen_by_charlie": 1
		}
	]
}

raw file

scenario-18.log (console trace)
alice writes A on node-1
bob writes B on node-2
polling node-3 for both writes to propagate (max 30 s)
  both writes visible after 1 s
settle 3s for embedder + HNSW catch-up
  node-3 DB embedding probe: 'dawn-walk|1537|BYTES | ridge-strides|1537|BYTES'
charlie queries on node-3 with semantically-related prompt
  recall mode=hybrid returned 2 rows
  charlie sees alice's memory: 1 (expected >=1)
  charlie sees bob's memory: 1 (expected >=1)

raw file

Scenario 22 — Identity spoofing resistance PASS

scenario-22.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"reasons": [],
	"scenario": "22",
	"skipped": false,
	"tests": {
		"body_vs_header_conflict": {
			"acceptable": [
				"ai:body-wins",
				"ai:attacker"
			],
			"stored_agent_id": "ai:attacker"
		},
		"header_only": {
			"expected": "ai:alice",
			"stored_agent_id": "ai:alice"
		}
	},
	"tls_mode": "off"
}

raw file

scenario-22.log (console trace)
test 1: header-only X-Agent-Id=ai:alice
settle 2s for read-settle
  stored metadata.agent_id for header-only write: ai:alice (expected ai:alice)
test 2: body.metadata.agent_id=ai:body-wins vs X-Agent-Id=ai:attacker
settle 2s for read-settle
  stored metadata.agent_id for body+header conflict: ai:attacker

raw file

Scenario 23 — Malicious content fuzz PASS

scenario-23.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"payloads": {
		"html": {
			"input_bytes": 66,
			"roundtrip_bytes": 66,
			"write_http": 201
		},
		"oversize": {
			"input_bytes": 1048576,
			"roundtrip_bytes": 0,
			"write_http": 400
		},
		"sql": {
			"input_bytes": 61,
			"roundtrip_bytes": 61,
			"write_http": 201
		},
		"unicode": {
			"input_bytes": 19,
			"roundtrip_bytes": 19,
			"write_http": 201
		}
	},
	"payloads_note": "accept+faithful OR 4xx reject both acceptable for oversize",
	"reasons": [],
	"scenario": "23",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-23.log (console trace)
payload sql: 61 bytes
payload html: 66 bytes
payload oversize: 1048576 bytes
  oversize: server rejected oversize with HTTP 400 (acceptable)
payload unicode: 19 bytes

raw file

Scenario 24 — Byzantine peer PASS

scenario-24.json (report)
{
	"agent_group": "hermes",
	"byzantine_marker": "bz-16e588d0369546319d6e5cd1e41d06f5",
	"pass": true,
	"reasons": [],
	"scenario": "24",
	"skipped": false,
	"stored_metadata_agent_id": "REJECTED_BY_SERVER",
	"sync_push_http_code": "422",
	"tls_mode": "off"
}

raw file

scenario-24.log (console trace)
node-2 sends sync_push to node-3 claiming sender_agent_id=ai:alice
  sync_push returned HTTP 422
settle 5s for server-side sync apply
  node-3 stored metadata.agent_id=ABSENT (declared: ai:alice)
  sync_push rejected HTTP 422 — stricter-than-spec, acceptable

raw file

Scenario 25 — Clock skew tolerance PASS

scenario-25.json (report)
{
	"agent_group": "hermes",
	"clock_offset_seconds": 300,
	"marker": "ck-286f4966853b43c9945775e46869c182",
	"pass": true,
	"reasons": [],
	"scenario": "25",
	"seen_on": {
		"node_1": 1,
		"node_3": 1
	},
	"skipped": false,
	"target_node": "node-3",
	"tls_mode": "off"
}

raw file

scenario-25.log (console trace)
shifting node-3 clock +300s (NTP disabled for the duration)
  node-3 now reports: Mon May  4 16:26:31 UTC 2026
alice writes on node-1 (normal clock); waiting for quorum fanout to skewed node-3
settle 15s for skewed-peer convergence
  node-3 (+300s clock) sees marker: 1 (expected >=1)
  node-1 sees marker: 1 (expected >=1)
reverting node-3 clock

raw file

Scenario 28 — memory_search keyword PASS

scenario-28.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"peer_hits": {
		"node_2": 1,
		"node_3": 1
	},
	"reasons": [],
	"scenario": "28",
	"skipped": false,
	"tls_mode": "off",
	"token": "kwsearch551ef60f5f"
}

raw file

scenario-28.log (console trace)
alice writes a row containing unique token=kwsearch551ef60f5f
settle 8s for search index populate + fanout
bob + charlie call /api/v1/search with the exact token
  node-2 keyword search returned 1 hits
  node-3 keyword search returned 1 hits

raw file

Scenario 29 — memory_archive lifecycle PASS

scenario-29.json (report)
{
	"agent_group": "hermes",
	"archive_http_code": 200,
	"bob_sees_archived": true,
	"m1_id": "e4c0bee5-17af-4149-9dc3-27bfd31ee439",
	"node4_active_rows": 1,
	"pass": true,
	"reasons": [],
	"restore_http_code": 200,
	"scenario": "29",
	"skipped": false,
	"stats_shape_ok": true,
	"tls_mode": "off"
}

raw file

scenario-29.log (console trace)
alice writes M1 on node-1
  M1 id=e4c0bee5-17af-4149-9dc3-27bfd31ee439
settle 5s for pre-archive replication
alice archives M1 via POST /api/v1/archive (ai-memory-mcp PR #361)
  archive (POST) returned HTTP 200
settle 5s for archive propagation
bob queries /api/v1/archive on node-2
  bob sees M1 in archive: True
charlie restores M1 via /api/v1/archive/{id}/restore on node-3
  restore returned HTTP 200
settle 5s for restore propagation
node-4 aggregator: M1 must be active again
  node-4 active rows matching marker: 1
fetch /api/v1/archive/stats on node-4

raw file

Scenario 30 — memory_capabilities handshake PASS

scenario-30.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"peer_views": {
		"node_1": {
			"_path": "/api/v1/capabilities",
			"approval": {
				"pending_requests": 0
			},
			"compaction": {
				"enabled": false,
				"planned": true,
				"version": "v0.8+"
			},
			"features": {
				"auto_consolidation": false,
				"auto_tagging": false,
				"contradiction_analysis": false,
				"cross_encoder_reranking": false,
				"embedder_loaded": true,
				"hybrid_recall": true,
				"keyword_search": true,
				"memory_reflection": {
					"enabled": false,
					"planned": true,
					"version": "v0.7+"
				},
				"query_expansion": false,
				"recall_mode_active": "hybrid",
				"reranker_active": "off",
				"semantic_search": true
			},
			"hnsw": {
				"evicted_recently": false,
				"evictions_total": 0
			},
			"hooks": {
				"registered_count": 0,
				"webhook_events": [
					"memory_store",
					"memory_promote",
					"memory_delete",
					"memory_link_created",
					"memory_consolidated"
				]
			},
			"models": {
				"cross_encoder": "none",
				"embedding": "sentence-transformers/all-MiniLM-L6-v2",
				"embedding_dim": 384,
				"llm": "none"
			},
			"permissions": {
				"active_rules": 0,
				"inheritance": "enforced",
				"mode": "advisory"
			},
			"schema_version": "2",
			"tier": "semantic",
			"transcripts": {
				"enabled": false,
				"planned": true,
				"version": "v0.7+"
			},
			"version": "0.6.3+patch.1"
		},
		"node_2": {
			"_path": "/api/v1/capabilities",
			"approval": {
				"pending_requests": 0
			},
			"compaction": {
				"enabled": false,
				"planned": true,
				"version": "v0.8+"
			},
			"features": {
				"auto_consolidation": false,
				"auto_tagging": false,
				"contradiction_analysis": false,
				"cross_encoder_reranking": false,
				"embedder_loaded": true,
				"hybrid_recall": true,
				"keyword_search": true,
				"memory_reflection": {
					"enabled": false,
					"planned": true,
					"version": "v0.7+"
				},
				"query_expansion": false,
				"recall_mode_active": "hybrid",
				"reranker_active": "off",
				"semantic_search": true
			},
			"hnsw": {
				"evicted_recently": false,
				"evictions_total": 0
			},
			"hooks": {
				"registered_count": 0,
				"webhook_events": [
					"memory_store",
					"memory_promote",
					"memory_delete",
					"memory_link_created",
					"memory_consolidated"
				]
			},
			"models": {
				"cross_encoder": "none",
				"embedding": "sentence-transformers/all-MiniLM-L6-v2",
				"embedding_dim": 384,
				"llm": "none"
			},
			"permissions": {
				"active_rules": 0,
				"inheritance": "enforced",
				"mode": "advisory"
			},
			"schema_version": "2",
			"tier": "semantic",
			"transcripts": {
				"enabled": false,
				"planned": true,
				"version": "v0.7+"
			},
			"version": "0.6.3+patch.1"
		},
		"node_3": {
			"_path": "/api/v1/capabilities",
			"approval": {
				"pending_requests": 0
			},
			"compaction": {
				"enabled": false,
				"planned": true,
				"version": "v0.8+"
			},
			"features": {
				"auto_consolidation": false,
				"auto_tagging": false,
				"contradiction_analysis": false,
				"cross_encoder_reranking": false,
				"embedder_loaded": true,
				"hybrid_recall": true,
				"keyword_search": true,
				"memory_reflection": {
					"enabled": false,
					"planned": true,
					"version": "v0.7+"
				},
				"query_expansion": false,
				"recall_mode_active": "hybrid",
				"reranker_active": "off",
				"semantic_search": true
			},
			"hnsw": {
				"evicted_recently": false,
				"evictions_total": 0
			},
			"hooks": {
				"registered_count": 0,
				"webhook_events": [
					"memory_store",
					"memory_promote",
					"memory_delete",
					"memory_link_created",
					"memory_consolidated"
				]
			},
			"models": {
				"cross_encoder": "none",
				"embedding": "sentence-transformers/all-MiniLM-L6-v2",
				"embedding_dim": 384,
				"llm": "none"
			},
			"permissions": {
				"active_rules": 0,
				"inheritance": "enforced",
				"mode": "advisory"
			},
			"schema_version": "2",
			"tier": "semantic",
			"transcripts": {
				"enabled": false,
				"planned": true,
				"version": "v0.7+"
			},
			"version": "0.6.3+patch.1"
		},
		"node_4": {
			"_path": "/api/v1/capabilities",
			"approval": {
				"pending_requests": 0
			},
			"compaction": {
				"enabled": false,
				"planned": true,
				"version": "v0.8+"
			},
			"features": {
				"auto_consolidation": false,
				"auto_tagging": false,
				"contradiction_analysis": false,
				"cross_encoder_reranking": false,
				"embedder_loaded": true,
				"hybrid_recall": true,
				"keyword_search": true,
				"memory_reflection": {
					"enabled": false,
					"planned": true,
					"version": "v0.7+"
				},
				"query_expansion": false,
				"recall_mode_active": "hybrid",
				"reranker_active": "off",
				"semantic_search": true
			},
			"hnsw": {
				"evicted_recently": false,
				"evictions_total": 0
			},
			"hooks": {
				"registered_count": 0,
				"webhook_events": [
					"memory_store",
					"memory_promote",
					"memory_delete",
					"memory_link_created",
					"memory_consolidated"
				]
			},
			"models": {
				"cross_encoder": "none",
				"embedding": "sentence-transformers/all-MiniLM-L6-v2",
				"embedding_dim": 384,
				"llm": "none"
			},
			"permissions": {
				"active_rules": 0,
				"inheritance": "enforced",
				"mode": "advisory"
			},
			"schema_version": "2",
			"tier": "semantic",
			"transcripts": {
				"enabled": false,
				"planned": true,
				"version": "v0.7+"
			},
			"version": "0.6.3+patch.1"
		}
	},
	"reasons": [],
	"scenario": "30",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-30.log (console trace)
  node-1 capabilities: ['approval', 'compaction', 'features', 'hnsw', 'hooks', 'models', 'permissions', 'schema_version', 'tier', 'transcripts', 'version', '_path']
  node-2 capabilities: ['approval', 'compaction', 'features', 'hnsw', 'hooks', 'models', 'permissions', 'schema_version', 'tier', 'transcripts', 'version', '_path']
  node-3 capabilities: ['approval', 'compaction', 'features', 'hnsw', 'hooks', 'models', 'permissions', 'schema_version', 'tier', 'transcripts', 'version', '_path']
  node-4 capabilities: ['approval', 'compaction', 'features', 'hnsw', 'hooks', 'models', 'permissions', 'schema_version', 'tier', 'transcripts', 'version', '_path']

raw file

Scenario 31 — memory_gc quiescence PASS

scenario-31.json (report)
{
	"agent_group": "hermes",
	"expected_live": 2,
	"forget_http_code": 400,
	"gc_http_code": 200,
	"live_markers_per_peer": {
		"node_1": 2,
		"node_2": 2,
		"node_3": 2,
		"node_4": 2
	},
	"pass": true,
	"reasons": [],
	"scenario": "31",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-31.log (console trace)
alice writes 4 memories
settle 6s for pre-gc replication
alice forgets 2 via /api/v1/forget
  forget returned HTTP 400
settle 5s for forget propagation
bob triggers /api/v1/gc on node-2
  gc returned HTTP 200
settle 8s for post-gc settle
verify remaining 2 markers are still readable on every peer
  node-1 sees 2/2 live markers
  node-2 sees 2/2 live markers
  node-3 sees 2/2 live markers
  node-4 sees 2/2 live markers

raw file

Scenario 32 — memory_inbox + notify PASS

scenario-32.json (report)
{
	"agent_group": "hermes",
	"bob_inbox_count": 1,
	"bob_sees_marker": true,
	"charlie_inbox_count": 0,
	"charlie_sees_marker": false,
	"marker": "inb-4e5c35a434c848f4b457d3cd677a5f55",
	"notify_http_code": 201,
	"pass": true,
	"reasons": [],
	"scenario": "32",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-32.log (console trace)
alice calls /api/v1/notify → target=ai:bob
  notify returned HTTP 201
settle 6s for notification fanout
bob queries his inbox on node-2
  bob inbox has 1 messages; sees marker: True
charlie queries his inbox on node-3 (must NOT see it)
  charlie inbox has 0 messages; sees marker: False

raw file

Scenario 33 — memory_subscribe pub/sub PASS

scenario-33.json (report)
{
	"agent_group": "hermes",
	"m1_delivered": 1,
	"namespace": "scenario33-pubsub-930cca",
	"ns_in_subs_after": false,
	"ns_in_subs_before": true,
	"pass": true,
	"reasons": [],
	"scenario": "33",
	"skipped": false,
	"subscribe_http_code": 201,
	"subscriptions_after_count": 0,
	"subscriptions_before_count": 1,
	"tls_mode": "off",
	"unsubscribe_http_code": 200
}

raw file

scenario-33.log (console trace)
bob subscribes to namespace scenario33-pubsub-930cca on node-2
  subscribe returned HTTP 201
settle 2s for subscription settle
  bob subscriptions: 1 entries; contains ns: True
alice writes M1 into the subscribed namespace
settle 6s for write fanout to subscribers
  bob sees M1 in subscribed namespace: 1
bob unsubscribes from scenario33-pubsub-930cca
  unsubscribe returned HTTP 200
settle 2s for unsubscribe settle
  bob subscriptions after unsubscribe: ns still present = False
alice writes M2 post-unsubscribe (may still replicate via federation but subscription list excludes ns)
settle 5s for post-unsubscribe settle

raw file

Scenario 34 — memory_pending governance PASS

scenario-34.json (report)
{
	"agent_group": "hermes",
	"approve_http_code": 200,
	"charlie_sees": {
		"approved": 1,
		"rejected": 0
	},
	"namespace": "scenario34-pending-77644a",
	"pass": true,
	"pending_queue_count": 2,
	"reasons": [],
	"reject_http_code": 200,
	"scenario": "34",
	"set_standard_http_code": 201,
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-34.log (console trace)
alice sets namespace standard on scenario34-pending-77644a: write=approve, approver=ai:bob
  set-standard returned HTTP 201
settle 2s for standard settle
alice writes two memories into the governed namespace (should land in pending)
  p1=456a2b97-44b7-4cd6-8db9-1f88c37fc761 p2=2cbeb9d9-9afe-4dc1-8150-77b9b86e4d91
settle 4s for pending queue settle
bob lists pending on node-2
  pending queue has 2 entries
bob approves p1, rejects p2
  approve HTTP 200; reject HTTP 200
settle 5s for decision fanout
charlie reads the namespace — expects ONLY approved marker
  charlie sees approved=1 rejected=0

raw file

Scenario 35 — memory_namespace standards PASS

scenario-35.json (report)
{
	"agent_group": "hermes",
	"child_ns": "scenario35-parent-bf7594/child",
	"clear_http_code": 200,
	"get_standard_http_code": 200,
	"parent_ns": "scenario35-parent-bf7594",
	"pass": true,
	"post_clear_has_child_rule": false,
	"reasons": [],
	"scenario": "35",
	"sees_child_rule": true,
	"sees_parent_rule": true,
	"set_child_http_code": 201,
	"set_parent_http_code": 201,
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-35.log (console trace)
alice writes parent-standard-memory on node-1
alice sets namespace standard on scenario35-parent-bf7594
  set-parent returned HTTP 201
alice writes child-standard-memory on node-1
alice sets namespace standard on scenario35-parent-bf7594/child with parent=scenario35-parent-bf7594
  set-child returned HTTP 201
settle 4s for standard fanout
bob gets standard for scenario35-parent-bf7594/child on node-2 (expects layered parent+child)
  get-standard returned HTTP 200
  parent-rule visible=True; child-rule visible=True
alice clears standard on scenario35-parent-bf7594/child
  clear returned HTTP 200
settle 3s for clear settle

raw file

Scenario 36 — memory_session_start PASS

scenario-36.json (report)
{
	"agent_group": "hermes",
	"pass": true,
	"reasons": [],
	"scenario": "36",
	"session_id": "5b319eb3-f437-4e8b-9761-66ca4181e8d6",
	"session_tagged_rows_on_bob": 2,
	"skipped": false,
	"start_http_code": 200,
	"tls_mode": "off"
}

raw file

scenario-36.log (console trace)
alice starts a session on node-1
  session_start returned HTTP 200, session_id=5b319eb3-f437-4e8b-9761-66ca4181e8d6
alice writes 2 memories tagged with session_id
settle 6s for session-tagged fanout
bob lists on node-2 filtered by session_id=5b319eb3-f437-4e8b-9761-66ca4181e8d6
  bob sees 2 rows tagged session_id=5b319eb3-f437-4e8b-9761-66ca4181e8d6 (expected 2)

raw file

Scenario 37 — memory_get_links bidirectional PASS

scenario-37.json (report)
{
	"agent_group": "hermes",
	"forward_has_target": true,
	"m1": "b8995d34-a461-4a91-b2d8-12ca36b59d08",
	"m2": "8d938c89-7fd1-440f-a5ed-ab5d3d4c6a32",
	"pass": true,
	"reasons": [],
	"reverse_has_source": true,
	"scenario": "37",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-37.log (console trace)
alice writes M1 + M2 + links M1→M2
  M1=b8995d34-a461-4a91-b2d8-12ca36b59d08 M2=8d938c89-7fd1-440f-a5ed-ab5d3d4c6a32
settle 6s for link fanout
charlie queries /api/v1/links/M1 (forward)
charlie queries /api/v1/links/M2 (reverse)

raw file

Scenario 38 — /export + /import PASS

scenario-38.json (report)
{
	"agent_group": "hermes",
	"dst_ns": "scenario38-dst-96d5dd",
	"expected_rows": 5,
	"export_http_code": 200,
	"import_http_code": 200,
	"markers_preserved": 5,
	"pass": true,
	"reasons": [],
	"rows_exported": 5,
	"rows_in_destination": 5,
	"scenario": "38",
	"skipped": false,
	"src_ns": "scenario38-src-96d5dd",
	"tls_mode": "off"
}

raw file

scenario-38.log (console trace)
alice writes 5 rows into scenario38-src-96d5dd
settle 4s for pre-export replication
alice exports on node-1 (endpoint has no namespace filter; filter client-side)
  export returned HTTP 200, total_rows=229
  rewrote 5 memories from scenario38-src-96d5dd -> scenario38-dst-96d5dd
bob imports the payload into scenario38-dst-96d5dd on node-2
  import returned HTTP 200
settle 6s for import + fanout
verify row counts match on destination
  scenario38-dst-96d5dd has 5 rows (expected 5)
  markers preserved in destination: 5/5

raw file

Scenario 39 — /sync/since delta PASS

scenario-39.json (report)
{
	"agent_group": "hermes",
	"checkpoint": "2026-05-04T16:28:39+00:00",
	"diag_curl_body_head": "{\"count\":6,\"earliest_updated_at\":\"2026-05-04T16:29:11.377504297+00:00\",\"latest_updated_at\":\"2026-05-04T16:29:12.069018988+00:00\",\"limit\":500,\"memories\":[{\"access_count\":0,\"confidence\":1.0,\"content\":\"marker=delta-0-8b5811773e3e474c83173f3c035e5768\",\"created_at\":\"2026-05-04T16:29:11.377504297+00:00\",\"",
	"diag_curl_exit": 0,
	"diag_curl_http_code": 200,
	"diag_curl_stderr": "",
	"diag_earliest_updated_at": "2026-05-04T16:29:11.377504297+00:00",
	"diag_latest_updated_at": "2026-05-04T16:29:12.069018988+00:00",
	"diag_node3_health_reachable": true,
	"diag_updated_since": "2026-05-04T16:28:39+00:00",
	"expected_markers": 6,
	"markers_present": 6,
	"namespace": "scenario39-delta-c8fc82",
	"pass": true,
	"reasons": [],
	"rows_returned": 6,
	"rows_returned_raw": 6,
	"scenario": "39",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-39.log (console trace)
checkpoint = 2026-05-04T16:28:39+00:00
suspending ai-memory on node-3
  !! ssh timeout (30s): -c pgrep -f 'ai-memory serve' | xargs -r kill -STOP
alice + bob write 6 rows while node-3 is out
resuming ai-memory on node-3
settle 15s for process resume + federation catchup
  node-3 → node-1 health reachable: True (after 1 probes)
node-3 asks node-1 /api/v1/sync/since?since=2026-05-04T16:28:39+00:00
  curl exit=0 http_code=200 body_len=2898 stderr=''
  /sync/since raw=6 ns-filtered=6; 6/6 match our markers
  diag: updated_since=2026-05-04T16:28:39+00:00 earliest=2026-05-04T16:29:11.377504297+00:00 latest=2026-05-04T16:29:12.069018988+00:00

raw file

Scenario 40 — /memories/bulk PASS

scenario-40.json (report)
{
	"agent_group": "hermes",
	"bulk_http_code": "200",
	"bulk_size": 500,
	"namespace": "scenario40-bulk-cc47f2",
	"pass": true,
	"per_peer_count": {
		"node_2": 500,
		"node_3": 500,
		"node_4": 500
	},
	"reasons": [],
	"scenario": "40",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-40.log (console trace)
constructing 500-row bulk payload
staging bulk payload on node-1 /tmp, then POST /api/v1/memories/bulk
  bulk POST returned HTTP 200
settle 20s for bulk fanout across 3 peers + aggregator
  node-2 count=500 (expected 500)
  node-3 count=500 (expected 500)
  node-4 count=500 (expected 500)

raw file

Scenario 41 — /metrics Prometheus PASS

scenario-41.json (report)
{
	"activity_namespace": "scenario41-activity-194706",
	"agent_group": "hermes",
	"pass": true,
	"per_peer": {
		"node_1": {
			"counters_t0": 8,
			"counters_t1": 8,
			"regressed_keys": 0
		},
		"node_2": {
			"counters_t0": 7,
			"counters_t1": 7,
			"regressed_keys": 0
		},
		"node_3": {
			"counters_t0": 7,
			"counters_t1": 7,
			"regressed_keys": 0
		}
	},
	"reasons": [],
	"scenario": "41",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-41.log (console trace)
scrape T0
  node-1 T0 parsed 8 memory counters
  node-2 T0 parsed 7 memory counters
  node-3 T0 parsed 7 memory counters
settle 5s for counter update
scrape T1
  node-1 T1 parsed 8 memory counters
  node-2 T1 parsed 7 memory counters
  node-3 T1 parsed 7 memory counters

raw file

Scenario 42 — /namespaces enumeration PASS

scenario-42.json (report)
{
	"agent_group": "hermes",
	"namespaces": [
		"scenario42-1bffac-0",
		"scenario42-1bffac-1",
		"scenario42-1bffac-2"
	],
	"pass": true,
	"per_peer": {
		"node_1": {
			"scenario42-1bffac-0": 2,
			"scenario42-1bffac-1": 2,
			"scenario42-1bffac-2": 2
		},
		"node_2": {
			"scenario42-1bffac-0": 2,
			"scenario42-1bffac-1": 2,
			"scenario42-1bffac-2": 2
		},
		"node_3": {
			"scenario42-1bffac-0": 2,
			"scenario42-1bffac-1": 2,
			"scenario42-1bffac-2": 2
		},
		"node_4": {
			"scenario42-1bffac-0": 2,
			"scenario42-1bffac-1": 2,
			"scenario42-1bffac-2": 2
		}
	},
	"reasons": [],
	"scenario": "42",
	"skipped": false,
	"tls_mode": "off"
}

raw file

scenario-42.log (console trace)
alice writes into 3 distinct namespaces: ['scenario42-1bffac-0', 'scenario42-1bffac-1', 'scenario42-1bffac-2']
settle 10s for namespace index fanout
  node-1 sees 3/3 target namespaces, counts: {'scenario42-1bffac-0': 2, 'scenario42-1bffac-1': 2, 'scenario42-1bffac-2': 2}
  node-2 sees 3/3 target namespaces, counts: {'scenario42-1bffac-0': 2, 'scenario42-1bffac-1': 2, 'scenario42-1bffac-2': 2}
  node-3 sees 3/3 target namespaces, counts: {'scenario42-1bffac-0': 2, 'scenario42-1bffac-1': 2, 'scenario42-1bffac-2': 2}
  node-4 sees 3/3 target namespaces, counts: {'scenario42-1bffac-0': 2, 'scenario42-1bffac-1': 2, 'scenario42-1bffac-2': 2}

raw file

All artifacts