Skip to content

t3-proactive — grok-4.3 on openclaw (profile=core)

Outcome: PASS Reason: agent pre-checked capabilities and surfaced expansion or operator action Captured: 2026-05-05T01:41:32.178165Z Wall clock: 234753 ms Rounds: 11 Tokens: in=128410 out=634 Model: grok-4-0709

Signals

Signal Value
Called memory_capabilities True
Called capabilities BEFORE power-family True
Received -32601 tool_not_found False
Called --include-schema True (4x)
Completed task (final answer present) True
Families surfaced core, lifecycle, graph, governance, power, meta, archive, other (8/8)
Tool calls memory_capabilities, memory_capabilities, memory_search, memory_recall, memory_capabilities, memory_capabilities, memory_recall, memory_recall, memory_capabilities, memory_recall
Error codes (none)
Grok finish reason stop

Methodology

  • Tier t3-proactive (pass threshold >= 50%)
  • DB baseline: v0.6.3.1 (schema v19) restored from fixtures/corpus/v0.6.3.1-baseline.db.gz per cell
  • Profile: --profile core
  • Binary SHA256: f5abad816bc34c11dfbadf17402ef6c6f08edecb93ae1985de7e9173415be09a
  • MCP wire log SHA256: d97a78eed9619fd6685b33df7ba7113938e0321c7d27c1ef83135ac3c4ee03c4

Final answer (excerpt)

To complete the consolidation of similar memories about Project Aurora, the power family tools are required but not currently loaded. Operator, please restart the MCP server with --profile power added to the command line to load them. I discovered 5 near-duplicate memories (IDs: 551a578b-7cd5-4bc8-bf77-e4a5504f990c, d9a9803f-b613-4924-b23d-7b174410dcae, 6a9254a3-3105-4cf6-a33a-f8fe8fba35c7, 2b822e56-3c42-4b0c-9d7b-d61313075657, 713e00fb-33ad-4339-95ec-c9a51be3bbcb) tagged as consolidation candidates, all describing Project Aurora as a test for T3 consolidation involving v0.6.4 quiet-tools.

Transcript

per-call transcript JSONL