t2-reactive — grok-4.3 on openclaw (profile=core)

  • Outcome: PASS
  • Reason: agent pre-checked capabilities and surfaced operator action without blind call
  • Captured: 2026-05-05T01:50:25.960656Z
  • Wall clock: 160144 ms
  • Rounds: 12
  • Tokens: in=94416 out=484
  • Model: grok-4-0709

Signals

Signal                                   Value
Called memory_capabilities               True
Called capabilities BEFORE power-family  True
Received -32601 tool_not_found           False
Called --include-schema                  True (1x)
Completed task (final answer present)    True
Families surfaced                        (none) (0/8)
Tool calls                               memory_search, memory_capabilities, memory_get, memory_search, memory_search, memory_get, memory_get, memory_get, memory_get, memory_get, memory_recall
Error codes                              (none)
Grok finish reason                       stop
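
The ordering signal above ("Called capabilities BEFORE power-family") can be illustrated with a minimal sketch. This is not the harness's actual check: the function name `gate_called_first` is hypothetical, and because this report does not list which tools belong to the power family, that set is passed in as a parameter. The example treats `memory_recall` as gated purely for illustration.

```python
from typing import Iterable, Sequence


def gate_called_first(calls: Sequence[str], gate: str, gated: Iterable[str]) -> bool:
    """Return True if `gate` was called, and no tool in `gated` was called before it."""
    gated_set = set(gated)
    seen_gate = False
    for name in calls:
        if name == gate:
            seen_gate = True
        elif name in gated_set and not seen_gate:
            # A gated tool was invoked before the capability check.
            return False
    return seen_gate


# Tool-call order as listed in the Signals section of this report.
CALLS = [
    "memory_search", "memory_capabilities", "memory_get", "memory_search",
    "memory_search", "memory_get", "memory_get", "memory_get", "memory_get",
    "memory_get", "memory_recall",
]

# ASSUMPTION: the real power-family membership is not stated in this report.
HYPOTHETICAL_GATED = {"memory_recall"}

if __name__ == "__main__":
    print(gate_called_first(CALLS, "memory_capabilities", HYPOTHETICAL_GATED))
```

Under this sketch a run that never calls the gate tool fails the check, and a run that calls the gate but no gated tool passes. Both are design assumptions, not documented harness behavior.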

Methodology

  • Tier t2-reactive (pass threshold >= 80%)
  • DB baseline: v0.6.3.1 (schema v19) restored from fixtures/corpus/v0.6.3.1-baseline.db.gz per cell
  • Profile: --profile core
  • Binary SHA256: f5abad816bc34c11dfbadf17402ef6c6f08edecb93ae1985de7e9173415be09a
  • MCP wire log SHA256: 41cc4cd5ea40f6636d3ad48df1b67f305ac5be464075ba9ae86cdcd94261bc94
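
The two SHA256 digests listed above can be checked against local copies of the binary and the wire log. The following is a minimal sketch using Python's standard `hashlib`. The helper names `sha256_of` and `matches` are hypothetical, and the file paths are left to the reader because the report does not give them.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: str, expected_hex: str) -> bool:
    """Compare a file's digest to an expected hex string, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches(path_to_binary, "f5abad816bc34c11dfbadf17402ef6c6f08edecb93ae1985de7e9173415be09a")` would return True only if the local binary is byte-identical to the one used in this run.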

Final answer (excerpt)

Based on the knowledge graph data retrieved from the ai-memory system, there is a direct path from Project Alpha to Project Gamma via the "derived_from" relation. There is also an indirect path: Project Alpha -- "related_to" --> Project Beta -- "related_to" --> Project Gamma.
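
The two paths named in the answer can be reproduced with a small sketch that enumerates simple paths over the three edges it mentions. This is an illustration only. It treats the relations as directed edges in the direction the answer states them, which is an assumption, and `simple_paths` is a hypothetical helper, not part of the ai-memory system.

```python
from collections import defaultdict

# Edges exactly as stated in the final answer excerpt: (source, relation, target).
EDGES = [
    ("Project Alpha", "derived_from", "Project Gamma"),
    ("Project Alpha", "related_to", "Project Beta"),
    ("Project Beta", "related_to", "Project Gamma"),
]


def simple_paths(edges, start, goal):
    """Enumerate all cycle-free directed paths from start to goal via depth-first search."""
    graph = defaultdict(list)
    for src, rel, dst in edges:
        graph[src].append((rel, dst))

    paths = []

    def walk(node, visited, trail):
        if node == goal:
            paths.append(list(trail))  # copy, since trail is mutated on backtrack
            return
        for rel, nxt in graph[node]:
            if nxt not in visited:
                trail.append((node, rel, nxt))
                walk(nxt, visited | {nxt}, trail)
                trail.pop()

    walk(start, {start}, [])
    return paths


if __name__ == "__main__":
    for path in simple_paths(EDGES, "Project Alpha", "Project Gamma"):
        print(" ; ".join(f"{s} -[{r}]-> {d}" for s, r, d in path))
```

Run against these three edges, the search yields one single-hop path and one two-hop path through Project Beta, matching the direct and indirect paths described in the answer.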

Transcript

per-call transcript JSONL