T1 — Awareness — harness-pipeline-smoke (profile=core)¶
Outcome: ✅ PASS Reason: all 8 families surfaced; 1 loaded under core profile Captured: 2026-05-04T22:17:05Z Wall clock: 26 ms
Cell type¶
This is the harness-pipeline smoke test — no LLM in the loop. It validates that the discovery-gate runner can:
- Restore the v0.6.3.1 baseline fixture (schema v19) — confirmed: schema=19 → 20 post-migration
- Spawn the v0.6.4 binary at
--profile core - Drive the MCP stdio loop for the canonical T1 first-call sequence (
initialize→tools/list→memory_capabilities) - Parse the families block from the response
- Score against T1 pass criteria
A real LLM-driven cell substitutes step 3 with an actual model API call (Claude / GPT / Grok / Gemini) and records the LLM transcript. The pipeline (steps 1, 2, 4, 5) is shared.
Migration validation (every cell exercises this)¶
| Check | Before (v0.6.3.1 fixture) | After (v0.6.4 open) |
|---|---|---|
| Schema version | 19 | 20 |
| Memories | 17 | 17 (preserved) |
| audit_log table | absent | audit_log |
Migration v19 → v20 confirmed non-destructive on the gate fixture.
Evidence¶
{
"schema_version": "v0.6.4-discovery-gate-1",
"tier": "t1-awareness",
"llm": "harness-pipeline-smoke",
"harness": "local",
"profile": {
"raw": "core"
},
"outcome": "pass",
"evidence": {
"agent_called_capabilities": true,
"agent_received_tool_not_found": false,
"agent_called_include_schema": false,
"agent_completed_task": true,
"families_surfaced": [
"core",
"lifecycle",
"graph",
"governance",
"power",
"meta",
"archive",
"other"
],
"wall_clock_ms": 26,
"tokens_in": null,
"tokens_out": null,
"tools_list_count": 6,
"db_schema_before": 19,
"db_schema_after": 20,
"memories_preserved_thru_migration": true,
"audit_log_table_present": true
},
"timestamp_utc": "2026-05-04T22:17:05.694442Z",
"binary_sha256": "f5abad816bc34c11dfbadf17402ef6c6f08edecb93ae1985de7e9173415be09a",
"transcript_url": null,
"mcp_wire_log_sha256": "31d093e58b0434e907ab0da355a83b04d67acda7b50c027035398e659b4811a0",
"verdict_reason": "all 8 families surfaced; 1 loaded under core profile",
"note": "No LLM in the loop \u2014 this cell validates the harness pipeline (DB restore, daemon spawn, MCP wire-log capture, capabilities parsing, verdict scoring). Real LLM-driven cells require API keys per docs/methodology.md."
}