MONSTER_INTEGRATION_V1
Single unified bench. Fires every --chat route in one process per axis, asserts: (a) the right handler took the prompt, (b) organs_used matches expectation, (c) a DAG entry was written, (d) the DAG entry carries a real Black-Dog signal (food / poison / conductance fields present).
| metric | value | total | |---|---|---| | route_landed | 8 | 8 | | organ_match | 8 | 8 | | dag_entry_written | 6 | 8 | | bd_signal_present | 6 | 8 | | verifier_ok | 7 | 8 | | overall pass | 8 | 8 |
Per-axis
| axis | route_ok | organ_ok | dag | bd | wall_s | route_actual | verifier | DAG f/p/cb→ca | |---|---|---|---|---|---|---|---|---| | identity | OK | OK | OK | OK | 21.2 | identity_fast | True | 1.0/0.0/1.000→1.000 | | exact_cache | OK | OK | X | X | 0.0 | hologram_replay | True | — | | form_replay | OK | OK | X | X | 0.1 | form_replay | True | — | | terminal_native | OK | OK | OK | OK | 30.1 | terminal_native | True | 1.0/0.0/0.738→0.790 | | code_repair | OK | OK | OK | OK | 23.7 | code_repair_native_parallel | True | 1.0/0.0/0.832→0.866 | | ariz_organ_first | OK | OK | OK | OK | 27.1 | ariz_organ_first | True | 1.0/0.0/0.488→0.590 | | json_repair | OK | OK | OK | OK | 44.7 | json_repair_fast | False | 0.0/1.0/-0.200→-0.360 | | claim_extractor | OK | OK | OK | OK | 3.5 | claim_fast | True | 1.0/0.0/0.360→0.488 |
Architectural witness
Each axis writes its own DAG entry under dag/runs/. The bench reads the NEWEST entry whose route substring matches the expected dispatcher branch and confirms food_score / poison_score / conductance_before / conductance_after are populated.
identity_fast—run_chat_identity(Phase-13.BD2 fix)hologram_replay—emit_holo_hit_envelope_v2(Phase-12.HR), no organform_replay—run_form_replay(Phase-12.HFR), no organterminal_native—run_native_terminal_task(Phase-12.TR + BD2 wiring)code_repair—cr_eval_oneper attempt (Phase-12.CR.PAR + BD2)ariz_organ_first*—run_chat_ariz_organ_first(Phase-13.BD4) — 0.5B FIRST, 7B fallbackjson_repair*—run_chat_json_repair(legacy 0.5B → 7B fallback)claim_extractor*—run_chat_organ_route(Phase-13.BD2 verifier-driven food/poison)
DOD
- GREEN — routes landed 8/8, DAG written 6/8, BD signal present 6/8. The architecture is wired end-to-end.