Regression 8F1 — native batch runner
Cases: 4/4 (100 %)
Total wall: 304.056 s. Tier ≤ 1.
Per-organ summary

| organ | n | pass | pass% | avg wall (s) | avg tok/s | first failure reason |
|-------|---|------|-------|--------------|-----------|----------------------|
| phys05_claim_extractor | 2 | 2 | 100 | 81.7948 | 0.276948 | |
| phys05_triz_contradiction | 2 | 2 | 100 | 70.2332 | 0.61916 | |

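
The per-organ averages and the total wall time can be cross-checked against the per-case wall times and throughputs reported in this document. The following is a minimal sketch of that aggregation, not the batch runner's actual code; the names `CASES` and `summarize` are hypothetical, and the numbers are copied from the per-case rows of this report.

```python
from collections import defaultdict

# (id, organ, wall seconds, tok/s), copied from the per-case rows of this report.
CASES = [
    ("triz_01", "phys05_triz_contradiction", 73.0051, 0.660055),
    ("triz_02", "phys05_triz_contradiction", 67.4613, 0.578266),
    ("claim_01", "phys05_claim_extractor", 69.8023, 0.37258),
    ("claim_02", "phys05_claim_extractor", 93.7874, 0.181316),
]


def summarize(cases):
    """Group cases by organ and compute count, mean wall time, and mean tok/s."""
    groups = defaultdict(list)
    for _case_id, organ, wall, tps in cases:
        groups[organ].append((wall, tps))
    summary = {}
    for organ, rows in groups.items():
        n = len(rows)
        summary[organ] = {
            "n": n,
            "avg_wall": sum(w for w, _ in rows) / n,
            "avg_tps": sum(t for _, t in rows) / n,
        }
    return summary


summary = summarize(CASES)
total_wall = sum(wall for _, _, wall, _ in CASES)

for organ, stats in sorted(summary.items()):
    print(f"{organ}: n={stats['n']} "
          f"avg_wall={stats['avg_wall']:.4f} avg_tps={stats['avg_tps']:.6f}")
print(f"total wall: {total_wall:.3f} s")
```

The averages here are plain arithmetic means over cases, which is an assumption about how the runner aggregates; a token-weighted throughput would give different `avg tok/s` values.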
Per-case detail

| id | organ | ok | wall (s) | tok/s | verifier | output[:80] |
|----|-------|----|----------|-------|----------|-------------|
| triz_01 | phys05_triz_contradiction | ✅ | 73.0051 | 0.660055 | TC+PC both filled, ≥8 chars each | {"technical_contradiction":"heatsink size vs dissipation rate","physical_contrad |
| triz_02 | phys05_triz_contradiction | ✅ | 67.4613 | 0.578266 | TC+PC both filled, ≥8 chars each | {"technical_contradiction":"frame strength vs weight","physical_contradiction":" |
| claim_01 | phys05_claim_extractor | ✅ | 69.8023 | 0.37258 | array with ≥1 claim items | [{"claim":"The Eiffel Tower is 330m tall","type":"number/height/value"}] |
| claim_02 | phys05_claim_extractor | ✅ | 93.7874 | 0.181316 | array with ≥1 claim items | [{"claim":"Bees pollinate flowers","type":"fact/number"}] |
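
The verifier column names two pass criteria: "TC+PC both filled, ≥8 chars each" and "array with ≥1 claim items". A rough sketch of checks matching those descriptions might look like the following. The function names `verify_triz` and `verify_claims` are hypothetical, and details such as whitespace stripping and requiring a non-empty `claim` string per item are assumptions, not the runner's confirmed behavior.

```python
import json


def _parse(raw):
    """Return the parsed JSON value, or None if raw is not valid JSON."""
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        return None


def verify_triz(raw, min_chars=8):
    """Assumed rule: both contradiction fields are strings of >= min_chars."""
    obj = _parse(raw)
    if not isinstance(obj, dict):
        return False
    for key in ("technical_contradiction", "physical_contradiction"):
        value = obj.get(key)
        # Assumption: surrounding whitespace does not count toward the length.
        if not isinstance(value, str) or len(value.strip()) < min_chars:
            return False
    return True


def verify_claims(raw):
    """Assumed rule: a JSON array with at least one item carrying a claim."""
    items = _parse(raw)
    if not isinstance(items, list) or not items:
        return False
    # Assumption: every item must be an object with a non-empty "claim" string.
    return all(
        isinstance(item, dict)
        and isinstance(item.get("claim"), str)
        and item["claim"].strip()
        for item in items
    )


claim_01 = '[{"claim":"The Eiffel Tower is 330m tall","type":"number/height/value"}]'
print(verify_claims(claim_01))
print(verify_claims("[]"))
```

Only the `claim_*` outputs appear in full in the table; the `triz_*` outputs are truncated at 80 characters, so they cannot be re-verified from this report alone.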