Regression 8F1 — native batch runner
Cases: 2/4 (50 %)
Total wall: 11.7877 s. Tier ≤ 1.
Per-organ summary
| organ | n | pass | pass% | avg wall (s) | avg tok/s | first failure reason |
|-------|---|------|-------|--------------|-----------|----------------------|
| phys05_claim_extractor | 2 | 2 | 100 | 2.80157 | 14.058 | |
| phys05_triz_contradiction | 2 | 0 | 0 | 3.09229 | 17.2739 | missing technical_contradiction key |
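The per-organ figures can be re-derived from the per-case rows of this report. The following is a minimal sketch of that arithmetic, not the runner's actual code: the tuple layout and the names `CASES` and `summarize` are assumptions made for illustration, and only the numeric values are copied from the report.

```python
from collections import defaultdict

# (id, organ, ok, wall_s, tok_per_s), values copied from the per-case rows.
# The tuple layout itself is an assumption made for this sketch.
CASES = [
    ("triz_01", "phys05_triz_contradiction", False, 3.17602, 7.61387),
    ("triz_02", "phys05_triz_contradiction", False, 3.00856, 26.9339),
    ("claim_01", "phys05_claim_extractor", True, 3.06615, 20.2288),
    ("claim_02", "phys05_claim_extractor", True, 2.53698, 7.88719),
]


def summarize(cases):
    """Group cases by organ and compute count, passes, pass % and plain means."""
    groups = defaultdict(list)
    for _case_id, organ, ok, wall, tps in cases:
        groups[organ].append((ok, wall, tps))

    summary = {}
    for organ, rows in groups.items():
        n = len(rows)
        passed = sum(1 for ok, _, _ in rows if ok)
        summary[organ] = {
            "n": n,
            "pass": passed,
            "pass_pct": 100 * passed / n,
            "avg_wall": sum(wall for _, wall, _ in rows) / n,
            "avg_tps": sum(tps for _, _, tps in rows) / n,
        }
    return summary


summary = summarize(CASES)
total_wall = sum(case[3] for case in CASES)  # sum of per-case wall times

for organ, stats in sorted(summary.items()):
    print(organ, stats)
print("total wall (s):", round(total_wall, 4))
```

Under these assumptions the averages are unweighted means over each organ's cases, and the total wall time is the plain sum of the four per-case wall times.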
Per-case detail
| id | organ | ok | wall (s) | tok/s | verifier | output[:80] |
|----|-------|----|----------|-------|----------|-------------|
| triz_01 | phys05_triz_contradiction | ❌ | 3.17602 | 7.61387 | missing technical_contradiction key | {"technical_contradictions":["small heatsink"],"physical_contradictions":[]} |
| triz_02 | phys05_triz_contradiction | ❌ | 3.00856 | 26.9339 | JSON syntax invalid: parse error: expected : | {"technical_contradictions":[{"lightness":true,"stiffness":false},{"strength":{" |
| claim_01 | phys05_claim_extractor | ✅ | 3.06615 | 20.2288 | array with ≥1 claim items | [{"claim":"The Eiffel Tower is 330m tall","type":"number/fact/causal/instruction |
| claim_02 | phys05_claim_extractor | ✅ | 2.53698 | 7.88719 | array with ≥1 claim items | [{"claim":"Bees pollinate flowers","type":"fact/number/causal"}] |
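The verifier messages in the table suggest two kinds of check: a required-key check for the TRIZ organ and a non-empty-array check for the claim extractor. The sketch below is a hypothetical Python approximation of such checks, not the native runner's implementation. The function names `verify_triz` and `verify_claims`, the success message for the TRIZ check, and the failure message for the claim check are assumptions; Python's JSON error text also differs from the native parser's `parse error: expected :` wording.

```python
import json


def verify_triz(raw: str) -> tuple[bool, str]:
    """Hypothetical check: output must be a JSON object with the singular key."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError as exc:
        # Python's message text differs from the native runner's parser.
        return False, f"JSON syntax invalid: {exc.msg}"
    if not isinstance(obj, dict) or "technical_contradiction" not in obj:
        return False, "missing technical_contradiction key"
    return True, "technical_contradiction key present"  # assumed wording


def verify_claims(raw: str) -> tuple[bool, str]:
    """Hypothetical check: output must be a non-empty array of claim objects."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError as exc:
        return False, f"JSON syntax invalid: {exc.msg}"
    if isinstance(obj, list) and obj and all(
        isinstance(item, dict) and "claim" in item for item in obj
    ):
        return True, "array with ≥1 claim items"
    return False, "expected a non-empty array of claim items"  # assumed wording


# Full (untruncated) outputs copied from the triz_01 and claim_02 rows.
triz_01 = '{"technical_contradictions":["small heatsink"],"physical_contradictions":[]}'
claim_02 = '[{"claim":"Bees pollinate flowers","type":"fact/number/causal"}]'

print(verify_triz(triz_01))
print(verify_claims(claim_02))
```

Run against the triz_01 output, a check of this shape fails because the output carries the plural key `technical_contradictions` while the reported message names the singular `technical_contradiction`. Whether the prompt or the verifier should change is not something this report settles.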