CyberdyneLabs · Reports · TERMINAL_NANOOS_30

TERMINAL_NANOOS_30

reports/TERMINAL_NANOOS_30.md 619 words raw markdown ↗

TERMINAL_NANOOS_30

Phase-12 NanoOS Capsule Substrate × 30-task Terminal-Bench-style suite. PARROT one-shot vs MONSTER --chat envelope -> C++ runtime k=1..3 retry.

| mode | pass | total | rate | wall | VWS | |---|---|---|---|---|---| | PARROT | 20 | 30 | 67 % | 10.7s | 1.87 | | MONSTER | 22 | 30 | 73 % | 25.2s | 0.87 |

Δ MONSTER − PARROT: +2, wall ratio MONSTER/PARROT: 2.35×.

Per-task

| task | diff | PARROT | MONSTER | rounds (M) | capsule (M) | artifacts (M) | |---|---|---|---|---|---|---| | create_file_exact | easy | OK | OK | 1 | 30_185521_f67193 | a948904f2f0f479b | | run_python_print_42 | easy | OK | OK | 1 | 30_185521_a3744e | 58a44735ffdfa6b1 | | parse_json | easy | X | OK | 1 | 30_185521_768bb5 | 36bd4ed657a57ace | | append_to_file | easy | OK | OK | 1 | 30_185522_6952bd | 4a1e67f2fe1d1cc7 | | count_words | easy | OK | OK | 1 | 30_185522_f73802 | 37bbce1230bbac4c | | rename_file | easy | OK | OK | 1 | 30_185522_8d6bec | 68adc578f362e140 | | uppercase_file | easy | OK | OK | 1 | 30_185522_1bc347 | a948904f2f0f479b, 2949725604dd9eef | | count_error_lines | easy | OK | OK | 1 | 30_185523_0df829 | 41c29f94e5ce4223 | | find_pattern_grep | easy | OK | OK | 1 | 30_185523_377484 | 343390bab4363fd0 | | make_dir_and_file | easy | OK | OK | 1 | 30_185523_532cd9 | d117fa006ba92085 | | sed_transform | medium | OK | OK | 2 | 30_185524_8f3afb | ac914dfa543e017c, ac914dfa543e017c | | chmod_run_executable | medium | OK | OK | 1 | 30_185525_884f49 | f2e35109e0f7bfa8 | | produce_patch | medium | OK | X | 3 | 30_185526_5593bc | 470450d1505afcca, eb7aa2dfbdd9d0e7, c684c7ca167360a4 | | verify_output_hash | medium | OK | OK | 1 | 30_185527_239302 | 5fc4ae2d24d613fb, 8bd186b55ecb5d98 | | extract_field_awk | medium | OK | OK | 1 | 30_185527_cd391b | e061394a063b8486 | | json_repair_missing_brace | medium | X | X | 3 | 30_185529_c0284b | b7b4e6ee2d168446 | | multistep_chain | medium | OK | OK | 1 | 30_185529_33a0b6 | 14c5e74c4b96ccef | | shellscript_run | medium | OK | OK | 1 | 30_185530_cef4ca | 3fe809fb11697744 | | csv_to_json | medium | X | X | 3 | 30_185533_2a7f1d | 1347fa16719cf5e4, 3fd84de16a2aa121 | | rename_then_diff | medium | OK | OK | 1 | 30_185534_bb6d7e | 7203dc815d0b0b7f, 25718360e05d3c2d, 7f8b1dfc466b6249 | | fix_failing_test | hard | X | OK | 1 | 30_185534_aa5eee | fa8bf169bf67ba1c, ba1a531f581d2e60, 3b3f657c1874839f | | compile_cpp_missing_include | hard | X | OK | 2 | 30_185535_463ad4 | 9d648401c7b6fb42, 3433be63e64588d1 | | find_bug_from_stderr | hard | X | X | 3 | 30_185537_1228f5 | ede1fb2f0d44a773 | | fix_python_syntax_error | hard | OK | X | 3 | 30_185538_7bc36f | bc44a7bc45c0afdd | | fix_indent_error | hard | X | X | 3 | 30_185540_4ae169 | ce311384c953bddf | | fix_import_error | hard | X | X | 3 | 30_185541_a0ca66 | 4a0f67afb1a8ebbe | | compile_cpp_typo | hard | X | OK | 1 | 30_185542_b08ce3 | 51dccfdfead2c68e, 865d069e62a8d8ff | | compile_cpp_missing_semicolon | hard | X | X | 3 | 30_185544_fb7f65 | 7b7daaa1777c36bc | | fix_off_by_one | hard | OK | OK | 1 | 30_185545_3d610a | bfc908d793be37d4 | | regex_replace_capture | hard | OK | OK | 1 | 30_185545_6e199f | 12dab4732dc90c3e, 118570d3c591607f |

Architectural witness

Every MONSTER pass row carries capsule_id + at least one artifact sha256. Replay any of them by feeding the spec under replay_recipe.spec_inline (in dag/capsules/cap_*.json) back to tools/capsule/shell_capsule.py. For rows where rounds > 1, the C++ runtime fed stderr+exit codes from k-1 capsule into the next prompt.

DOD