README.md
# Self-directed benchmark: ssm_mamba strategy stress test
- Paper ID: self.20260428134113.039
- Hypothesis: Benchmark a simplified recovered baseline against an ablated variant.
- Plan: Run the deterministic recovery benchmark and capture VRAM plus throughput telemetry.
- Expected Signal: Recovered benchmark should execute and emit standard telemetry fields.
- Concept Combo: recovery_baseline
- Note: Generated fallback because the architect returned no content.
results.log
--- AUTO FIX ---
Applied CPU fallback benchmark profile to reduce runtime and re-run.
Replaced invalid benchmark script with deterministic ARES recovery benchmark.
--- ATTEMPT: initial (code=124) ---
Status: TIMEOUT
--- STDOUT ---
--- RUNTIME PROFILE ---
Device policy: gpu_required
Torch: 2.11.0+rocm7.1
Accelerator backend: rocm
Torch CUDA build: None
Torch HIP build: 7.1.52802
CUDA available: True
CUDA device count: 1
CUDA device[0]: AMD Radeon 890M
Accelerator memory total: 92160.0 MB
Accelerator memory used: 46878.1 MB
Recommended autocast dtype: bf16
Recommended DataLoader pin_memory: True
Recommended DataLoader num_workers: 12
Recommended starting batch size: 64
Recommended CPU threads: 24
/dev/kfd present: True
--- STDERR ---
Timeout while waiting for process shutdown
Timeout
--- ATTEMPT: retry_after_autofix_1 (code=1) ---
--- STDOUT ---
--- STDERR ---
Traceback (most recent call last):
File "/home/corbybender/ares/benchmark_runner.py", line 80, in <module>
main()
~~~~^^
File "/home/corbybender/ares/benchmark_runner.py", line 76, in main
runpy.run_path(str(script_path), run_name="__main__")
~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "<frozen runpy>", line 287, in run_path
File "<frozen runpy>", line 98, in _run_module_code
File "<frozen runpy>", line 88, in _run_code
File "/home/corbybender/ares/experiments/exp_self.20260428134113.039_20260428_134113/benchmark.py", line 8, in <module>
D_MODEL = min(D_MODEL, 192)
^^^^^^^
NameError: name 'D_MODEL' is not defined
--- ATTEMPT: recovery_fallback (code=124) ---
Status: TIMEOUT
--- STDOUT ---
--- STDERR ---
Timeout while waiting for process shutdown
Timeout
--- HUMAN SUMMARY (LAYMAN) ---
Automatic repair applied by ARES: Applied CPU fallback benchmark profile to reduce runtime and re-run.
Replaced invalid benchmark script with deterministic ARES recovery benchmark.
Result: The test did not finish successfully.
Main failure observed: Timeout while waiting for process shutdown
Plain-English interpretation: The script hit a runtime problem before it could report final metrics.