README.md
# Experiment Benchmark
This experiment contains a runnable benchmark generated by ARES.
## Files
- `benchmark.py`: main benchmark entrypoint
- `results.log`: captured runtime output after execution
## Run
```bash
python benchmark.py
```
## Expected Output
- `VRAM_USAGE: <value>MB`
- `TOKENS_PER_SEC: <value>`
- A final `VERIFIED:` or `RESULT:` status line
## Runtime Notes
- Uses a local runnable Python benchmark implementation.
results.log
--- ATTEMPT: initial (code=0) ---
--- STDOUT ---
--- RUNTIME PROFILE ---
Device policy: gpu_preferred
Torch: 2.11.0+rocm7.1
Accelerator backend: rocm
Torch CUDA build: None
Torch HIP build: 7.1.52802
CUDA available: True
CUDA device count: 1
CUDA device[0]: AMD Radeon 890M Graphics
Accelerator memory total: 73728.0 MB
Accelerator memory used: 14810.1 MB
Recommended autocast dtype: bf16
Recommended DataLoader pin_memory: True
Recommended DataLoader num_workers: 12
Recommended starting batch size: 64
Recommended CPU threads: 24
/dev/kfd present: True
VRAM_USAGE: 0MB
TOKENS_PER_SEC: 396070.83
VERIFIED: PASS - deterministic stdlib exercise completed
RESULT_JSON: {"label": "Type-safe Package Interface", "elapsed_s": 1.3e-05}
--- STDERR ---
--- HUMAN SUMMARY (LAYMAN) ---
Result: The test completed successfully.
Benchmark script conclusion: VERIFIED: PASS - deterministic stdlib exercise completed