← Back to index
v0.1.0 — Add Maze Benchmark
[0.1.0] add maze benchmark
Added
- Initial maze benchmark implementation
- Maze Gauntlet Evaluator with gradient scoring system
- Support for ASCII maze parsing from LLM output
- Scoring components: Ambition, Progress, Objectives, Proximity, Danger
- CLI runner (
run_benchmark.py)