Model Strategies & Patterns
Opus Magnum Bench
Strategies
Failure modes
Reward hacks
←
Solve matrix