Benchmark Run: 20250314_111430 Server: http://localhost:11434 CPU Information: python_version: 3.10.16.final.0 (64 bit) cpuinfo_version: [9, 0, 0] cpuinfo_version_string: 9.0.0 arch: ARM_8 bits: 64 count: 10 arch_string_raw: arm64 brand_raw: Apple M1 Pro Benchmark Results: 🏆 Final Model Leaderboard: olmo2:13b-1124-instruct-q4_K_M Overall Success Rate: 81.9% (59/72 cases) Average Tokens/sec: 9.16 (8.49 - 9.16) Average Duration: 27.80s Min/Max Avg Duration: 9.97s / 27.80s Test Results: - Fibonacci: ❌ 5/18 cases (27.8%) - Binary Search: ✅ 18/18 cases (100.0%) - Palindrome: ✅ 18/18 cases (100.0%) - Anagram Check: ✅ 18/18 cases (100.0%)