760 B
760 B
Benchmark Run: 20250314_111430
Server: http://localhost:11434
CPU Information:
python_version: 3.10.16.final.0 (64 bit)
cpuinfo_version: [9, 0, 0]
cpuinfo_version_string: 9.0.0
arch: ARM_8
bits: 64
count: 10
arch_string_raw: arm64
brand_raw: Apple M1 Pro
Benchmark Results:
🏆 Final Model Leaderboard:
olmo2:13b-1124-instruct-q4_K_M
Overall Success Rate: 81.9% (59/72 cases)
Average Tokens/sec: 9.16 (8.49 - 9.16)
Average Duration: 27.80s
Min/Max Avg Duration: 9.97s / 27.80s
Test Results:
- Fibonacci: ❌ 5/18 cases (27.8%)
- Binary Search: ✅ 18/18 cases (100.0%)
- Palindrome: ✅ 18/18 cases (100.0%)
- Anagram Check: ✅ 18/18 cases (100.0%)
Server: http://localhost:11434
CPU Information:
python_version: 3.10.16.final.0 (64 bit)
cpuinfo_version: [9, 0, 0]
cpuinfo_version_string: 9.0.0
arch: ARM_8
bits: 64
count: 10
arch_string_raw: arm64
brand_raw: Apple M1 Pro
Benchmark Results:
🏆 Final Model Leaderboard:
olmo2:13b-1124-instruct-q4_K_M
Overall Success Rate: 81.9% (59/72 cases)
Average Tokens/sec: 9.16 (8.49 - 9.16)
Average Duration: 27.80s
Min/Max Avg Duration: 9.97s / 27.80s
Test Results:
- Fibonacci: ❌ 5/18 cases (27.8%)
- Binary Search: ✅ 18/18 cases (100.0%)
- Palindrome: ✅ 18/18 cases (100.0%)
- Anagram Check: ✅ 18/18 cases (100.0%)