190 Commits

Author SHA1 Message Date
Michael Peter Christen
8cd2f1b795 more benchmarks 2025-03-27 00:09:51 +01:00
Michael Peter Christen
556e67b2c7 ollama client can now call multimodal models 2025-03-25 22:57:18 +01:00
Michael Peter Christen
39518e8647 problems scraper can now load images from the problems 2025-03-25 21:31:49 +01:00
Michael Peter Christen
317956264a more benchmarks 2025-03-25 07:05:10 +01:00
Michael Peter Christen
dd992ec38e Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-21 11:49:19 +01:00
Michael Peter Christen
ee41927066 more benchmarks 2025-03-21 11:49:17 +01:00
Michael Peter Christen
47d9517082 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-21 11:48:43 +01:00
Michael Peter Christen
6aab9859d4 changed exaone benchmark 2025-03-21 11:48:40 +01:00
Michael Peter Christen
714ac3fc87 updated deepseek-r1:1.5b-qwen-distill-q4_K_M with longer response token number 2025-03-21 06:21:34 +01:00
Michael Peter Christen
92c06494c8 fixed conflicts 2025-03-20 18:48:49 +01:00
Michael Peter Christen
7447bc401e more benchmarks 2025-03-20 18:05:42 +01:00
Michael Peter Christen
dc61713847 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-20 07:47:01 +01:00
Michael Peter Christen
1d84fc91f8 more benchmarks 2025-03-20 07:46:58 +01:00
Michael Peter Christen
abbb18e326 program fixes, use more tokens for thinking 2025-03-20 07:45:28 +01:00
Michael Peter Christen
bdab2b852f Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-19 23:15:54 +01:00
Michael Peter Christen
b5f28fdb42 update benchmark 2025-03-19 22:29:18 +01:00
Michael Peter Christen
fdb8b488ff Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-19 19:32:48 +01:00
Michael Peter Christen
c7957bf922 more benchmarks 2025-03-19 19:32:45 +01:00
Michael Peter Christen
adfe4690b2 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-18 18:53:24 +01:00
Michael Peter Christen
67fd17b2a3 added OlympicCoder 2025-03-18 18:52:44 +01:00
Michael Peter Christen
cd0192b3a4 fixed conflicts 2025-03-18 06:02:42 +01:00
Michael Peter Christen
f645a5cdfa more benchmarks 2025-03-18 06:01:48 +01:00
Michael Peter Christen
6e0ea8708d added gemma3 results 2025-03-17 06:37:47 +01:00
Michael Peter Christen
aafc6ac0f4 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-17 06:35:16 +01:00
Michael Peter Christen
36c28a9c0e added gemma3 2025-03-17 06:35:13 +01:00
Michael Peter Christen
4fa50074d5 added gemma3:27b 2025-03-17 06:24:28 +01:00
Michael Peter Christen
0b5344ef21 added README 2025-03-08 01:22:57 +01:00
Michael Peter Christen
c73387d42e Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-08 01:22:24 +01:00
Michael Peter Christen
c051e977ac more benchmarks 2025-03-08 01:22:20 +01:00
Michael Peter Christen
d342c6851e more benchmarks 2025-03-07 23:11:36 +01:00
Michael Peter Christen
b71271847a merged conflicts 2025-03-06 21:55:21 +01:00
Michael Peter Christen
a4698bde06 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-06 21:37:59 +01:00
Michael Peter Christen
d96b71aa85 added qwen2.5:3b and llama3.2:1b missing values 2025-03-06 21:37:47 +01:00
Michael Peter Christen
616eeebaeb result output format fix 2025-03-02 12:43:50 +01:00
Michael Peter Christen
cb0054769a result output format change 2025-03-02 12:37:08 +01:00
Michael Peter Christen
d064692724 added more models 2025-03-02 10:45:38 +01:00
Michael Peter Christen
06ed29a210 requirements.txt 2025-02-26 21:13:35 +01:00
Michael Peter Christen
ff3de400fb added Viper Coder Hybrid 1.3 2025-02-26 21:13:16 +01:00
Michael Peter Christen
a7e28e094d Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-02-23 23:37:39 +01:00
Michael Peter Christen
2321365efe added falcon:80b benchmark (partly) 2025-02-23 23:37:36 +01:00
Michael Peter Christen
10e9bc6d1b benchmark correction 2025-02-23 23:23:31 +01:00
Michael Peter Christen
def91b21a4 more benchmarks 2025-02-22 11:06:26 +01:00
Michael Peter Christen
3efc54ae2e Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-02-22 11:05:59 +01:00
Michael Peter Christen
a55b984b6d more benchmarks 2025-02-22 11:05:56 +01:00
Michael Peter Christen
58e14ff7aa more benchmarks 2025-02-20 19:51:41 +01:00
Michael Peter Christen
40e3226547 more benchmarks 2025-02-20 14:38:34 +01:00
Michael Peter Christen
e2cb2e6c3c publish 2025-02-19 06:11:35 +01:00
Michael Peter Christen
062284068e resolved conflicts 2025-02-19 06:09:06 +01:00
Michael Peter Christen
d3ce934431 more benchmarks 2025-02-19 06:07:55 +01:00
Michael Peter Christen
71baa09d80 added DeepScaleR 1.5B 2025-02-17 06:05:14 +01:00