190 Commits

Author SHA1 Message Date
Michael Peter Christen
e74c5b1cd3 more benchmarks 2025-05-21 00:20:45 +02:00
Michael Peter Christen
aed802413c more benchmarks 2025-05-21 00:18:17 +02:00
Michael Peter Christen
1cc7eaafd5 more benchmarks 2025-05-20 07:43:46 +02:00
Michael Peter Christen
e7472a36d0 making endpoint class main argument in ollama client 2025-05-18 23:38:01 +02:00
Michael Peter Christen
f67c4ce38f asynchronous loading of remote models 2025-05-18 22:47:42 +02:00
Michael Peter Christen
c049b908dd redesign of endpoint and task classes 2025-05-18 22:30:50 +02:00
Michael Peter Christen
b080928020 full abstraction of multiprocessing in ollama client 2025-05-18 10:39:02 +02:00
Michael Peter Christen
df5975f978 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-05-17 21:43:31 +02:00
Michael Peter Christen
775fea6cbe more benchmarks 2025-05-17 21:43:28 +02:00
Michael Peter Christen
b0be4e602c refactoring: more abstract server-client process 2025-05-17 19:32:29 +02:00
Michael Peter Christen
ef4996269c refactoring 2025-05-17 18:22:55 +02:00
Michael Peter Christen
fdcd98878b more benchmarks 2025-05-11 22:13:59 +02:00
Michael Peter Christen
23507ab1c6 more benchmarks 2025-05-11 14:03:17 +02:00
Michael Peter Christen
00702c0b56 more benchmarks 2025-05-08 21:21:05 +02:00
Michael Peter Christen
3b490e0f15 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-05-08 21:20:30 +02:00
Michael Peter Christen
4b05812443 more benchmarka 2025-05-08 21:20:27 +02:00
Michael Peter Christen
9d819d8370 ability to remotely pull and delete models 2025-05-08 21:18:38 +02:00
Michael Peter Christen
35e87b1050 added automatic loading of models if they do not exist on remote servers 2025-05-06 23:22:29 +02:00
Michael Peter Christen
ce9a189d5f concurrency for inference on remote server 2025-05-04 21:42:15 +02:00
Michael Peter Christen
445ac928da fixed conflicts 2025-05-04 13:27:10 +02:00
Michael Peter Christen
6733a1d0f9 more benchmarks 2025-05-04 12:12:51 +02:00
Michael Peter Christen
43c0f9bc28 more benchmarks 2025-05-04 12:04:34 +02:00
Michael Peter Christen
c0faa73ef7 more benchmarks 2025-05-04 08:07:33 +02:00
Michael Peter Christen
2ca40d703b Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-05-01 15:37:02 +02:00
Michael Peter Christen
5ebfc0989a more benchmarks 2025-05-01 15:37:02 +02:00
Michael Peter Christen
69877de44f enhanced code extraction for thinking models and recalculated some benchmarks 2025-05-01 12:38:17 +02:00
Michael Peter Christen
811343a570 fixed conflicts 2025-05-01 11:44:52 +02:00
Michael Peter Christen
a7c9a5619e more benchmarks 2025-05-01 11:42:38 +02:00
Michael Peter Christen
581125ec46 more benchmarks 2025-04-30 05:57:15 +02:00
Michael Peter Christen
88ac851380 refactoring for parallel processing 2025-04-29 21:33:41 +02:00
Michael Peter Christen
5e61e1667a more benchmarks 2025-04-27 10:41:27 +02:00
Michael Peter Christen
c8610ff401 fixed model_dict 2025-04-17 23:33:42 +02:00
Michael Peter Christen
2537be1209 added option to re-calculate failed inferences again 2025-04-17 22:41:37 +02:00
Michael Peter Christen
968ced1a5e more benchmarks, changes for 04-mini, re-inference for failed results 2025-04-17 20:02:19 +02:00
Michael Peter Christen
2b2911304e Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-04-17 09:36:27 +02:00
Michael Peter Christen
acb2a506ea added more benchmarks 2025-04-17 09:36:23 +02:00
Michael Peter Christen
45b8f3c532 added more benchmarks, new best model GPT-4.1 2025-04-16 21:57:19 +02:00
Michael Peter Christen
8fd0e47897 more benchmarks 2025-04-12 17:15:27 +02:00
Michael Peter Christen
dc9c928475 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-04-04 15:16:37 +02:00
Michael Peter Christen
ed32bffa82 more benchmarks 2025-04-04 15:16:34 +02:00
Michael Peter Christen
b530866e6f more benchmarks 2025-03-29 22:41:16 +01:00
Michael Peter Christen
3af0da99ed Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-28 20:10:30 +01:00
Michael Peter Christen
b3d37e2044 some refactoring 2025-03-28 20:10:26 +01:00
Michael Peter Christen
736a81d8f3 merged conflicts 2025-03-28 19:59:07 +01:00
Michael Peter Christen
fc53f299bb updated benchmark 2025-03-28 19:57:39 +01:00
Michael Peter Christen
dcde2ee6bf updated README 2025-03-27 21:33:54 +01:00
Michael Peter Christen
1e60a37aeb more benchmarks 2025-03-27 15:14:44 +01:00
Michael Peter Christen
513af8a8ce fixed in code extraction and inference, added multimodality (stub) 2025-03-27 13:48:09 +01:00
Michael Peter Christen
e5a8d76098 more benchmarks 2025-03-27 00:10:22 +01:00
Michael Peter Christen
9055212f22 Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark 2025-03-27 00:09:53 +01:00