Michael Peter Christen
|
e74c5b1cd3
|
more benchmarks
|
2025-05-21 00:20:45 +02:00 |
|
Michael Peter Christen
|
aed802413c
|
more benchmarks
|
2025-05-21 00:18:17 +02:00 |
|
Michael Peter Christen
|
1cc7eaafd5
|
more benchmarks
|
2025-05-20 07:43:46 +02:00 |
|
Michael Peter Christen
|
e7472a36d0
|
making endpoint class main argument in ollama client
|
2025-05-18 23:38:01 +02:00 |
|
Michael Peter Christen
|
f67c4ce38f
|
asynchronous loading of remote models
|
2025-05-18 22:47:42 +02:00 |
|
Michael Peter Christen
|
c049b908dd
|
redesign of endpoint and task classes
|
2025-05-18 22:30:50 +02:00 |
|
Michael Peter Christen
|
b080928020
|
full abstraction of multiprocessing in ollama client
|
2025-05-18 10:39:02 +02:00 |
|
Michael Peter Christen
|
df5975f978
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-05-17 21:43:31 +02:00 |
|
Michael Peter Christen
|
775fea6cbe
|
more benchmarks
|
2025-05-17 21:43:28 +02:00 |
|
Michael Peter Christen
|
b0be4e602c
|
refactoring: more abstract server-client process
|
2025-05-17 19:32:29 +02:00 |
|
Michael Peter Christen
|
ef4996269c
|
refactoring
|
2025-05-17 18:22:55 +02:00 |
|
Michael Peter Christen
|
fdcd98878b
|
more benchmarks
|
2025-05-11 22:13:59 +02:00 |
|
Michael Peter Christen
|
23507ab1c6
|
more benchmarks
|
2025-05-11 14:03:17 +02:00 |
|
Michael Peter Christen
|
00702c0b56
|
more benchmarks
|
2025-05-08 21:21:05 +02:00 |
|
Michael Peter Christen
|
3b490e0f15
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-05-08 21:20:30 +02:00 |
|
Michael Peter Christen
|
4b05812443
|
more benchmarks
|
2025-05-08 21:20:27 +02:00 |
|
Michael Peter Christen
|
9d819d8370
|
ability to remotely pull and delete models
|
2025-05-08 21:18:38 +02:00 |
|
Michael Peter Christen
|
35e87b1050
|
added automatic loading of models if they do not exist on remote servers
|
2025-05-06 23:22:29 +02:00 |
|
Michael Peter Christen
|
ce9a189d5f
|
concurrency for inference on remote server
|
2025-05-04 21:42:15 +02:00 |
|
Michael Peter Christen
|
445ac928da
|
fixed conflicts
|
2025-05-04 13:27:10 +02:00 |
|
Michael Peter Christen
|
6733a1d0f9
|
more benchmarks
|
2025-05-04 12:12:51 +02:00 |
|
Michael Peter Christen
|
43c0f9bc28
|
more benchmarks
|
2025-05-04 12:04:34 +02:00 |
|
Michael Peter Christen
|
c0faa73ef7
|
more benchmarks
|
2025-05-04 08:07:33 +02:00 |
|
Michael Peter Christen
|
2ca40d703b
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-05-01 15:37:02 +02:00 |
|
Michael Peter Christen
|
5ebfc0989a
|
more benchmarks
|
2025-05-01 15:37:02 +02:00 |
|
Michael Peter Christen
|
69877de44f
|
enhanced code extraction for thinking models and recalculated some benchmarks
|
2025-05-01 12:38:17 +02:00 |
|
Michael Peter Christen
|
811343a570
|
fixed conflicts
|
2025-05-01 11:44:52 +02:00 |
|
Michael Peter Christen
|
a7c9a5619e
|
more benchmarks
|
2025-05-01 11:42:38 +02:00 |
|
Michael Peter Christen
|
581125ec46
|
more benchmarks
|
2025-04-30 05:57:15 +02:00 |
|
Michael Peter Christen
|
88ac851380
|
refactoring for parallel processing
|
2025-04-29 21:33:41 +02:00 |
|
Michael Peter Christen
|
5e61e1667a
|
more benchmarks
|
2025-04-27 10:41:27 +02:00 |
|
Michael Peter Christen
|
c8610ff401
|
fixed model_dict
|
2025-04-17 23:33:42 +02:00 |
|
Michael Peter Christen
|
2537be1209
|
added option to re-calculate failed inferences again
|
2025-04-17 22:41:37 +02:00 |
|
Michael Peter Christen
|
968ced1a5e
|
more benchmarks, changes for o4-mini, re-inference for failed results
|
2025-04-17 20:02:19 +02:00 |
|
Michael Peter Christen
|
2b2911304e
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-04-17 09:36:27 +02:00 |
|
Michael Peter Christen
|
acb2a506ea
|
added more benchmarks
|
2025-04-17 09:36:23 +02:00 |
|
Michael Peter Christen
|
45b8f3c532
|
added more benchmarks, new best model GPT-4.1
|
2025-04-16 21:57:19 +02:00 |
|
Michael Peter Christen
|
8fd0e47897
|
more benchmarks
|
2025-04-12 17:15:27 +02:00 |
|
Michael Peter Christen
|
dc9c928475
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-04-04 15:16:37 +02:00 |
|
Michael Peter Christen
|
ed32bffa82
|
more benchmarks
|
2025-04-04 15:16:34 +02:00 |
|
Michael Peter Christen
|
b530866e6f
|
more benchmarks
|
2025-03-29 22:41:16 +01:00 |
|
Michael Peter Christen
|
3af0da99ed
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-03-28 20:10:30 +01:00 |
|
Michael Peter Christen
|
b3d37e2044
|
some refactoring
|
2025-03-28 20:10:26 +01:00 |
|
Michael Peter Christen
|
736a81d8f3
|
merged conflicts
|
2025-03-28 19:59:07 +01:00 |
|
Michael Peter Christen
|
fc53f299bb
|
updated benchmark
|
2025-03-28 19:57:39 +01:00 |
|
Michael Peter Christen
|
dcde2ee6bf
|
updated README
|
2025-03-27 21:33:54 +01:00 |
|
Michael Peter Christen
|
1e60a37aeb
|
more benchmarks
|
2025-03-27 15:14:44 +01:00 |
|
Michael Peter Christen
|
513af8a8ce
|
fixes in code extraction and inference, added multimodality (stub)
|
2025-03-27 13:48:09 +01:00 |
|
Michael Peter Christen
|
e5a8d76098
|
more benchmarks
|
2025-03-27 00:10:22 +01:00 |
|
Michael Peter Christen
|
9055212f22
|
Merge branch 'main' of https://github.com/Orbiter/project-euler-llm-benchmark
|
2025-03-27 00:09:53 +01:00 |
|