vps50-cpu-matrix-1 — gemma/phi/qwen2.5/qwen3 on vps50
2 calls across 5 cells
Methodology
See A3B_AND_CPU_OVERNIGHT_2026-05-05 for the full procedure.
Reproducible at git SHA ddbaaf46.
Results
| Cell | tok/s mean | tok/s p50 | tok/s p95 | duration p50 | calls |
|---|---|---|---|---|---|
| phi-4 | — | — | — | — | 0 |
| gemma-4-26b-a4b | — | — | — | — | 0 |
| qwen3-30b-a3b | — | — | — | — | 0 |
| qwen2.5-72b | — | — | — | — | 0 |
| gemma-4-26b-a4b-it-q4km-cpu-ctx32k | — | — | — | — | 2 |
Tokens per second (mean, p50, p95): no data captured.
Raw data
Every run gets its JSONL, log, summary, and metadata published. Clone the archive; re-run it; tell us where we got it wrong.
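As a starting point for re-checking the summary columns, here is a minimal sketch of how per-cell call counts and tok/s statistics could be recomputed from a run's JSONL. The field names `cell` and `tokens_per_second` are hypothetical, since the actual schema is not described here, and the nearest-rank percentile is an assumption rather than the method the published summary necessarily uses.

```python
import json
import math
from collections import defaultdict
from statistics import mean


def percentile(values, pct):
    """Nearest-rank percentile of a non-empty list (assumed method)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(jsonl_lines, cell_key="cell", rate_key="tokens_per_second"):
    """Group records by cell and compute calls, mean, p50, and p95.

    cell_key and rate_key are hypothetical field names; adjust them to
    match the real JSONL schema.
    """
    calls = defaultdict(int)
    rates = defaultdict(list)
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        cell = record[cell_key]
        calls[cell] += 1
        rate = record.get(rate_key)
        # A call can be logged without a throughput value; count it anyway.
        if isinstance(rate, (int, float)) and not isinstance(rate, bool):
            rates[cell].append(float(rate))

    summary = {}
    for cell, n in calls.items():
        vals = rates[cell]
        summary[cell] = {
            "calls": n,
            "mean": mean(vals) if vals else None,
            "p50": percentile(vals, 50) if vals else None,
            "p95": percentile(vals, 95) if vals else None,
        }
    return summary
```

A cell whose calls carry no throughput value comes back with `None` statistics and a non-zero call count, which is the shape of a table row that shows calls but only dashes for tok/s.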
Cite
Margetic, S. et al. (2026). Public benchmarks of the Weeyuga cluster. benchmarks.weeyuga.com/benchmarks/91751afd.html. Run id: 91751afd-068a-477b-8f40-6e1963f803f1. SHA ddbaaf46.