Qwen2.5 Coder 0.5B same-model 1-up
Combined system throughput 57.821 tok/s, primary system throughput 42.895 tok/s, wall time 61.172s.
- Parallel speedup: 0.998
- Wall savings: -0.2%
- Usable primary/follow-up: 20/20
This run selected the historically fastest local-Mac question types, repeated them into one focused packet, and then swept concurrency upward until throughput clearly stopped improving.
Peak combined system throughput: 87.773 tok/s at 5-up. Practical plateau start: 5-up.
| Question | Category | Avg primary TPS | Avg follow-up TPS | Avg combined TPS |
|---|---|---|---|---|
| py_csv_parse | parsing | 31.89 | 26.99 | 58.88 |
| py_cli_args | cli | 25.82 | 26.65 | 52.47 |
| py_pydantic_model | validation | 25.78 | 25.85 | 51.62 |
| py_typing_dataclass | typing | 26.29 | 23.55 | 49.84 |
| py_file_scan | file_io | 23.82 | 25.67 | 49.49 |
| Concurrency | Combined system TPS | Primary system TPS | Parallel speedup | Total wall seconds |
|---|---|---|---|---|
| 1 | 57.821 | 42.895 | 0.998 | 61.172 |
| 2 | 58.34 | 42.901 | 1.515 | 60.884 |
| 3 | 79.442 | 59.135 | 2.301 | 44.762 |
| 4 | 60.341 | 45.772 | 2.581 | 60.539 |
| 5 | 87.773 | 64.922 | 3.961 | 42.143 |
| 6 | 59.497 | 43.693 | 3.383 | 58.911 |
| 7 | 72.559 | 52.321 | 5.113 | 50.056 |
Combined system throughput 57.821 tok/s, primary system throughput 42.895 tok/s, wall time 61.172s.
Combined system throughput 58.34 tok/s, primary system throughput 42.901 tok/s, wall time 60.884s.
Combined system throughput 79.442 tok/s, primary system throughput 59.135 tok/s, wall time 44.762s.
Combined system throughput 60.341 tok/s, primary system throughput 45.772 tok/s, wall time 60.539s.
Combined system throughput 87.773 tok/s, primary system throughput 64.922 tok/s, wall time 42.143s.
Combined system throughput 59.497 tok/s, primary system throughput 43.693 tok/s, wall time 58.911s.
Combined system throughput 72.559 tok/s, primary system throughput 52.321 tok/s, wall time 50.056s.