Qwen2.5 Coder 1.5B same-model 3-up
Combined system throughput 32.251 tok/s, primary system throughput 17.973 tok/s, wall time 155.563s.
- Parallel speedup: 2.441
- Wall savings: 59.0%
- Usable primary/follow-up: 20/20
This run selected the historically fastest local-Mac question types, repeated them into one focused packet, and then swept concurrency upward until throughput clearly stopped improving.
Peak combined system throughput: 32.251 tok/s at 3-up. Practical plateau start: 3-up.
| Question | Category | Avg primary TPS | Avg follow-up TPS | Avg combined TPS |
|---|---|---|---|---|
| py_csv_parse | parsing | 31.89 | 26.99 | 58.88 |
| py_cli_args | cli | 25.82 | 26.65 | 52.47 |
| py_pydantic_model | validation | 25.78 | 25.85 | 51.62 |
| py_typing_dataclass | typing | 26.29 | 23.55 | 49.84 |
| py_file_scan | file_io | 23.82 | 25.67 | 49.49 |
| Concurrency | Combined system TPS | Primary system TPS | Parallel speedup | Total wall seconds |
|---|---|---|---|---|
| 3 | 32.251 | 17.973 | 2.441 | 155.563 |
| 4 | 29.788 | 17.157 | 2.468 | 164.427 |
| 5 | 27.898 | 14.566 | 4.289 | 183.17 |
| 6 | 31.313 | 17.125 | 4.683 | 166.542 |
| 7 | 28.407 | 16.072 | 3.678 | 174.78 |
Combined system throughput 32.251 tok/s, primary system throughput 17.973 tok/s, wall time 155.563s.
Combined system throughput 29.788 tok/s, primary system throughput 17.157 tok/s, wall time 164.427s.
Combined system throughput 27.898 tok/s, primary system throughput 14.566 tok/s, wall time 183.17s.
Combined system throughput 31.313 tok/s, primary system throughput 17.125 tok/s, wall time 166.542s.
Combined system throughput 28.407 tok/s, primary system throughput 16.072 tok/s, wall time 174.78s.