A simple greeting still took over two minutes.
This page is written in the same finished-report style as the other telemetry summaries, but the run did not complete. The computer started freezing under sustained 14B load, the suite was aborted during the 20Q stage, and the results below are the recovered artifacts from the completed and partial stages.
A simple greeting still took over two minutes.
Average latency 1990.4s and 0.07 tok/s.
Recovered from raw JSON after 5702.0s of runtime.
The runner never reached the final 10Q stage.
| Stage | Status | Progress | Elapsed | Notes |
|---|---|---|---|---|
| Hello check | Completed | 1/1 | 138.3s | Basic model smoke test returned a valid greeting. |
| 5Q small eval | Completed | 5/5 | 1990.4s | 3/5 strict passes. 1 timeout. |
| 20Q Python suite | Partial | 4/20 | 5702.0s | Recovered from per-question JSON after the runner was aborted. |
| 10Q Python suite | Not started | 0/10 | n/a | The run never reached the 10Q stage before the abort. |
The 5Q stage did finish, but only 3 of 5 questions met the strict pass rules. One question timed out at the full 3600-second ceiling, which is the clearest sign that the 14B lane is unusable on this host for normal iteration.
| Question | Category | Duration | Throughput | Marker hit rate | Format OK | Outcome |
|---|---|---|---|---|---|---|
| Disk Guard Script | shell | 1193.9s | 0.09 tok/s | 0.75 | no | usable |
| IPv4 Validator | python | 2138.0s | 0.09 tok/s | 1.00 | yes | usable |
| Nginx Safe Reload | ops | 495.5s | 0.09 tok/s | 0.75 | yes | usable |
| YAML Validator Plan | planning | 3600.0s | n/a | 0.00 | no | timed out |
| SSH Lockout Triage | debugging | 2524.7s | 0.10 tok/s | 1.00 | yes | usable |
The top-level 20Q suite summary never finalized, so this section is reconstructed from the per-question primary and follow-up JSON files that were already on disk when the run was aborted.
Average primary duration across the four recovered questions.
Estimated from eval token count over eval duration.
Average follow-up duration across the four recovered questions.
Estimated from eval token count over eval duration.
| Question | Category | Primary duration | Primary throughput | Follow-up duration | Follow-up throughput |
|---|---|---|---|---|---|
| CSV Parser | parsing | 418.2s | 0.15 tok/s | 1042.3s | 0.14 tok/s |
| File Scanner | file_io | 665.9s | 0.14 tok/s | 840.0s | 0.14 tok/s |
| CLI Arguments | cli | 754.4s | 0.14 tok/s | 955.9s | 0.14 tok/s |
| Typed Dataclass | typing | 349.9s | 0.14 tok/s | 674.4s | 0.13 tok/s |
qwen2.5-coder:14b, but the response times are too slow for practical use.[archive-source][archive-source][archive-source][archive-source]Hello response preview: Of course! How may I assist you today?
Source host: