Ballpark street prices (USD, checked July 2026 — the memory-price surge moved the whole market up, and it keeps drifting, so treat these as orders of magnitude), joined with this site's live median effective bandwidth where we have enough runs. Load draw is what the box pulls while actually generating — it sizes your PSU and your electricity bill per hour of inference. The two composite columns are the buying signals: $ per GB/s is capex per unit of speed, and $ per GB/s-per-watt folds power in — it penalizes hardware that needs many watts for its speed, so it favors a box that is fast, cheap, and efficient. Lower is better in both.
| Hardware | Rough price | Load draw | Median effective BW | $ per GB/s | $ per GB/s/W |
|---|
| RTX 5060 Ti | ~$450 (new, 16 GB) | ~180 W | 723 GB/s | ~$0.62 | ~$112 |
| RTX 3090 | ~$1,100 (used, 24 GB) | ~350 W | 686 GB/s | ~$1.60 | ~$561 |
| RTX 5090 | ~$3,700 (new, 32 GB) | ~500 W | 1,247 GB/s | ~$2.97 | ~$1,483 |
| RTX 4090 | ~$2,300 (used, 24 GB) | ~350 W | 668 GB/s | ~$3.44 | ~$1,205 |
| DGX Spark | ~$4,700 (new, 128 GB unified) | ~100 W | 521 GB/s | ~$9.02 | ~$902 |
| Ryzen AI Max 395 | ~$2,000 (mini-PC, 128 GB unified) | ~90 W | 195 GB/s | ~$10.24 | ~$922 |
| M2 Ultra | ~$7,500 (used Mac Studio, 192 GB) | ~180 W | — | — | — |
| M3 Max | ~$3,000 (used, 128 GB) | ~60 W | — | — | — |
Load figures are typical draw during decoding, not TDP — decode is memory-bound, so cards rarely hit their full power budget (prefill gets closer). Discrete-GPU figures are card-only; add ~50–100 W for the host system. Unified-memory boxes are whole-machine. Benchmark submissions accept measured per-GPU power draw (gpuPowerWatts) — report it and these ballparks can become community data too.
Patterns worth noticing: budget 16 GB cards and used flagships lead on capex per GB/s; once watts count, the 5060 Ti and Apple Silicon pull ahead of the big cards; large unified boxes pay a premium for capacity and efficiency, not speed. Check the marketplace for live second-hand listings from other builders, and attach purchase records to your benchmark submissions — once enough runs carry real prices, this table can switch from ballparks to community-sourced data.