LocalMaxxing
Total runs
Highest
Median
Lowest
sroecker
Qwen3.6-35B-REAP-Pruned-ratio-0.5-NVFP4
Benchmark Results
Columns: #, Hardware, Engine · Quant, Ctx, tok/s out, prefill, tok/s total, TTFT ms, VRAM GB
Qwen3.6-35B-REAP-Pruned-ratio-0.5-NVFP4 — 133.3 tok/s on NVIDIA GeForce RTX 5070 Ti · 16 GB