Qwen3.6-35B-REAP-Pruned-ratio-0.5-NVFP4 — 133.3 tok/s on NVIDIA GeForce RTX 5070 Ti · 16 GB

Models Leaderboard Evals Train Rentals API Docs

Total runs

Highest

Median

Lowest

Total runs

Highest

Median

Lowest

Benchmark Results

Submit benchmark

#	Hardware	Engine · Quant	Ctx	tok/s out	prefill	tok/s total	TTFT ms	VRAM GB

Qwen3.6-35B-REAP-Pruned-ratio-0.5-NVFP4 — 133.3 tok/s on NVIDIA GeForce RTX 5070 Ti · 16 GB