ModelsLeaderboardEvalsTrainRentalsAPI Docs
Language
Total runs
Highest
Median
Lowest
Qwen3.5-2B-Base (2B) — LocalMaxxing