Eval Suites

Community benchmark suites for evaluating local LLM quality. Submit results via the API.

No eval suites yet

Approved suites will appear here. Submit one via the API.