
Models

58 model groups · 195 total

Qwen3.6-27B

Qwen / Qwen3.6-27B

28B · Qwen · image-text-to-text · 180 benchmarks total
transformers · safetensors · qwen3_5 · image-text-to-text
Best 140 tok/s · Median 40.9 tok/s · Min 2.3 tok/s
View benchmarks →

Qwen3.6-35B-A3B

Qwen / Qwen3.6-35B-A3B

36B · Qwen · image-text-to-text · 98 benchmarks total
transformers · safetensors · qwen3_5_moe · image-text-to-text
Best 190 tok/s · Median 69.5 tok/s · Min 6.4 tok/s
View benchmarks →

Qwen3.5-27B

Qwen / Qwen3.5-27B

28B · Qwen · image-text-to-text · 55 benchmarks total
transformers · safetensors · qwen3_5 · image-text-to-text
Best 250 tok/s · Median 12.6 tok/s · Min 2.2 tok/s
View benchmarks →

Qwen3.5-9B-Base

Qwen / Qwen3.5-9B-Base

10B · Qwen · image-text-to-text · 35 benchmarks total
transformers · safetensors · qwen3_5 · image-text-to-text
View benchmarks →

MiniMax-M2.7

MiniMaxAI / MiniMax-M2.7

229B · Minimax · text-generation · 31 benchmarks total
transformers · safetensors · minimax_m2 · text-generation
Best 496 tok/s · Median 20.0 tok/s · Min 0.5 tok/s
View benchmarks →

Qwen3.5-35B-A3B-Base

Qwen / Qwen3.5-35B-A3B-Base

36B · Qwen · image-text-to-text · 31 benchmarks total
transformers · safetensors · qwen3_5_moe · image-text-to-text
View benchmarks →

gemma-4-26B-A4B

google / gemma-4-26B-A4B

27B · Gemma · image-text-to-text · 23 benchmarks total
transformers · safetensors · gemma4 · image-text-to-text
View benchmarks →

Qwen3.5-122B-A10B

Qwen / Qwen3.5-122B-A10B

125B · Qwen · image-text-to-text · 20 benchmarks total
transformers · safetensors · qwen3_5_moe · image-text-to-text
Best 27.3 tok/s · Median 26.8 tok/s · Min 3.2 tok/s
View benchmarks →

gemma-4-31B

google / gemma-4-31B

33B · Gemma · image-text-to-text · 15 benchmarks total
transformers · safetensors · gemma4 · image-text-to-text
View benchmarks →

Qwen3-Coder-30B-A3B-Instruct

Qwen / Qwen3-Coder-30B-A3B-Instruct

31B · Qwen · text-generation · 15 benchmarks total
transformers · safetensors · qwen3_moe · text-generation
Best 100 tok/s · Median 80.5 tok/s · Min 79.8 tok/s
View benchmarks →

Qwen3-Coder-Next

Qwen / Qwen3-Coder-Next

80B · Qwen · text-generation · 13 benchmarks total
transformers · safetensors · qwen3_next · text-generation
Best 80.8 tok/s · Median 55.7 tok/s · Min 51.2 tok/s
View benchmarks →

Qwen3.5-4B-Base

Qwen / Qwen3.5-4B-Base

5B · Qwen · image-text-to-text · 12 benchmarks total
transformers · safetensors · qwen3_5 · image-text-to-text
View benchmarks →

Qwen3.6-27B-DFlash

z-lab / Qwen3.6-27B-DFlash

2B · Qwen · text-generation · 10 benchmarks total
transformers · safetensors · qwen3 · feature-extraction
Best 215 tok/s · Median 39.2 tok/s · Min 26.9 tok/s
View benchmarks →

Ling-2.6-flash

inclusionAI / Ling-2.6-flash

107B · text-generation · 8 benchmarks total
safetensors · bailing_hybrid · text-generation · conversational
Best 94.9 tok/s · Median 86.2 tok/s · Min 82.3 tok/s
View benchmarks →

gemma-4-E4B

google / gemma-4-E4B

8B · Gemma · any-to-any · 8 benchmarks total
transformers · safetensors · gemma4 · image-text-to-text
View benchmarks →

Llama-3.1-8B

meta-llama / Llama-3.1-8B

8B · Llama · text-generation · 8 benchmarks total
transformers · safetensors · llama · text-generation
View benchmarks →

gemma-4-E2B

google / gemma-4-E2B

5B · Gemma · any-to-any · 6 benchmarks total
transformers · safetensors · gemma4 · image-text-to-text
View benchmarks →

GLM-4.7-Flash

zai-org / GLM-4.7-Flash

31B · text-generation · 6 benchmarks total
transformers · safetensors · glm4_moe_lite · text-generation
Best 93.3 tok/s · Median 93.1 tok/s · Min 92.9 tok/s
View benchmarks →

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

nvidia / Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

33B · any-to-any · 5 benchmarks total
transformers · safetensors · NemotronH_Nano_Omni_Reasoning_V3 · feature-extraction
Best 107 tok/s · Median 102 tok/s · Min 95.8 tok/s
View benchmarks →

Nemotron-Cascade-2-30B-A3B

nvidia / Nemotron-Cascade-2-30B-A3B

32B · text-generation · 5 benchmarks total
transformers · safetensors · nemotron_h · text-generation
Best 141 tok/s · Median 95.4 tok/s · Min 89.8 tok/s
View benchmarks →

NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

nvidia / NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

67B · Opt · text-generation · 5 benchmarks total
transformers · safetensors · nemotron_h · text-generation
Best 262 tok/s · Median 175 tok/s · Min 49.6 tok/s
View benchmarks →

GLM-5.1

zai-org / GLM-5.1

754B · text-generation · 4 benchmarks total
transformers · safetensors · glm_moe_dsa · text-generation
View benchmarks →

Mistral-Medium-3.5-128B

mistralai / Mistral-Medium-3.5-128B

128B · Mistral · 4 benchmarks total
safetensors · mistral3 · vLLM · en
Best 7.4 tok/s · Median 6.7 tok/s · Min 6.2 tok/s
View benchmarks →

gpt-oss-20b

openai / gpt-oss-20b

22B · Gpt · text-generation · 4 benchmarks total
transformers · safetensors · gpt_oss · text-generation
Best 991 tok/s · Median 160 tok/s · Min 48.3 tok/s
View benchmarks →

gpt-oss-120b

openai / gpt-oss-120b

120B · Gpt · text-generation · 4 benchmarks total
transformers · safetensors · gpt_oss · text-generation
Best 71.5 tok/s · Median 70.4 tok/s · Min 62.7 tok/s
View benchmarks →

Qwen2.5-72B

Qwen / Qwen2.5-72B

73B · Qwen · text-generation · 4 benchmarks total
transformers · safetensors · qwen2 · text-generation
View benchmarks →

Qwen3-8B-Base

Qwen / Qwen3-8B-Base

8B · Qwen · text-generation · 4 benchmarks total
transformers · safetensors · qwen3 · text-generation
View benchmarks →

Gemopus-4-26B-A4B-it

Jackrong / Gemopus-4-26B-A4B-it

27B · Gemma · text-generation · 4 benchmarks total
safetensors · gemma4 · gemma · instruction-tuned
Best 64.3 tok/s · Median 55.0 tok/s · Min 45.7 tok/s
View benchmarks →

NVIDIA-Nemotron-3-Super-120B-A12B-BF16

nvidia / NVIDIA-Nemotron-3-Super-120B-A12B-BF16

124B · text-generation · 3 benchmarks total
transformers · safetensors · nemotron_h · text-generation
View benchmarks →

Qwen3-32B

Qwen / Qwen3-32B

32B · Qwen · text-generation · 3 benchmarks total
transformers · safetensors · qwen3 · text-generation
Best 79.3 tok/s · Median 22.9 tok/s · Min 22.8 tok/s
View benchmarks →

MiniMax-M2.5

MiniMaxAI / MiniMax-M2.5

229B · Minimax · text-generation · 3 benchmarks total
transformers · safetensors · minimax_m2 · text-generation
Best 504 tok/s · Median 419 tok/s · Min 334 tok/s
View benchmarks →

Qwen3.5-0.8B-Base

Qwen / Qwen3.5-0.8B-Base

1B · Qwen · image-text-to-text · 3 benchmarks total
transformers · safetensors · qwen3_5 · image-text-to-text
Best 2.7k tok/s · Median 2.7k tok/s · Min 2.7k tok/s
View benchmarks →

Qwen2.5-7B

Qwen / Qwen2.5-7B

8B · Qwen · text-generation · 3 benchmarks total
transformers · safetensors · qwen2 · text-generation
View benchmarks →

Kimi-K2.5

moonshotai / Kimi-K2.5

1.1T · image-text-to-text · 2 benchmarks total
transformers · safetensors · kimi_k25 · feature-extraction
Best 74.0 tok/s · Median 74.0 tok/s · Min 74.0 tok/s
View benchmarks →

granite-4.1-30b

ibm-granite / granite-4.1-30b

29B · text-generation · 2 benchmarks total
transformers · safetensors · granite · text-generation
Best 17.9 tok/s · Median 17.6 tok/s · Min 17.3 tok/s
View benchmarks →

Mistral-Small-3.1-24B-Base-2503

mistralai / Mistral-Small-3.1-24B-Base-2503

24B · Mistral · 2 benchmarks total
vllm · safetensors · mistral3 · mistral-common
View benchmarks →

MiniMax-M2.1

MiniMaxAI / MiniMax-M2.1

229B · Minimax · text-generation · 2 benchmarks total
transformers · safetensors · minimax_m2 · text-generation
Best 499 tok/s · Median 416 tok/s · Min 333 tok/s
View benchmarks →

MiniMax-M2

MiniMaxAI / MiniMax-M2

229B · Minimax · text-generation · 2 benchmarks total
transformers · safetensors · minimax_m2 · text-generation
Best 493 tok/s · Median 398 tok/s · Min 303 tok/s
View benchmarks →

Qwen3-VL-30B-A3B-Instruct

Qwen / Qwen3-VL-30B-A3B-Instruct

30B · Qwen · image-text-to-text · 2 benchmarks total
transformers · safetensors · qwen3_vl_moe · image-text-to-text
Best 56.6 tok/s · Median 52.2 tok/s · Min 47.7 tok/s
View benchmarks →

Ministral-3-3B-Base-2512

mistralai / Ministral-3-3B-Base-2512

4B · Mistral · 2 benchmarks total
vllm · safetensors · mistral3 · mistral-common
View benchmarks →

Llama-3.1-70B

meta-llama / Llama-3.1-70B

71B · Llama · text-generation · 2 benchmarks total
transformers · safetensors · llama · text-generation
View benchmarks →

Qwen3.5-122B-A10B-GPTQ-Int4

Qwen / Qwen3.5-122B-A10B-GPTQ-Int4

125B · Qwen · image-text-to-text · 1 benchmark total
transformers · safetensors · qwen3_5_moe · image-text-to-text
Best 49.1 tok/s · Median 49.1 tok/s · Min 49.1 tok/s
View benchmarks →

Llama-2-7b

meta-llama / Llama-2-7b

7B · Llama · text-generation · 1 benchmark total
facebook · meta · pytorch · llama
Best 110 tok/s · Median 110 tok/s · Min 110 tok/s
View benchmarks →

Qwen2.5-32B

Qwen / Qwen2.5-32B

33B · Qwen · text-generation · 1 benchmark total
safetensors · qwen2 · text-generation · conversational
View benchmarks →

Qwen3-VL-8B-Instruct

Qwen / Qwen3-VL-8B-Instruct

9B · Qwen · image-text-to-text · 1 benchmark total
transformers · safetensors · qwen3_vl · image-text-to-text
Best 95.9 tok/s · Median 95.9 tok/s · Min 95.9 tok/s
View benchmarks →

Llama-3.2-3B-Instruct

meta-llama / Llama-3.2-3B-Instruct

3B · Llama · text-generation · 1 benchmark total
transformers · safetensors · llama · text-generation
Best 79.9 tok/s · Median 79.9 tok/s · Min 79.9 tok/s
View benchmarks →

Qwen3-30B-A3B-Base

Qwen / Qwen3-30B-A3B-Base

31B · Qwen · text-generation · 1 benchmark total
transformers · safetensors · qwen3_moe · text-generation
View benchmarks →

Ternary-Bonsai-8B-unpacked

prism-ml / Ternary-Bonsai-8B-unpacked

8B · Qwen · 1 benchmark total
safetensors · qwen3 · prismml · bonsai
View benchmarks →

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

nvidia / NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

32B · text-generation · 1 benchmark total
transformers · safetensors · nemotron_h · text-generation
Best 286 tok/s · Median 286 tok/s · Min 286 tok/s
View benchmarks →

Qwen3.5-35B-A3B-4bit

mlx-community / Qwen3.5-35B-A3B-4bit

6B · Qwen · image-text-to-text · 1 benchmark total
transformers · safetensors · qwen3_5_moe · image-text-to-text
Best 105 tok/s · Median 105 tok/s · Min 105 tok/s
View benchmarks →

gemma-3-4b-pt

google / gemma-3-4b-pt

4B · Gemma · image-text-to-text · 1 benchmark total
transformers · safetensors · gemma3 · image-text-to-text
View benchmarks →

GLM-5

zai-org / GLM-5

754B · text-generation · 1 benchmark total
transformers · safetensors · glm_moe_dsa · text-generation
View benchmarks →

DeepSeek-V4-Flash-2bit-DQ

mlx-community / DeepSeek-V4-Flash-2bit-DQ

284B · Deepseek · text-generation · 1 benchmark total
mlx · safetensors · deepseek_v4 · text-generation
Best 17.0 tok/s · Median 17.0 tok/s · Min 17.0 tok/s
View benchmarks →

Qwen3-VL-2B-Instruct

Qwen / Qwen3-VL-2B-Instruct

2B · Qwen · image-text-to-text · 1 benchmark total
transformers · safetensors · qwen3_vl · image-text-to-text
Best 27.9 tok/s · Median 27.9 tok/s · Min 27.9 tok/s
View benchmarks →

Qwen3-30B-A3B-Instruct-2507

Qwen / Qwen3-30B-A3B-Instruct-2507

30B · Qwen · text-generation · 1 benchmark total
transformers · safetensors · qwen3_moe · text-generation
View benchmarks →

Gemopus-4-26B-A4B-it-GGUF

Jackrong / Gemopus-4-26B-A4B-it-GGUF

26B · Gemma · text-generation · 1 benchmark total
gguf · gemma4 · gemma · instruction-tuned
Best94.5 tok/s
Median94.5 tok/s
Min94.5 tok/s
View benchmarks →

LFM2-24B-A2B

LiquidAI / LFM2-24B-A2B

24B · text-generation
transformers · safetensors · lfm2_moe · text-generation
View benchmarks →

LFM2-24B-A2B-GGUF

lmstudio-community / LFM2-24B-A2B-GGUF

24B
gguf · endpoints_compatible · region:us · conversational
View benchmarks →