Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

34

Full-text search

Active filters: Int4

AXERA-TECH/Qwen3-VL-2B-Instruct-GPTQ-Int4-P1536-CTX2047

Image-Text-to-Text • Updated Dec 25, 2025

AXERA-TECH/Qwen3-VL-2B-Instruct-GPTQ-Int4-C256-P3584-CTX4095

Image-Text-to-Text • Updated Jan 5 • 2

AXERA-TECH/HY-MT1.5-1.8B_GPTQ_INT4

Translation • Updated 15 days ago • 31 • 1

QuantTrio/Kimi-K2.5-E304

Image-Text-to-Text • 138B • Updated 20 days ago • 5.37k • 1