Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

100

Full-text search

Active filters: GPTQ

JunHowie/Qwen3-32B-GPTQ-Int8

Text Generation • 33B • Updated Sep 5, 2025 • 716 • 4

QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8

Text Generation • 31B • Updated Sep 5, 2025 • 4.28k • 8

JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4

Text Generation • 4B • Updated Sep 4, 2025 • 1.17k • 2

DanielAWrightGabrielAI/pygmalion-7b-4bit-128g-cuda-2048Token

Text Generation • Updated May 18, 2023 • 56 • 15

mlabonne/gpt2-GPTQ-4bit

Text Generation • Updated Jul 8, 2023 • 6 • 1

CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA

Text Generation • Updated Jul 20, 2023 • 2

daedalus314/Griffin-3B-GPTQ

Text Generation • 3B • Updated Sep 8, 2023 • 3

Sanrove/gpt2-GPTQ-4b

Text Generation • Updated Sep 24, 2023

daedalus314/Marx-3B-V2-GPTQ

Text Generation • Updated Oct 12, 2023 • 2

TKDKid1000/pythia-2.8b-deduped-GPTQ

Text Generation • Updated Oct 25, 2023 • 2

Trelis/Yi-34B-200K-Llamafied-chat-SFT-function-calling-v2-GPTQ

Text Generation • Updated Nov 20, 2023

Inferless/deciLM-7B-GPTQ

Text Generation • Updated Jan 4, 2024 • 1 • 1

Inferless/SOLAR-10.7B-Instruct-v1.0-GPTQ

Text Generation • Updated Jan 4, 2024 • 5 • 2

Inferless/Mixtral-8x7B-v0.1-int8-GPTQ

Text Generation • Updated Jan 25, 2024 • 5 • 2

Masterjp123/SnowyRP-FinalV1-L2-13B-GPTQ

Text Generation • Updated Apr 4, 2024 • 1 • 4

bigquant/Senku-70B-GPTQ-4bit

Text Generation • Updated Feb 26, 2024 • 3 • 1

twhoool02/Llama-2-7b-hf-AutoGPTQ

Text Generation • 7B • Updated Apr 3, 2024 • 1

Dmitriy007/rugpt2_gen_news-gptq-4bit

Text Generation • 0.1B • Updated Feb 28, 2024 • 2

SwastikM/Llama-2-7B-Chat-text2code

Text Generation • Updated May 19, 2024 • 4 • 4

adriabama06/Llama-3.2-1B-Instruct-GPTQ-8bit-128g

Text Generation • 1B • Updated Jan 2, 2025 • 1

NightForger/saiga_nemo_12b-GPTQ

Text Generation • Updated Oct 28, 2025 • 1

NaomiBTW/L3-8B-Lunaris-v1-GPTQ

Text Generation • Updated Nov 11, 2024

GusPuffy/Llama-3.1-70B-ArliAI-RPMax-v1.3-GPTQ

11B • Updated Jul 19, 2025 • 1

iSolver-AI/test123-quantized.w4a16

Image-Text-to-Text • Updated Aug 1, 2025 • 3

AXERA-TECH/DeepSeek-R1-Distill-Qwen-1.5B-GPTQ-Int4

Updated Feb 19, 2025 • 4 • 1

AXERA-TECH/DeepSeek-R1-Distill-Qwen-7B-GPTQ-Int4

Updated Feb 17, 2025 • 7 • 1

AXERA-TECH/Qwen2.5-1.5B-Instruct-GPTQ-Int4

Text Generation • Updated Apr 1, 2025 • 26

AXERA-TECH/Qwen2.5-3B-Instruct-GPTQ-Int4

Updated Apr 22, 2025 • 2

AXERA-TECH/Qwen2.5-0.5B-Instruct-GPTQ-Int4

Text Generation • Updated Sep 28, 2025 • 3

AXERA-TECH/Qwen2.5-7B-Instruct-GPTQ-Int4

Updated Feb 17, 2025 • 1