Models

146

Active filters: 2309.05516

Intel/GLM-4.6-REAP-218B-A32B-FP8-gguf-q2ks-mixed-AutoRound

218B • Updated 15 days ago • 1.57k • 9

Intel/gpt-oss-20b-int4-AutoRound

2B • Updated Aug 7 • 243 • 7

Intel/gpt-oss-20b-gguf-q4ks-AutoRound

21B • Updated Aug 8 • 835 • 11

Intel/gpt-oss-120b-gguf-q4ks-AutoRound

117B • Updated Aug 8 • 293 • 5

Intel/Magistral-Small-2509-int4-AutoRound

2B • Updated 8 days ago • 35 • 2

Intel/neural-chat-7b-v3-1-int4-inc

Text Generation • 1B • Updated Aug 26, 2024 • 9 • 2

Intel/neural-chat-7b-v3-3-int4-inc

Text Generation • 1B • Updated Aug 26, 2024 • 3 • 2

Intel/falcon-7b-int4-inc

Text Generation • 2B • Updated Aug 26, 2024 • 1

Intel/Phi-3-mini-4k-instruct-int4-inc

Updated Jul 4, 2024 • 4

Intel/Baichuan2-13B-Chat-int4-inc

Updated Jul 4, 2024 • 1

Intel/SOLAR-10.7B-Instruct-v1.0-int4-inc

Updated Jul 4, 2024 • 1

Intel/opt-1.3b-int4-inc-recipe

Updated Nov 6, 2024 • 1

Intel/Phi-3-mini-128k-instruct-int4-inc-recipe

Updated Nov 8, 2024 • 1

Intel/Qwen2-0.5B-Instuct-int4-inc

Text Generation • 0.3B • Updated Jun 6, 2024 • 1

Intel/Qwen2-1.5B-Instuct-int4-inc

Text Generation • 0.7B • Updated Jun 6, 2024 • 18 • 2

Intel/Qwen2-7B-int4-inc

Text Generation • 2B • Updated Oct 24, 2024 • 2 • 6

fbaldassarri/modello-italia-9b-autoround-w4g128-cpu

Text Generation • 2B • Updated Jun 22, 2024

fbaldassarri/modello-italia-9b-autoround-w4g128-gpu

Text Generation • 2B • Updated Jun 22, 2024

Intel/Qwen2.5-0.5B-Instruct-int4-inc

Updated Oct 10, 2024 • 1

Intel/Qwen2.5-1.5B-Instruct-int4-inc

Updated Oct 10, 2024 • 1

OPEA/Meta-Llama-3.1-70B-Instruct-int4-asym-inc

11B • Updated Apr 30 • 10 • 1

OPEA/Qwen2.5-32B-Instruct-int4-sym-mixed-inc

6B • Updated Apr 30 • 6 • 1

OPEA/Qwen2.5-14B-Instruct-int4-sym-inc

3B • Updated Apr 30 • 2

OPEA/Qwen2-VL-7B-Instruct-int4-sym-inc

3B • Updated Jun 5 • 14 • 1

OPEA/Phi-3.5-vision-instruct-int4-sym-inc

Updated Apr 30 • 2

OPEA/Qwen2.5-7B-Instruct-int4-sym-inc

2B • Updated Apr 30 • 6 • 1

OPEA/Llama-3.2-11B-Vision-Instruct-int4-sym-inc

3B • Updated Jun 5 • 17 • 2

OPEA/llava-v1.5-7b-int4-sym-inc

1B • Updated Jul 18 • 6 • 1

OPEA/cogvlm2-llama3-chat-19B-int4-sym-inc

7B • Updated Jul 18 • 6

OPEA/Qwen2.5-72B-Instruct-int4-sym-inc

12B • Updated Apr 30 • 6 • 1