Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

172

Full-text search

Active filters: on-device

TitleOS/Lightning-1.7B-LoRA

Text Generation • Updated Dec 11, 2025

TitleOS/Lightning-1.7B

Text Generation • 2B • Updated Dec 11, 2025 • 7 • 4

TitleOS/Lightning-1.7B-Q4_K_M-GGUF

Text Generation • 2B • Updated Dec 11, 2025 • 66

TitleOS/Lightning-1.7B-Q8_0-GGUF

Text Generation • 2B • Updated Dec 11, 2025 • 4 • 1

bartowski/TitleOS_Lightning-1.7B-GGUF

Text Generation • 2B • Updated Dec 11, 2025 • 335 • 2

alexgusevski/Lightning-1.7B-q2-mlx

Text Generation • 0.2B • Updated Dec 11, 2025 • 2

alexgusevski/Lightning-1.7B-q3-mlx

Text Generation • 0.2B • Updated Dec 11, 2025 • 2

alexgusevski/Lightning-1.7B-q4-mlx

Text Generation • 0.3B • Updated Dec 11, 2025 • 3

alexgusevski/Lightning-1.7B-q6-mlx

Text Generation • 0.4B • Updated Dec 11, 2025 • 2

alexgusevski/Lightning-1.7B-q8-mlx

Text Generation • 0.5B • Updated Dec 11, 2025 • 2

alexgusevski/Lightning-1.7B-mlx

Text Generation • 2B • Updated Dec 11, 2025 • 9 • 1

gocharlie-ai/charlie-micro-ov

Text Generation • Updated Dec 18, 2025 • 1

Yagna1/functiongemma-270m-mobile-actions

Text Generation • 0.3B • Updated Dec 21, 2025 • 8

Thorge-AI/functiongemma-270m-it-mobile-actions.litertlm

Updated Dec 21, 2025 • 3

blackcloud1199/Llama-3.2-1B-Executorch-SpinQuant

Updated Dec 26, 2025

blackcloud1199/Llama-3.2-3B-Executorch-Q8DA4W

Updated Dec 26, 2025

XXXXyu/Falcon3-1B-Instruct-1.58bit-vlut-gguf

Text Generation • 2B • Updated Jan 1 • 43

XXXXyu/bitnet_b1_58-3B-vlut-gguf

Text Generation • 3B • Updated Jan 1 • 58

bisonnetworking/MediPhi-Instruct-mlx-4bit

Text Generation • 0.6B • Updated Dec 30, 2025 • 28

XXXXyu/Llama3-8B-1.58-100B-tokens-vlut-gguf

Text Generation • 8B • Updated Jan 1 • 98

cstr/nllb-200-coreml-128

Translation • Updated Dec 29, 2025 • 2

cstr/nllb-200-coreml-256

Translation • Updated Dec 29, 2025 • 12

uralstech/Qwen-2.5-1.5B-KCC-LiteRT-LM

Text Generation • Updated Dec 30, 2025 • 11

blackcloud1199/SmolLM2-1.7B-Executorch-Q8DA4W

Updated Dec 31, 2025

blackcloud1199/Qwen2.5-1.5B-Executorch-Q8DA4W

Updated Dec 31, 2025

ndlanier/gutsignal-food-parser-tinyllama-1.1b

1B • Updated Jan 6 • 13

ndlanier/gutsignal-food-parser-llama-3.2-1b

1B • Updated Jan 6 • 6

ndlanier/gutsignal-food-parser-llama-3.2-3b

3B • Updated Jan 6 • 1

vijayk-huggingface/orchestrix-actions

Text Generation • Updated Jan 8

Irfanuruchi/Qwen2.5-1.5B-Instruct-MLX-8bit

Text Generation • 0.4B • Updated about 1 month ago • 22