cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
Transformers
Safetensors
English
nemotron_h
feature-extraction
nvidia
nemotron
nvfp4
quantized
blackwell
sm121
dgx-spark
elk-ai
vllm
cuda13
fp4
awq
mamba
Mixture of Experts
conversational
custom_code
8-bit precision
License: nvidia-open-model-license
Branch: main
Total size: 18.8 GB
1 contributor
History: 6 commits
Latest commit: cybermotaz, "Fix: base_model to nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16" (8665d96, verified), 8 days ago
File                              Size       Storage  Updated     Last commit message
.gitattributes                    1.57 kB    -        8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
README.md                         12.7 kB    -        8 days ago  Fix: base_model to nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
config.json                       3.73 kB    -        8 days ago  Fix: Replace Infinity with 1e308 for valid JSON
generation_config.json            171 Bytes  -        8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
hf_quant_config.json              1.52 kB    -        8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
model-00001-of-00004.safetensors  5 GB       xet      8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
model-00002-of-00004.safetensors  5 GB       xet      8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
model-00003-of-00004.safetensors  5 GB       xet      8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
model-00004-of-00004.safetensors  3.81 GB    xet      8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
model.safetensors.index.json      2.41 MB    -        8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
special_tokens_map.json           449 Bytes  -        8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
tokenizer.json                    17.1 MB    xet      8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
tokenizer_config.json             188 kB     -        8 days ago  NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster
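
The commit message on config.json ("Fix: Replace Infinity with 1e308 for valid JSON") points at a common serialization pitfall: the token `Infinity` is not part of the JSON grammar, so strict parsers reject it. The sketch below illustrates the general issue with Python's standard `json` module. It is not the author's actual fix; the key name `example_limit` and the helper `replace_non_finite` are hypothetical and do not come from the repository's config.json.

```python
import json
import math

# Hypothetical config fragment; the key name is illustrative only.
config = {"example_limit": [0.0, math.inf]}

# By default json.dumps emits the non-standard token Infinity.
lenient = json.dumps(config)

# allow_nan=False makes the serializer reject non-finite floats instead.
try:
    json.dumps(config, allow_nan=False)
    strict_ok = True
except ValueError:
    strict_ok = False


def replace_non_finite(value, substitute=1e308):
    """Recursively replace +/-inf with +/-substitute; NaN is left untouched."""
    if isinstance(value, float) and math.isinf(value):
        return math.copysign(substitute, value)
    if isinstance(value, list):
        return [replace_non_finite(v, substitute) for v in value]
    if isinstance(value, dict):
        return {k: replace_non_finite(v, substitute) for k, v in value.items()}
    return value


# After substitution the config serializes under strict settings.
sanitized = json.dumps(replace_non_finite(config), allow_nan=False)
```

Substituting a very large finite value keeps the file parseable by strict JSON readers, at the cost of no longer representing a true unbounded limit; whether that matters depends on how the consuming code uses the field.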