cybermotaz
/

nemotron3-nano-nvfp4-w4a16

Model card Files Files and versions

nemotron3-nano-nvfp4-w4a16

18.8 GB

1 contributor

History: 6 commits

cybermotaz's picture

Fix: base_model to nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

8665d96 verified 8 days ago

.gitattributes
1.57 kB

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
README.md
12.7 kB

Fix: base_model to nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 8 days ago
config.json
3.73 kB

Fix: Replace Infinity with 1e308 for valid JSON 8 days ago
generation_config.json
171 Bytes

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
hf_quant_config.json
1.52 kB

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
model-00001-of-00004.safetensors
5 GB
xet

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
model-00002-of-00004.safetensors
5 GB
xet

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
model-00003-of-00004.safetensors
5 GB
xet

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
model-00004-of-00004.safetensors
3.81 GB
xet

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
model.safetensors.index.json
2.41 MB

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
special_tokens_map.json
449 Bytes

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
tokenizer.json
17.1 MB
xet

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago
tokenizer_config.json
188 kB

NVFP4 W4A16 quantized by Mutaz Al Awamleh | ELK-AI | 14.5x faster 8 days ago