Update README.md
#26 opened about 1 month ago
by
cherry0328
Can Llama-3.1- Nemotron-40B-Instruct be released as well?
👍 1
#24 opened about 1 year ago
by
tdh111
What is the context size this model was trained on?
2
#23 opened about 1 year ago
by
treehugg3
Modified llama.cpp to generate GGUFs for Llama-3_1-Nemotron-51
❤️ 🔥 2
#22 opened about 1 year ago
by
ymcki
Documentation about the linear attention used in some layers of this model?
#21 opened over 1 year ago
by
ymcki
Comparison to the 70B model?
🚀 1
1
#20 opened over 1 year ago
by
AIGUYCONTENT
Update README.md
#11 opened over 1 year ago
by
Vlad748283847
vLLM compatible?
👍 5
3
#10 opened over 1 year ago
by
nickandbro
AttributeError: 'DeciLMConfig'
3
#9 opened over 1 year ago
by
bluenevus
fp8 / int8 inference - use bitsandbytes or awq
👍 2
#8 opened over 1 year ago
by
dtanow
GGUF possible ?
👍 ❤️ 4
2
#5 opened over 1 year ago
by
gopi87
fine-tuning
#1 opened over 1 year ago
by
kzmaker