Nirvana Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism YuhuaJiang/Nirvana-pro 2B • Updated Oct 13 • 5 • 3 YuhuaJiang/Nirvana-simple 2B • Updated Oct 13 • 4 • 2 YuhuaJiang/Nirvana 2B • Updated Oct 13 • 4 • 2
SDAR The models without suffixes use the default block size = 4. JetLM/SDAR-1.7B-Chat Text Generation • 2B • Updated Oct 21 • 513 • 7 JetLM/SDAR-4B-Chat Text Generation • 4B • Updated Oct 21 • 6.01k • 2 JetLM/SDAR-8B-Chat Text Generation • 8B • Updated Oct 21 • 251 • 3 JetLM/SDAR-30B-A3B-Chat Text Generation • 31B • Updated Oct 21 • 12 • 2
Nirvana Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism YuhuaJiang/Nirvana-pro 2B • Updated Oct 13 • 5 • 3 YuhuaJiang/Nirvana-simple 2B • Updated Oct 13 • 4 • 2 YuhuaJiang/Nirvana 2B • Updated Oct 13 • 4 • 2
SDAR The models without suffixes use the default block size = 4. JetLM/SDAR-1.7B-Chat Text Generation • 2B • Updated Oct 21 • 513 • 7 JetLM/SDAR-4B-Chat Text Generation • 4B • Updated Oct 21 • 6.01k • 2 JetLM/SDAR-8B-Chat Text Generation • 8B • Updated Oct 21 • 251 • 3 JetLM/SDAR-30B-A3B-Chat Text Generation • 31B • Updated Oct 21 • 12 • 2