Jamba Reasoning 3B Collection AI21's top-performing reasoning model that packs leading scores on intelligence benchmarks and highly-efficient processing into a compact 3B build • 2 items • Updated Oct 8 • 5
LLaDA-8B-BGPO Collection Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models • 4 items • Updated about 1 month ago • 4