SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18 • 19 • 16
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 16 days ago • 51
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF Image-Text-to-Text • 401B • Updated Jun 18 • 8.68k • 38
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • 402B • Updated May 22 • 20.3k • 425
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation Paper • 2409.10262 • Published Sep 16, 2024 • 1