view article Article The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU 17 days ago • 12
🧮functiongemma ft mobile-actions Collection A collection of functiongemma-270m-it models fine-tuned on mobile actions dataset for Spanish, French and Italian • 3 items • Updated 14 days ago • 3
INTELLECT-3 Collection INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated Nov 28, 2025 • 11
SYNTH Collection Fully generalist synthetic dataset and SOTA small reasoners • 3 items • Updated Nov 10, 2025 • 11
view article Article Extract Text and Knowledge from Images with Open Vision Language Models Oct 23, 2025 • 5
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 268
view article Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn Sep 4, 2025 • 29
view article Article Some Safety and Security tests using LlamaGuard 4 12B and PromptGuard2 Aug 28, 2025 • 1
Mergenetic: a Simple Evolutionary Model Merging Library Paper • 2505.11427 • Published May 16, 2025 • 14
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs May 7, 2025 • 42
Qwen Scheduler GRPO Collection Train a SLM to create a schedule from a list of events and priorities - Article: https://t.ly/-Dejx - Code: https://t.ly/1J_VG • 2 items • Updated Oct 25, 2025 • 4