In a Training Loop 🔄

49 61 175

Stefano Fiorucci PRO

anakin87

AI & ML interests

Language Models: orchestration, post-training, GRPO, synthetic data... Contributing to Haystack LLM framework 🏗️

Recent Activity

liked a model 5 days ago

VAGOsolutions/SauerkrautLM-Translator-LFM2.5-1.2B

liked a model 7 days ago

mii-llm/nesso-4B

updated a model 7 days ago

anakin87/mr-ttt-harder-mixed4-merged

View all activity

Organizations

upvoted an article 13 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

17 days ago

•

upvoted a collection 18 days ago

🧮functiongemma ft mobile-actions

Collection

A collection of functiongemma-270m-it models fine-tuned on mobile actions dataset for Spanish, French and Italian • 3 items • Updated 14 days ago • 3

upvoted a collection about 2 months ago

INTELLECT-3

Collection

INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated Nov 28, 2025 • 11

upvoted a collection 2 months ago

SYNTH

Collection

Fully generalist synthetic dataset and SOTA small reasoners • 3 items • Updated Nov 10, 2025 • 11

upvoted a paper 3 months ago

Extracting alignment data in open models

Paper • 2510.18554 • Published Oct 21, 2025 • 9

upvoted an article 3 months ago

Article

Extract Text and Knowledge from Images with Open Vision Language Models

Oct 23, 2025

•

upvoted a collection 4 months ago

IFBench

Collection

Datasets for IFBench benchmark and paper! • 3 items • Updated 27 days ago • 10

upvoted an article 4 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

Sep 23, 2025

•

135

upvoted 2 articles 5 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4, 2025

•

268

Article

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

Sep 4, 2025

•

upvoted a collection 5 months ago

EmbeddingGemma

Collection

3 items • Updated Sep 11, 2025 • 108

upvoted 2 articles 5 months ago

Article

Some Safety and Security tests using LlamaGuard 4 12B and PromptGuard2

Aug 28, 2025

•

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025

•

106

upvoted a paper 6 months ago

Mergenetic: a Simple Evolutionary Model Merging Library

Paper • 2505.11427 • Published May 16, 2025 • 14

upvoted a collection 6 months ago

T5Gemma

Collection

32 items • Updated Jul 10, 2025 • 81

upvoted an article 6 months ago

Article

cocogold: training Marigold for text-grounded segmentation

Jul 8, 2025

•

upvoted an article 9 months ago

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

May 7, 2025

•

upvoted a collection 9 months ago

Qwen Scheduler GRPO

Collection

Train a SLM to create a schedule from a list of events and priorities - Article: https://t.ly/-Dejx - Code: https://t.ly/1J_VG • 2 items • Updated Oct 25, 2025 • 4

upvoted an article 9 months ago

Article

I trained a Language Model to schedule events with GRPO!

Apr 29, 2025

•

upvoted an article 10 months ago

Article

Training a Gemma 2 2B-IT for Reasoning with GRPO

Mar 18, 2025

•

Stefano Fiorucci PRO

AI & ML interests

Recent Activity

Organizations

anakin87's activity

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Extract Text and Knowledge from Images with Open Vision Language Models

Smol2Operator: Post-Training GUI Agents for Computer Use

Welcome EmbeddingGemma, Google's new efficient embedding model

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

Some Safety and Security tests using LlamaGuard 4 12B and PromptGuard2

Vision Language Model Alignment in TRL ⚡️

cocogold: training Marigold for text-grounded segmentation

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

I trained a Language Model to schedule events with GRPO!

Training a Gemma 2 2B-IT for Reasoning with GRPO