Behrooz Azarkhalili's picture

57 500

Behrooz Azarkhalili

ermiaazarkhalili

·

AI & ML interests

LLMs, VLMs, PEFT, RL for LLMs and VLMs.

Recent Activity

liked a Space 13 days ago

HuggingFaceTB/smol-training-playbook

liked a model 16 days ago

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

upvoted an article 25 days ago

Supercharge your OCR Pipelines with Open Models

View all activity

Organizations

upvoted an article 25 days ago

Article

Supercharge your OCR Pipelines with Open Models

25 days ago

•

236

upvoted a collection about 1 month ago

ExGRPO

Model collections trained using ExGRPO. • 7 items • Updated Oct 3 • 1

upvoted a paper 3 months ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published Aug 20 • 22

upvoted 5 articles 3 months ago

Article

Upskill your LLMs With Gradio MCP Servers

Jul 9

•

20

Article

Generate Images with Claude and Hugging Face

Aug 19

•

36

Article

Multimodal RAG with Colpali, Milvus and VLMs

Dec 10, 2024

•

10

Article

How I Built 7 Custom Gradio Components in Just 12 Days!

Aug 12

•

7

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7

•

99

upvoted a collection 4 months ago

Qwen3-MegaScience

Qwen3-MegaScience • 5 items • Updated Jul 23 • 4

upvoted a paper 4 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 63

upvoted an article 4 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29

•

197

upvoted a collection 4 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated about 14 hours ago • 143

upvoted an article 4 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11

•

83

upvoted a collection 5 months ago

Qwen3

84 items • Updated Aug 6 • 1.42k

upvoted a paper 5 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 56

upvoted an article 5 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

•

371

upvoted an article 7 months ago

Article

Multi-Label Classification Model From Scratch: Step-by-Step Tutorial

Jan 8, 2024

•

47

upvoted an article 8 months ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

Aug 25, 2023

•

37

upvoted an article 10 months ago

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Jan 23

•

186

upvoted an article about 1 year ago

Article

Introducing GGUF-my-LoRA

Nov 1, 2024

•

22