📙 LLM Engineer's Handbook Collection Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook • 6 items • Updated Apr 7, 2025 • 14
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 5 items • Updated Nov 14, 2025 • 162
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12, 2025 • 82
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8, 2025 • 185
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 188
Article Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) Jun 16, 2023 • 44
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1, 2025 • 36
Article What is MoE 2.0? Update Your Knowledge about Mixture-of-experts Apr 27, 2025 • 10
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Paper • 2505.02835 • Published May 5, 2025 • 28
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It? Mar 17, 2025 • 348
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25, 2025 • 47
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30, 2025 • 49