Bai Yang

ShacklesLay

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

HuggingFaceTB/smol-training-playbook

2.04k

The Smol Training Playbook: The Secrets to Building World-Class LLMs

📝

Display loss curves for training LLMs

upvoted a paper 3 months ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17 • 75

upvoted a paper 8 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 84

published a dataset 9 months ago

ShacklesLay/Moment-10M

Updated Feb 27 • 4

upvoted 3 papers 10 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 123

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 423

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 43

upvoted a paper 11 months ago

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published Dec 17, 2024 • 71

liked a dataset 11 months ago

nyu-visionx/Cambrian-10M

Preview • Updated Jul 8, 2024 • 8.69k • 119

upvoted 3 papers about 1 year ago

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 21

HumanEval-V: Benchmarking High-Level Visual Reasoning with Complex Diagrams in Coding Tasks

Paper • 2410.12381 • Published Oct 16, 2024 • 43

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 87

upvoted 5 papers over 1 year ago

InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance

Paper • 2401.11206 • Published Jan 20, 2024 • 2

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109

liked a Space almost 2 years ago

13.7k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

upvoted a paper almost 2 years ago

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning

Paper • 2312.01552 • Published Dec 4, 2023 • 32

upvoted a paper about 2 years ago

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 39