- Agentic Reinforced Policy Optimization
  Paper • 2507.19849 • Published • 156
- Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
  Paper • 2507.22448 • Published • 65
- InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
  Paper • 2508.18265 • Published • 204
- R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
  Paper • 2508.21113 • Published • 109
Yeo Sing Chen
scyeo
AI & ML interests
None yet
Recent Activity
liked a Space, 9 days ago: HuggingFaceTB/smol-training-playbook
updated a collection, 2 months ago: Daily high rank paper
updated a collection, 3 months ago: Daily high rank paper
Organizations
None yet