-
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Paper • 2506.01049 • Published • 38 -
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Paper • 2402.09240 • Published • 4 -
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Paper • 2410.06373 • Published • 35 -
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning
Paper • 2209.04851 • Published • 3
Collections
Discover the best community collections!
Collections including paper arxiv:2506.01049
-
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper • 2402.17193 • Published • 26 -
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper • 2410.23743 • Published • 63 -
Direct Preference Optimization Using Sparse Feature-Level Constraints
Paper • 2411.07618 • Published • 17 -
Transformer^2: Self-adaptive LLMs
Paper • 2501.06252 • Published • 54
-
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 185 -
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published • 142 -
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Paper • 2506.01049 • Published • 38 -
ARIA: Training Language Agents with Intention-Driven Reward Aggregation
Paper • 2506.00539 • Published • 30
-
CoRAG: Collaborative Retrieval-Augmented Generation
Paper • 2504.01883 • Published • 9 -
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
Paper • 2504.08837 • Published • 43 -
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Paper • 2504.10068 • Published • 30 -
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Paper • 2504.10481 • Published • 85
-
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Paper • 2506.01049 • Published • 38 -
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Paper • 2402.09240 • Published • 4 -
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Paper • 2410.06373 • Published • 35 -
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning
Paper • 2209.04851 • Published • 3
-
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 185 -
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published • 142 -
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Paper • 2506.01049 • Published • 38 -
ARIA: Training Language Agents with Intention-Driven Reward Aggregation
Paper • 2506.00539 • Published • 30
-
CoRAG: Collaborative Retrieval-Augmented Generation
Paper • 2504.01883 • Published • 9 -
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
Paper • 2504.08837 • Published • 43 -
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Paper • 2504.10068 • Published • 30 -
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Paper • 2504.10481 • Published • 85
-
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper • 2402.17193 • Published • 26 -
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper • 2410.23743 • Published • 63 -
Direct Preference Optimization Using Sparse Feature-Level Constraints
Paper • 2411.07618 • Published • 17 -
Transformer^2: Self-adaptive LLMs
Paper • 2501.06252 • Published • 54