- lusxvr/nanoVLM-222M
  Image-Text-to-Text • 0.2B • Updated • 509 • 98
- Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
  Paper • 2503.09516 • Published • 38
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
  Paper • 2505.24863 • Published • 97
- QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
  Paper • 2505.17667 • Published • 88
Collections
Collections including paper arxiv:2603.06351
- Beyond Language Modeling: An Exploration of Multimodal Pretraining
  Paper • 2603.03276 • Published • 88
- Qwen3-Coder-Next Technical Report
  Paper • 2603.00729 • Published • 46
- Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
  Paper • 2603.03205 • Published • 11
- AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
  Paper • 2602.23166 • Published • 40

- MMGR: Multi-Modal Generative Reasoning
  Paper • 2512.14691 • Published • 119
- KlingAvatar 2.0 Technical Report
  Paper • 2512.13313 • Published • 43
- SemanticGen: Video Generation in Semantic Space
  Paper • 2512.20619 • Published • 93
- DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
  Paper • 2512.16676 • Published • 220

- AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
  Paper • 2602.17100 • Published • 2
- GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
  Paper • 2603.01059 • Published • 1
- Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
  Paper • 2603.00618 • Published
- Heterogeneous Agent Collaborative Reinforcement Learning
  Paper • 2603.02604 • Published • 173

- Beyond Imitation: Reinforcement Learning for Active Latent Planning
  Paper • 2601.21598 • Published • 10
- Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
  Paper • 2601.18778 • Published • 41
- Self-Hinting Language Models Enhance Reinforcement Learning
  Paper • 2602.03143 • Published • 30
- GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
  Paper • 2602.12099 • Published • 58

- SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
  Paper • 2601.08303 • Published • 18
- SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers
  Paper • 2411.10510 • Published • 9
- Dynamic Chunking Diffusion Transformer
  Paper • 2603.06351 • Published • 12
- Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
  Paper • 2603.06577 • Published • 43

- CoLLM: A Large Language Model for Composed Image Retrieval
  Paper • 2503.19910 • Published • 15
- Parallel Scaling Law for Language Models
  Paper • 2505.10475 • Published • 83
- OLMoE: Open Mixture-of-Experts Language Models
  Paper • 2409.02060 • Published • 80
- Dynamic Chunking Diffusion Transformer
  Paper • 2603.06351 • Published • 12