Makise Kurisu

kurisu0306

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

upvoted a paper 2 days ago

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

upvoted a paper 2 days ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Organizations

None yet

upvoted a paper 1 day ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 2 days ago • 82

upvoted 3 papers 2 days ago

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Paper • 2602.02619 • Published 4 days ago • 47

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published 3 days ago • 33

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Paper • 2602.02486 • Published 4 days ago • 16

upvoted a paper 3 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 4 days ago • 197

upvoted a paper 4 days ago

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published 8 days ago • 56

upvoted a paper 10 days ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 14 days ago • 174

upvoted a paper 15 days ago

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Paper • 2505.14464 • Published May 20, 2025 • 10

upvoted 2 collections 16 days ago

ASTRA Dataset

Collection

2 items • Updated 16 days ago • 4

ASTRA Models

Collection

2 items • Updated 15 days ago • 2

upvoted 2 papers 16 days ago

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published 17 days ago • 53

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published 22 days ago • 60

upvoted a paper 18 days ago

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Paper • 2601.10355 • Published 22 days ago • 39

upvoted a paper 19 days ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 23 days ago • 193

upvoted a paper 24 days ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published 28 days ago • 36

upvoted a paper about 1 month ago

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 104

upvoted a paper 2 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 107

upvoted a paper 8 months ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published Jun 4, 2025 • 48

upvoted a collection 9 months ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 2 days ago • 260

upvoted a collection 11 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Dec 31, 2025 • 557
