agentic - a zhuww Collection

updated Sep 28

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20 • 44

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 115

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 115