Weiran Yao's picture

5 6 5

Weiran Yao

weiranyao

·

AI & ML interests

AI Agent

Recent Activity

liked a dataset 20 days ago

Salesforce/EDR-200

upvoted a paper 29 days ago

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

authored a paper about 1 month ago

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

View all activity

Organizations

authored 7 papers about 1 month ago

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Paper • 2411.13547 • Published Nov 20, 2024

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Paper • 2503.22673 • Published Mar 28 • 12

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Paper • 2504.03601 • Published Apr 4 • 17

PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data

Paper • 2502.20616 • Published Feb 28

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

Paper • 2509.09614 • Published Sep 11 • 7

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24 • 11

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27 • 42