-
MASPRM: Multi-Agent System Process Reward Model
Paper • 2510.24803 • Published • 14 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
Paper • 2510.23595 • Published • 12
Ed Li
edli
AI & ML interests
None yet
Recent Activity
submitted
a paper
about 3 hours ago
Scaling Multiagent Systems with Process Rewards
liked
a model
3 days ago
fdtn-ai/Foundation-Sec-8B-Reasoning
upvoted
a
paper
3 days ago
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report