CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis Paper • 2503.23145 • Published Mar 29 • 35
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Paper • 2310.04406 • Published Oct 6, 2023 • 10