AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks Paper • 2403.04783 • Published Mar 2, 2024
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement Paper • 2410.13828 • Published Oct 17, 2024
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking Paper • 2406.00231 • Published May 31, 2024
TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling Paper • 2410.16033 • Published Oct 18, 2024
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems Paper • 2505.00212 • Published Apr 30, 2025
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28, 2025