From Data to Behavior: Predicting Unintended Model Behaviors Before Training Paper • 2602.04735 • Published 11 days ago • 15
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics Paper • 2602.02343 • Published 13 days ago • 13
Aligning Agentic World Models via Knowledgeable Experience Learning Paper • 2601.13247 • Published 27 days ago • 15
Can We Predict Before Executing Machine Learning Agents? Paper • 2601.05930 • Published Jan 9 • 27
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published Jan 9 • 20
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published Dec 1, 2025 • 36
Memory Collection Prompt is text-based memory. System II prompting is updating memory. Parametric memory is long-term, while prompt-based memory is short-term. • 23 items • Updated Oct 22, 2025 • 2
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21, 2025 • 114
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20, 2025 • 15
When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation Paper • 2510.07238 • Published Oct 8, 2025 • 15
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses Paper • 2510.00232 • Published Sep 30, 2025 • 16
OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30, 2025 • 36
Towards Personalized Deep Research: Benchmarks and Evaluations Paper • 2509.25106 • Published Sep 29, 2025 • 30
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published Aug 11, 2025 • 52
Automating Steering for Safe Multimodal Large Language Models Paper • 2507.13255 • Published Jul 17, 2025 • 4