Agentic - a galois77 Collection

galois77 's Collections

energy based models

OCR

Poetry

Agentic

Videos

ahan

Image generation

Training optimization

RL

Benchmarks and challenges

Agentic

updated 9 days ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28 • 39
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published Apr 22 • 21
Emergent Agentic Transformer from Chain of Hindsight Experience

Paper • 2305.16554 • Published May 26, 2023
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

Paper • 2504.02882 • Published Apr 2 • 7
ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published May 29 • 22
Self-Challenging Language Model Agents

Paper • 2506.01716 • Published Jun 2 • 10
Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23 • 72
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13 • 56
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 78
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7 • 99
AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published 13 days ago • 65