arxiv:2601.13572

Behavior Knowledge Merge in Reinforced Agentic Models

Published on Jan 20 · Submitted by Xiangchi Yuan on Jan 22

AI-generated summary

Reinforced Agent Merging (RAM) addresses the limitations of traditional merging methods for reinforcement learning-trained agents by distinguishing shared and task-specific parameters to preserve critical behaviors during model integration.

Abstract

Reinforcement learning (RL) is central to post-training, particularly for agentic models that require specialized reasoning behaviors. In this setting, model merging offers a practical mechanism for integrating multiple RL-trained agents from different tasks into a single generalist model. However, existing merging methods are designed for supervised fine-tuning (SFT), and they are suboptimal at preserving the task-specific capabilities of RL-trained agentic models. The root cause is a task-vector mismatch between RL and SFT: on-policy RL induces task vectors that are highly sparse and heterogeneous, whereas SFT-style merging implicitly assumes dense and globally comparable task vectors. When standard global averaging is applied under this mismatch, the non-overlapping RL task vectors that encode critical task-specific behaviors are shrunk and parameter updates are diluted. To address this issue, we propose Reinforced Agent Merging (RAM), a distribution-aware merging framework explicitly designed for RL-trained agentic models. RAM disentangles shared and task-specific unique parameter updates, averaging shared components while selectively preserving and rescaling unique ones to counteract parameter-update dilution. Experiments across multiple agent domains and model architectures demonstrate that RAM not only surpasses merging baselines but also unlocks synergistic potential among agents, achieving performance superior to that of the specialized agents in their own domains.
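
To make the dilution effect concrete, here is a minimal toy sketch (our illustration, not code from the paper): two sparse RL task vectors barely overlap, so uniform averaging over N models scales each task's unique update by 1/N.

```python
# Toy illustration (assumption: a task vector is the flattened difference
# between finetuned and base weights). Sparse RL updates from two tasks
# barely overlap, so global averaging halves every unique update.
import torch

base = torch.zeros(8)                                    # stand-in base weights
tv_a = torch.tensor([0., 2., 0., 0., 1., 0., 0., 0.])    # sparse update, task A
tv_b = torch.tensor([0., 0., 0., 3., 0., 0., 0., 2.])    # sparse update, task B

merged = base + (tv_a + tv_b) / 2                        # SFT-style global averaging
print(merged)
# tensor([0., 1., 0., 1.5, 0.5, 0., 0., 1.]) -- each unique update halved
```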

Community

Paper author · Paper submitter

🚀 TL;DR

We introduce RAM (Reinforced Agent Merging), a method for merging RL-trained agents into a single generalist model without retraining; the merged model outperforms the original specialized agents in their own domains.

[Figure: teaser]

💡 Key Insights

  • The Problem: Standard merging methods (e.g., TIES, DARE) are built for SFT models. We find they fail on RL-trained models because RL updates are extremely sparse and heterogeneous, so averaging causes "signal dilution" and performance drops.
[Figures: motivation]
  • The Solution: RAM explicitly disentangles "shared" vs. "unique" parameter updates. It preserves the full magnitude of unique task vectors to prevent dilution while averaging shared knowledge (see the sketch after this list).
[Figure: RAM method overview]
  • Performance: RAM outperforms all existing merging baselines on agentic benchmarks (CURE, BFCL, MemAgent).
[Figures: results]
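
Below is a minimal, hedged sketch of the merging rule described above, assuming task vectors are flattened weight deltas and that "unique" means a parameter updated by only one task. The non-zero mask, per-parameter averaging, and keeping unique updates at full magnitude (in place of the paper's exact rescaling rule) are our assumptions for illustration, not RAM's actual implementation.

```python
# Sketch of a RAM-style merge: average updates where tasks overlap,
# keep each task's unique updates at full scale instead of diluting
# them by 1/N as global averaging would.
import torch

def ram_style_merge(base: torch.Tensor,
                    task_vectors: list[torch.Tensor],
                    eps: float = 0.0) -> torch.Tensor:
    stacked = torch.stack(task_vectors)        # [num_tasks, num_params]
    touched = stacked.abs() > eps              # where each task actually updated
    count = touched.sum(dim=0).clamp(min=1)    # tasks touching each parameter
    shared = count > 1                         # overlapping (shared) positions
    summed = stacked.sum(dim=0)
    merged = torch.where(shared, summed / count,  # average shared components
                         summed)                  # preserve unique ones fully
    return base + merged

# Usage with toy sparse task vectors:
base = torch.zeros(6)
tv_a = torch.tensor([1., 0., 2., 0., 0., 0.])   # task A update
tv_b = torch.tensor([1., 0., 0., 0., 3., 0.])   # task B update
print(ram_style_merge(base, [tv_a, tv_b]))
# tensor([1., 0., 2., 0., 3., 0.]) -- shared coord averaged, unique coords intact
```

On the toy input, the shared coordinate is averaged while each task's unique coordinates survive at full scale, which is the qualitative behavior RAM uses to counteract parameter-update dilution.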

