Context Management for Long-Horizon SWE-Agents

Published:Dec 26, 2025 17:15
1 min read
ArXiv

Analysis

This paper addresses the critical challenge of context management in long-horizon software engineering tasks performed by LLM-based agents. The core contribution is CAT, a novel context management paradigm that proactively compresses historical trajectories into actionable summaries. This is a significant advancement because it tackles the issues of context explosion and semantic drift, which are major bottlenecks for agent performance in complex, long-running interactions. The proposed CAT-GENERATOR framework and SWE-Compressor model provide a concrete implementation and demonstrate improved performance on the SWE-Bench-Verified benchmark.

Reference

SWE-Compressor reaches a 57.6% solved rate and significantly outperforms ReAct-based agents and static compression baselines, while maintaining stable and scalable long-horizon reasoning under a bounded context budget.