Semantic Highlight ◦ Coding Agent Native ◦ Flexible Use ◦ Long Context Tailored
Save up to 40% of Claude tokens!
🔥 Releases:
- 1/28/2025: We released scripts for optimizing and visualizing results! See `./utils`
- 1/27/2025: We published our tech blog: Towards Real-World Software Agents: How We Bring the Semantic Highlight Feature to Agentic Coding
- 1/26/2025: Introduced SWE-Pruner
- 📖 paper: https://arxiv.org/abs/2601.16746
- ⚙️ code: https://github.com/Ayanami1314/swe-pruner
- 🐍 pip: https://pypi.org/project/swe-pruner/
- 🤗 huggingface: https://huggingface.co/ayanami-kitasan/code-pruner
Are you struggling with excessive token costs and latency when using LLM agents for software development? Traditional context compression often relies on fixed metrics like perplexity (PPL) and ignores task-specific code understanding. But generic compression ≠ relevant preservation — we need task-aware context pruning that retains critical implementation details.
Inspired by how human programmers "selectively skim" source code, SWE-Pruner enables agents to formulate explicit goals and uses a lightweight neural skimmer to dynamically select relevant code lines. It operates in two key steps:
- Formulate task-specific goals to guide the pruning process
- Dynamically select relevant code lines using a lightweight neural skimmer
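The two steps above can be sketched as follows. Note that `score_line` here is a hypothetical keyword-overlap stand-in for illustration only; in SWE-Pruner the scoring is done by the lightweight 0.6B neural skimmer, not keyword matching:

```python
def score_line(goal: str, line: str) -> float:
    """Hypothetical relevance scorer: fraction of goal keywords that appear
    in the line. (Stand-in for the 0.6B neural skimmer.)"""
    keywords = [w.lower() for w in goal.split()]
    return sum(kw in line.lower() for kw in keywords) / max(len(keywords), 1)

def prune(goal: str, source: str, threshold: float = 0.3) -> str:
    """Step 1: the agent formulates a task-specific goal (`goal`).
    Step 2: a skimmer keeps only the lines relevant to that goal."""
    return "\n".join(line for line in source.splitlines()
                     if score_line(goal, line) >= threshold)

code = """def connect(url):
    sock = open_socket(url)
    return sock
def handle_error(err):
    log(err)
    raise err"""
print(prune("error handling", code))  # keeps only the error-handling line(s)
```

The key design point is that the goal is explicit and task-specific, so different goals over the same file yield different pruned contexts.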
🧠 Task-Aware Pruning Understands the intent (e.g., "focus on error handling") and uses it to guide the context pruning process, going beyond generic metrics.
🤖 Coding Agent Native Built for multi-turn workflows and integrates seamlessly into agent decision loops, providing just-in-time context for complex software engineering tasks.
🎨 Semantic Highlight A lightweight 0.6B model identifies and preserves semantically critical lines of code, keeping logical structures intact.
⚡ Extreme Compression Delivers significant token savings without sacrificing performance: 23-54% token reduction on SWE-Bench Verified and up to 14.84x compression on LongCodeQA, cutting API costs and latency.
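To make the headline numbers concrete, here is the arithmetic relating a compression ratio to token savings, using the figures quoted above:

```python
def savings_from_ratio(ratio: float) -> float:
    """A k-x compression keeps 1/k of the tokens, i.e. saves 1 - 1/k."""
    return 1.0 - 1.0 / ratio

# The 14.84x compression reported on LongCodeQA keeps only ~6.7% of tokens:
print(f"{savings_from_ratio(14.84):.1%} tokens saved")  # → 93.3% tokens saved

# Conversely, the 23-54% reductions on SWE-Bench Verified correspond to
# ratios of roughly 1.30x to 2.17x:
print(f"{1 / (1 - 0.23):.2f}x")  # → 1.30x
print(f"{1 / (1 - 0.54):.2f}x")  # → 2.17x
```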
🔧 Flexible Use An adaptable framework for various LLMs and scenarios, from debugging to feature development.
.
├── data/              # Experiment trace archives and hyperparameter configurations
├── downstream_eval/   # Downstream evaluation benchmarks
│   ├── multi_turn/    # Includes: SWE-bench, SWE-QA (coming soon)
│   └── single_turn/   # Includes: LongCodeQA, LCC (LongCodeCompletion)
├── swe-pruner/        # Inference code and model utilities
│   └── model/         # Model files for SWE-Pruner
└── examples/          # Examples for integrating with other agents like Claude Code and OpenHands
This project uses uv for fast and efficient dependency management.
Go to the Inference Tutorial and give it a try!
Tips: For easier serving and reproduction, we upload our models in the `./swe-pruner/model` directory (tracked by Git LFS). This makes serving simpler, but it greatly increases the repo size if you run `git clone` directly without an LFS config (and the model download may fail due to the traffic limits of GitHub's LFS service). Alternatively, you can use the methods in the tutorial to download the model from HuggingFace.
Since different modules have different dependencies, please refer to the specific README file inside each subfolder for detailed installation instructions.
- For Users: see the Inference Tutorial to start a SWE-Pruner locally, then read the real-world examples for agent integration.
  - We now support OpenHands and the Claude Agent SDK!
- For Developers: see `./train` (coming soon) to train a pruner yourself!
- For Researchers: `./downstream_eval` has scripts for reproducing our results. We recommend using Slurm with at least 4 GPUs to reuse our scripts.
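As a rough illustration of where a pruner slots into an agent's decision loop, here is a minimal sketch. All names here are hypothetical and not the actual swe-pruner API; see the Inference Tutorial and `examples/` for the real integration:

```python
def agent_step(goal, file_contents, prune_fn, llm_call):
    """Hypothetical decision loop: each file is pruned against the current
    goal just before it enters the LLM context (just-in-time context)."""
    context = "\n\n".join(prune_fn(goal, src) for src in file_contents)
    return llm_call(f"Goal: {goal}\n\nContext:\n{context}")

# Stand-in pruner and LLM; in practice the pruner is the served SWE-Pruner
# model and llm_call hits the agent's backing LLM:
prune_fn = lambda goal, src: "\n".join(
    line for line in src.splitlines() if any(w in line for w in goal.split()))
llm_call = lambda prompt: f"[{len(prompt)} chars sent to model]"

print(agent_step("error", ["raise error\nreturn ok", "log error\npass"],
                 prune_fn, llm_call))
```

Because pruning happens per turn against the current goal, a multi-turn agent pays only for the lines relevant to each step rather than for the whole repository context.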
We provide utility scripts for continually improving SWE-Pruner in `./utils`; just see `utils/README.md`!
- 💻 Update Training Code of SWE-Pruner
- 📁 Upload full parameters and trajectory files & logs
- 📁 Upload Training Dataset of SWE-Pruner
- 📁 Upload SWE-QA evaluation code
- 🤗 Update HuggingFace model card
- 🤗 Update the HuggingFace blog to introduce our technical approach in detail
- 🎮 Update the agent integration demo
@misc{wang2026sweprunerselfadaptivecontextpruning,
title={SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents},
author={Yuhang Wang and Yuling Shi and Mo Yang and Rongrui Zhang and Shilin He and Heng Lian and Yuting Chen and Siyu Ye and Kai Cai and Xiaodong Gu},
year={2026},
eprint={2601.16746},
archivePrefix={arXiv},
primaryClass={cs.SE},
url={https://arxiv.org/abs/2601.16746},
}
- Bytedance Douyin Team for their advice.
- Alibaba Qwen Team for open-source models.

