SE-Agent is a self-evolution framework for LLM code agents. It performs trajectory-level evolution, exchanging information across reasoning paths through Revision, Recombination, and Refinement, which expands the search space and helps escape local optima. On SWE-bench Verified, it achieves state-of-the-art (SOTA) performance.
☆242 · Sep 23, 2025 · Updated 6 months ago
Alternatives and similar repositories for SE-Agent
Users that are interested in SE-Agent are comparing it to the libraries listed below.
- Artifact for TOSEM Submission: GiantRepair ☆13 · Jun 26, 2024 · Updated last year
- A simple visual test-time scaling method for GUI agent grounding ☆21 · Dec 7, 2025 · Updated 3 months ago
- AI powered coding Agent ☆36 · Oct 22, 2025 · Updated 5 months ago
- ☆33 · Mar 6, 2026 · Updated 3 weeks ago
- Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activit… ☆14 · May 1, 2025 · Updated 10 months ago
- [KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation ☆33 · Nov 18, 2025 · Updated 4 months ago
- ☆131 · Jun 6, 2025 · Updated 9 months ago
- Code for "What really matters in matrix-whitening optimizers?" ☆23 · Oct 31, 2025 · Updated 4 months ago
- ☆40 · Feb 20, 2026 · Updated last month
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆326 · Dec 18, 2025 · Updated 3 months ago
- A Deep Learning Project about cats. ☆11 · Aug 8, 2022 · Updated 3 years ago
- Concurrency library ☆17 · Oct 13, 2024 · Updated last year
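- A very simple reproduction of Deepseek-R1-Zero and Deepseek-R1, using the "24 game" as an example. It applies zero-RL, SFT, and SFT+RL to elicit the LLM's capacity for self-verification and reflection. About Clean, minimal, accessible reproduction of Dee… ☆34 · Apr 5, 2025 · Updated 11 months ago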
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆15 · Jun 28, 2025 · Updated 8 months ago
- ☆77 · Mar 6, 2026 · Updated 3 weeks ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents ☆55 · Nov 27, 2025 · Updated 4 months ago
- ☆12 · Sep 23, 2023 · Updated 2 years ago
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live! ☆171 · Mar 9, 2026 · Updated 2 weeks ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments. ☆30 · Mar 28, 2025 · Updated 11 months ago
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025) ☆25 · Dec 2, 2024 · Updated last year
- ✨ A high-performance code agent written in Rust, combining the best features of WCGW for maximum efficiency and semantic capabilities. 🦀 ☆26 · Mar 16, 2026 · Updated last week
- ☆17 · Dec 21, 2023 · Updated 2 years ago
- MegEngine implementation of Diffusion Models. ☆19 · Aug 8, 2022 · Updated 3 years ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆102 · Oct 3, 2025 · Updated 5 months ago
- ☆12 · Aug 17, 2025 · Updated 7 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆602 · Updated this week
- ☆20 · Jun 17, 2025 · Updated 9 months ago
- The official implementation of "RouteExplainer: An Explanation Framework for Vehicle Routing Problem" (PAKDD 2024, oral) ☆17 · Apr 5, 2024 · Updated last year
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models ☆25 · Oct 17, 2025 · Updated 5 months ago
- A Benchmark for Multi-Stage Legal Case Documents Generation ☆16 · Feb 24, 2025 · Updated last year
- Residual vector quantization for KV cache compression in large language model ☆12 · Oct 22, 2024 · Updated last year
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? ☆323 · Mar 11, 2026 · Updated 2 weeks ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers" ☆98 · Oct 27, 2025 · Updated 4 months ago
- 🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents ☆2,673 · Jan 7, 2026 · Updated 2 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability" ☆149 · May 27, 2025 · Updated 10 months ago
- [ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution ☆296 · Mar 19, 2026 · Updated last week
- ☆133 · Apr 7, 2025 · Updated 11 months ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258) ☆45 · Jan 6, 2026 · Updated 2 months ago