vaibhavagg303 / DARS-Agent
☆67 Updated 6 months ago
Alternatives and similar repositories for DARS-Agent
Users who are interested in DARS-Agent are comparing it to the repositories listed below.
- Agent computer interface for AI software engineer. ☆114 Updated 2 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆78 Updated last year
- Open Agent Computer Interface ☆89 Updated last year
- Coding problems used in aider's polyglot benchmark ☆194 Updated 11 months ago
- ☆62 Updated 5 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆382 Updated this week
- proof-of-concept of Cursor's Instant Apply feature ☆87 Updated last year
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆479 Updated this week
- ☆126 Updated 6 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆117 Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graph ☆232 Updated 8 months ago
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor ☆83 Updated last year
- ☆185 Updated 2 months ago
- [NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications ☆131 Updated 4 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆225 Updated this week
- Run SWE-bench evaluations remotely ☆44 Updated 3 months ago
- ☆79 Updated 2 months ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold ☆43 Updated 7 months ago
- ☆59 Updated 10 months ago
- 🚀 The LLM Automatic Computer Framework: L2MAC ☆144 Updated 11 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM. ☆69 Updated 5 months ago
- ☆159 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆89 Updated this week
- accompanying material for sleep-time compute paper ☆118 Updated 7 months ago
- 📚 Benchmark your browser agent on ~2.5k READ and ACTION based tasks ☆78 Updated 4 months ago
- Multi-Granularity LLM Debugger [ICSE2026] ☆93 Updated 5 months ago
- ☆613 Updated 3 months ago
- ☆128 Updated 7 months ago
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025] ☆129 Updated last week
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset ☆30 Updated 8 months ago