Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git patches and run tests or SWE-Bench evaluations.
☆14Apr 9, 2025Updated last year
Alternatives and similar repositories for moatless-testbeds
Users that are interested in moatless-testbeds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 5, 2024Updated last year
- ☆136Jun 6, 2025Updated 11 months ago
- 这是一个大学四年的cs基础课部分专业课的复习笔记的扫描版备份仓库☆12Jun 29, 2019Updated 6 years ago
- ☆22Apr 26, 2026Updated 3 weeks ago
- ☆13May 23, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Jul 25, 2023Updated 2 years ago
- ☆13Dec 31, 2023Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- Explainable Neural Subgraph Matching with Graph Learnable Multi-hop Attention Networks☆14Sep 26, 2024Updated last year
- Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"☆17May 17, 2021Updated 5 years ago
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆19Mar 31, 2025Updated last year
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC…☆12May 29, 2021Updated 4 years ago
- Landing page + leaderboard for SWE-Bench benchmark☆12Mar 29, 2026Updated last month
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆49Sep 13, 2025Updated 8 months ago
- ☆19Jun 13, 2024Updated last year
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆47Apr 15, 2025Updated last year
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆43Dec 25, 2024Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆30Jan 28, 2024Updated 2 years ago
- A minimal language for Isabelle/HOL, designed for easing machine learning.☆28May 16, 2026Updated last week
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆192May 16, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ARC gym: a data generation framework for the Abstraction & Reasoning Corpus☆25Mar 25, 2026Updated last month
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆97Mar 26, 2025Updated last year
- A PyTorch implementation of LDAST☆26Dec 17, 2023Updated 2 years ago
- Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.☆30May 26, 2024Updated last year
- A Searching-based Agent Model for Open-Domain Open-Ended Question Answering☆35Jun 20, 2025Updated 11 months ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆39Apr 8, 2023Updated 3 years ago
- ☆17Jan 7, 2024Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards☆309May 5, 2025Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)☆26Jan 17, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and…☆11Oct 22, 2021Updated 4 years ago
- Harness used to benchmark aider against SWE Bench benchmarks☆83Jun 27, 2024Updated last year
- LinearArbitrary-SeaHorn is a CHC solver for LLVM-based languages.☆22Mar 13, 2023Updated 3 years ago
- AskIt (for JavaScript/TypeScript): Unified programming interface for large language models (GPT-4, GPT-3.5)☆35Oct 1, 2023Updated 2 years ago
- Official Code for DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents (Findings of EMNL…☆22Oct 24, 2023Updated 2 years ago
- ☆36May 25, 2023Updated 3 years ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆51Jun 17, 2025Updated 11 months ago