Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git patches and run tests or SWE-Bench evaluations.
☆14Apr 9, 2025Updated last year
Alternatives and similar repositories for moatless-testbeds
Users that are interested in moatless-testbeds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 5, 2024Updated last year
- ☆132Jun 6, 2025Updated 10 months ago
- 这是一个大学四年的cs基础课部分专业课的复习笔记的扫描版备份仓库☆12Jun 29, 2019Updated 6 years ago
- Applied Symbolic Execution with KLEE/LLVM☆24Jun 7, 2013Updated 12 years ago
- ☆21Mar 29, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- Explainable Neural Subgraph Matching with Graph Learnable Multi-hop Attention Networks☆14Sep 26, 2024Updated last year
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆18Mar 31, 2025Updated last year
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- ☆23Dec 8, 2022Updated 3 years ago
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC…☆12May 29, 2021Updated 4 years ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆45Apr 15, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Landing page + leaderboard for SWE-Bench benchmark☆12Mar 29, 2026Updated 2 weeks ago
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆48Sep 13, 2025Updated 7 months ago
- ☆19Jun 13, 2024Updated last year
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆43Dec 25, 2024Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆29Jan 28, 2024Updated 2 years ago
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 5 months ago
- ProgQuery is a system to extract useful syntactic and semantic information from source code programs and store it in a graph database for…☆17Jan 22, 2025Updated last year
- A minimal language for Isabelle/HOL, designed for easing machine learning.☆25Jan 13, 2026Updated 3 months ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆169Oct 11, 2024Updated last year
- ☆22Dec 7, 2023Updated 2 years ago
- A Searching-based Agent Model for Open-Domain Open-Ended Question Answering☆34Jun 20, 2025Updated 9 months ago
- ☆17Jan 7, 2024Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards☆302May 5, 2025Updated 11 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆80Jun 27, 2024Updated last year
- Dataset and model for disentangling chat on IRC☆58May 7, 2024Updated last year
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and…☆11Oct 22, 2021Updated 4 years ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Jun 5, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆36May 25, 2023Updated 2 years ago
- AI Services API: serves langchain, huggingface, & other emergent python AI libraries as a service. This project mainly serves LibreChat, …☆35Jul 24, 2023Updated 2 years ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆50Jun 17, 2025Updated 9 months ago
- Code Efficiency Benchmark☆87Mar 6, 2026Updated last month
- quick playground to animate pippin☆15Nov 11, 2024Updated last year
- A library for program induction and learning representations.☆32Dec 18, 2023Updated 2 years ago
- Official implementation of our ICSE 2023 paper on Automatic Code Generation.☆27Nov 8, 2023Updated 2 years ago