Benchmarking Goal-Oriented Software Engineering
☆128Jan 7, 2026Updated 3 months ago
Alternatives and similar repositories for CodeClash
Users that are interested in CodeClash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆43Updated this week
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)☆26Sep 18, 2025Updated 6 months ago
- Run SWE-bench evaluations remotely☆61Aug 14, 2025Updated 7 months ago
- ☆14Apr 16, 2025Updated 11 months ago
- Scaling Coding-Agent RL to 32x H100s. **Achieving 160% improvement** on Stanford's TerminalBench☆97Nov 3, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆616Updated this week
- Convert GitHub PRs into Harbor tasks☆54Mar 10, 2026Updated last month
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 3 years ago
- ☆19Aug 10, 2024Updated last year
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆472Updated this week
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- ☆34Mar 21, 2026Updated 2 weeks ago
- Production-Grade Autoresearch. Ideal for GPU kernels, ML model development, feature engineering, prompt engineering, and other optimizabl…☆41Updated this week
- Hand-Rolled GPU communications library☆89Nov 25, 2025Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An ergonomic, opinionated memory interface for AI agents☆39Dec 18, 2025Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated 11 months ago
- Clean RL implementation using MLX☆34Mar 8, 2024Updated 2 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated 2 months ago
- ☆24Mar 21, 2026Updated 3 weeks ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆12Oct 12, 2024Updated last year
- Abstraction and Reasoning Corpus☆14Nov 22, 2022Updated 3 years ago
- RISC-V vector extension ISA simulation☆18Jun 11, 2019Updated 6 years ago
- A lightweight computational physics framework, based on the organization of turboWAVE. Implements a "Simulation, PhysicsModule, ComputeTo…☆11Apr 1, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Nov 6, 2024Updated last year
- ☆11Mar 15, 2024Updated 2 years ago
- ☆15May 17, 2022Updated 3 years ago
- nyc is so back☆21Jun 27, 2025Updated 9 months ago
- NSA Triton Kernels written with GPT5 and Opus 4.1☆70Aug 12, 2025Updated 7 months ago
- tuimorphic choose-your-own-adventure story game☆18Mar 3, 2026Updated last month
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- Official Code Repository for the paper "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆16Jul 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Async RL Training at Scale☆1,266Updated this week
- A benchmark for LLMs on complicated tasks in the terminal☆1,913Jan 22, 2026Updated 2 months ago
- A MCP implementation for sending notifications via Pushover☆35Mar 16, 2025Updated last year
- lol☆10Mar 12, 2021Updated 5 years ago
- ☆14Dec 13, 2022Updated 3 years ago
- Pytorch implementation of “MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures” (NeurIPS 2020 spotlight)☆13Jul 22, 2021Updated 4 years ago
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆76Feb 18, 2026Updated last month