Benchmarking Goal-Oriented Software Engineering
☆175 Jun 25, 2026 Updated this week
Alternatives and similar repositories for CodeClash
Users who are interested in CodeClash are comparing it to the libraries listed below.
- A Python library that uses Reinforcement Learning (RL) to train LLMs. ☆43 Jun 24, 2026 Updated last week
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD) ☆31 Jun 18, 2026 Updated last week
- Run SWE-bench evaluations remotely ☆73 Aug 14, 2025 Updated 10 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆539 Jun 22, 2026 Updated last week
- MoE training for Me and You and maybe other people ☆392 Mar 15, 2026 Updated 3 months ago
- Harness for running and evaluating AI agents against RL environments ☆200 Jun 20, 2026 Updated last week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆682 Jun 22, 2026 Updated last week
- Code & data for TaxCalcBench ☆97 Updated this week
- A Berkeley library for probability theory. ☆15 Jan 14, 2025 Updated last year
- Solidity grammar for tree sitter ☆12 Mar 12, 2022 Updated 4 years ago
- ☆19 Aug 10, 2024 Updated last year
- Programmable chat templates for LLM training and inference. ☆121 Updated this week
- Calling LLM APIs on a Raspberry Pi for lulz ☆24 Apr 17, 2023 Updated 3 years ago
- ☆34 Mar 21, 2026 Updated 3 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ☆16 Mar 19, 2023 Updated 3 years ago
- Hand-Rolled GPU communications library ☆95 Nov 25, 2025 Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆35 Apr 17, 2025 Updated last year
- Clean RL implementation using MLX ☆34 Mar 8, 2024 Updated 2 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a… ☆36 Jan 18, 2026 Updated 5 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆66 Jul 8, 2024 Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆14 Oct 12, 2024 Updated last year
- Abstraction and Reasoning Corpus ☆14 Nov 22, 2022 Updated 3 years ago
- Benchmark Large Language Models Reliably On Your Data ☆18 Dec 27, 2025 Updated 6 months ago
- ☆10 Nov 6, 2024 Updated last year
- A lightweight computational physics framework, based on the organization of turboWAVE. Implements a "Simulation, PhysicsModule, ComputeTo… ☆12 Apr 1, 2026 Updated 3 months ago
- PIRA - Automatic Instrumentation Refinement ☆17 Mar 28, 2024 Updated 2 years ago
- ☆12 Mar 15, 2024 Updated 2 years ago
- A benchmark for evaluating situated inductive reasoning ☆16 Jan 7, 2025 Updated last year
- nyc is so back ☆21 Jun 27, 2025 Updated last year
- Son of Grid Engine ☆18 Dec 19, 2024 Updated last year
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo… ☆43 May 8, 2026 Updated last month
- ☆12 Mar 3, 2023 Updated 3 years ago
- ☆24 Mar 21, 2025 Updated last year
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA ☆16 May 11, 2022 Updated 4 years ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆40 Jul 13, 2024 Updated last year
- ☆29 Sep 23, 2025 Updated 9 months ago
- A Spigot plugin and programming language to schedule and program commands to be run at specified timecodes ☆12 Jan 16, 2026 Updated 5 months ago
- ☆46 Jan 10, 2026 Updated 5 months ago
- Agentic RL Training at Scale ☆1,533 Jun 24, 2026 Updated last week