Benchmarking Goal-Oriented Software Engineering
☆114 · Jan 7, 2026 · Updated last month
Alternatives and similar repositories for CodeClash
Users who are interested in CodeClash are comparing it to the repositories listed below.
- ☆14 · Apr 16, 2025 · Updated 10 months ago
- ☆19 · Aug 10, 2024 · Updated last year
- Run SWE-bench evaluations remotely ☆58 · Aug 14, 2025 · Updated 6 months ago
- A Python library that uses Reinforcement Learning (RL) to train LLMs. ☆42 · Updated this week
- Hand-Rolled GPU communications library ☆85 · Nov 25, 2025 · Updated 3 months ago
- Harness for running and evaluating AI agents against RL environments ☆115 · Updated this week
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD) ☆27 · Sep 18, 2025 · Updated 5 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench ☆35 · Aug 12, 2025 · Updated 6 months ago
- we have ai at home ☆72 · Feb 18, 2026 · Updated last week
- Commit0: Library Generation from Scratch ☆186 · Updated this week
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆64 · Jul 8, 2024 · Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆35 · Apr 17, 2025 · Updated 10 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆577 · Updated this week
- Create and deploy virtual-experiments - co-processing computational workflows ☆10 · Jan 28, 2026 · Updated last month
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 · Mar 14, 2025 · Updated 11 months ago
- ext_mpi_collectives ☆11 · Apr 1, 2025 · Updated 10 months ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat… ☆25 · Oct 2, 2025 · Updated 4 months ago
- Clean RL implementation using MLX ☆35 · Mar 8, 2024 · Updated last year
- Memory Topology for GPUs ☆17 · Feb 13, 2026 · Updated 2 weeks ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple. ☆26 · Feb 4, 2026 · Updated 3 weeks ago
- Open source project for the Merlin400 extractor ☆20 · May 25, 2024 · Updated last year
- The official repository of ALE-Bench ☆158 · Updated this week
- ☆10 · May 14, 2025 · Updated 9 months ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators". ☆14 · Dec 24, 2025 · Updated 2 months ago
- Implementation of MetaVQA. ☆12 · Jul 3, 2021 · Updated 4 years ago
- 🎮 Real-time game subtitle translator with AI-powered OCR. Context-aware translation for 20+ languages. Free offline models + dirt cheap … ☆28 · Feb 20, 2026 · Updated last week
- 2D time-domain isotropic (visco)elastic FD modeling and full waveform inversion (FWI) code for SH-waves ☆13 · Aug 9, 2020 · Updated 5 years ago
- Sequential Parameter Optimization in Python ☆14 · Jan 12, 2026 · Updated last month
- A recreation of Amadeus from Steins;Gate 0, more specifically the desktop version of Amadeus at Viktor Chondria University; I tried to… ☆22 · Jul 28, 2025 · Updated 7 months ago
- EPOCH Input System Version 2 ☆10 · Jun 5, 2020 · Updated 5 years ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning. ☆12 · Jun 24, 2024 · Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 · Dec 12, 2024 · Updated last year
- OpenMP offload playground ☆10 · Nov 16, 2024 · Updated last year
- A Swedish Natural Language Understanding Benchmark ☆11 · Dec 12, 2025 · Updated 2 months ago
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried … ☆14 · Jan 10, 2024 · Updated 2 years ago
- How to build an ACP compliant agent that uses MCP as well! ☆11 · May 6, 2025 · Updated 9 months ago
- ☆11 · Feb 27, 2024 · Updated 2 years ago
- Profitable MT5 Expert Advisors ☆21 · Updated this week
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting ☆17 · Nov 28, 2025 · Updated 3 months ago