Can Language Models Rebuild Programs From Scratch?
☆795Jun 25, 2026Updated this week
Alternatives and similar repositories for ProgramBench
Users that are interested in ProgramBench are comparing it to the libraries listed below.
- ☆19Oct 2, 2023Updated 2 years ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Jul 5, 2024Updated last year
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆17Nov 20, 2025Updated 7 months ago
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆17Jun 1, 2026Updated 3 weeks ago
- ☆45Jan 30, 2026Updated 5 months ago
- Research Artifact for HPCA'24 Paper: *Modeling, Derivation, and Automated Analysis of Branch Predictor Security Vulnerabilities*.☆11Oct 30, 2025Updated 8 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆15Jun 16, 2026Updated 2 weeks ago
- implementation of aided LLM codeplan algorithm in java☆10Jan 13, 2024Updated 2 years ago
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]☆79Jan 27, 2026Updated 5 months ago
- [NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents☆85Apr 27, 2026Updated 2 months ago
- Fine-tuning llama2 using the chain-of-thought approach☆10Nov 18, 2023Updated 2 years ago
- The High Performance LLM Native Mock Server☆34May 24, 2026Updated last month
- ☆10Jan 23, 2025Updated last year
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆13Feb 24, 2024Updated 2 years ago
- Your Interface to Intelligence☆54Updated this week
- Clover - imageboard browser for Android☆40Updated this week
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- CleanVul: Automatic Function-Level Vulnerability Detection in Code Commits Using LLM Heuristics☆22Mar 25, 2026Updated 3 months ago
- ☆31Nov 17, 2025Updated 7 months ago
- A Productivity-Boosting Burp Suite extension written in Kotlin that enables persistent sticky session handling in web application testing…☆14Oct 8, 2025Updated 8 months ago
- ☆12Nov 5, 2024Updated last year
- An ultimate pdf file disintegration tool☆11Jun 12, 2020Updated 6 years ago
- win32 native frontend for llama-cli☆14Nov 2, 2024Updated last year
- Vibe Editing – Asynchronous Voice-to-Edit Flow with AI Agents in Cursor (AI Tinkerers Toronto - May 2025 Meetup: AGENTS at Ada)☆15May 22, 2025Updated last year
- Deep Correlations for Texture Synthesis☆18Sep 29, 2017Updated 8 years ago
- Semantic Segmentation Project for Self Driving Cars☆17Jun 11, 2020Updated 6 years ago
- Google Play InApp Billing v3 Example☆11Mar 21, 2021Updated 5 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Dec 13, 2024Updated last year
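- A Claude Code skill that automatically generates bookkeeping ledgers for Korean startups, one-person corporations, freelancers, and sole proprietors. Card statement PDFs and bank CSVs → automatically generated financial statements and CSVs for handing off to a tax accountant. Applies Level 2 sensitive-information masking.☆68Apr 23, 2026Updated 2 months ago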
- ☆13Apr 26, 2023Updated 3 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- Regex base tail written in Rust☆10Mar 20, 2023Updated 3 years ago
- ☆42Apr 21, 2025Updated last year
- This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".☆10Jun 2, 2023Updated 3 years ago
- ☆98Nov 22, 2025Updated 7 months ago
- API server that converts hwp files - thanks to hwplib & hwpxlib☆13Jun 9, 2023Updated 3 years ago
- ☆19Jul 4, 2025Updated 11 months ago
- the implementation of Embedding API Dependency Graph for Neural Code Generation☆12Jun 6, 2021Updated 5 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆144May 6, 2026Updated last month