[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆322Feb 24, 2025Updated last year
Alternatives and similar repositories for selfcodealign
Users that are interested in selfcodealign are comparing it to the libraries listed below
Sorting:
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- Training and Benchmarking LLMs for Code Preference.☆38Nov 15, 2024Updated last year
- RepoQA: Evaluating Long-Context Code Understanding☆129Nov 1, 2024Updated last year
- The First International Workshop on Large Language Model for Code 2024 (Co-Located with ICSE 2024)☆17Oct 4, 2024Updated last year
- Reproducing R1 for Code with Reliable Rewards☆297May 5, 2025Updated 10 months ago
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,698Oct 2, 2025Updated 5 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆478Feb 5, 2025Updated last year
- Making code edting up to 7.7x faster using multi-layer speculation☆24Feb 20, 2025Updated last year
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,088Nov 1, 2024Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM☆81Apr 6, 2024Updated last year
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆488Jan 3, 2026Updated 2 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆679Mar 16, 2025Updated last year
- A multi-programming language benchmark for LLMs☆299Jan 28, 2026Updated last month
- Collect simple coverage information in memory.☆11Oct 6, 2022Updated 3 years ago
- e☆43Apr 23, 2025Updated 10 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆192Aug 16, 2024Updated last year
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆817Jul 16, 2025Updated 8 months ago
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆62Oct 21, 2024Updated last year
- Artifact for ESEC/FSE'23 paper "NeuRI: Diversifying DNN Generation via Inductive Rule Inference"☆32Nov 13, 2023Updated 2 years ago
- Fast and Precise On-the-fly Patch Validation for All☆10Feb 24, 2023Updated 3 years ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,019Dec 22, 2024Updated last year
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆597Updated this week
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆63Apr 10, 2024Updated last year
- A framework for the evaluation of autoregressive code generation language models.☆1,021Jul 22, 2025Updated 7 months ago
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Mar 31, 2023Updated 2 years ago
- The Open Cookbook for Top-Tier Code Large Language Model☆2,059Dec 8, 2024Updated last year
- ☆1,506May 12, 2023Updated 2 years ago
- Home of StarCoder2!☆2,047Mar 21, 2024Updated last year
- ☆1,033Dec 17, 2024Updated last year
- Fuzzing Automatic Differentiation in Deep-Learning Libraries (ICSE'23)☆27Mar 2, 2024Updated 2 years ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆136Oct 5, 2024Updated last year
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆48Sep 13, 2025Updated 6 months ago
- ☆133May 8, 2025Updated 10 months ago
- ☆71Oct 16, 2024Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- ☆113Jul 17, 2024Updated last year
- ☆12Nov 14, 2021Updated 4 years ago
- ☆56May 28, 2024Updated last year
- ☆41Jun 19, 2024Updated last year