π OctoPack: Instruction Tuning Code Large Language Models
β479Feb 5, 2025Updated last year
Alternatives and similar repositories for octopack
Users that are interested in octopack are comparing it to the libraries listed below
Sorting:
- A framework for the evaluation of autoregressive code generation language models.β1,021Jul 22, 2025Updated 7 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Modelsβ63Apr 10, 2024Updated last year
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024β1,698Oct 2, 2025Updated 5 months ago
- β492Aug 15, 2024Updated last year
- Run evaluation on LLMs using human-eval benchmarkβ428Sep 12, 2023Updated 2 years ago
- Accepted by Transactions on Machine Learning Research (TMLR)β136Oct 5, 2024Updated last year
- A multi-programming language benchmark for LLMsβ299Jan 28, 2026Updated last month
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generationβ322Feb 24, 2025Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)β175Aug 15, 2025Updated 7 months ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".β267Oct 30, 2024Updated last year
- Code for the paper "Evaluating Large Language Models Trained on Code"β3,163Jan 17, 2025Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training dataβ130Apr 11, 2024Updated last year
- Open Source WizardCoder Datasetβ166Jul 12, 2023Updated 2 years ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructionsβ25Aug 8, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,478Jun 7, 2025Updated 9 months ago
- β1,506May 12, 2023Updated 2 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluationβ168Oct 11, 2024Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.β62Oct 21, 2024Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agentsβ557Oct 28, 2023Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLMβ1,479May 1, 2025Updated 10 months ago
- Code for fine-tuning Platypus fam LLMs using LoRAβ629Feb 4, 2024Updated 2 years ago
- Fine-tune SantaCoder for Code/Text Generation.β196Apr 11, 2023Updated 2 years ago
- A repository to perform self-instruct with a model on HF Hubβ32Sep 29, 2023Updated 2 years ago
- β234Feb 28, 2026Updated 3 weeks ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- Home of StarCoder: fine-tuning & inference!β7,529Feb 27, 2024Updated 2 years ago
- AllenAI's post-training codebaseβ3,629Updated this week
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.β710Dec 30, 2024Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neurβ¦β564Jan 21, 2025Updated last year
- Reproducing R1 for Code with Reliable Rewardsβ297May 5, 2025Updated 10 months ago
- β56May 28, 2024Updated last year
- Ongoing research training transformer models at scaleβ395Aug 20, 2024Updated last year
- β675Nov 1, 2024Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srwβ64Oct 4, 2024Updated last year
- Scaling Data-Constrained Language Modelsβ342Jun 28, 2025Updated 8 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instructβ2,088Nov 1, 2024Updated last year
- Salesforce open-source LLMs with 8k sequence length.β726Jan 31, 2025Updated last year
- Heuristic filtering framework for RefineCodeβ83Mar 13, 2025Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.β5,559May 21, 2025Updated 10 months ago