π OctoPack: Instruction Tuning Code Large Language Models
β478Feb 5, 2025Updated last year
Alternatives and similar repositories for octopack
Users that are interested in octopack are comparing it to the libraries listed below
Sorting:
- A framework for the evaluation of autoregressive code generation language models.β1,020Jul 22, 2025Updated 7 months ago
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024β1,688Oct 2, 2025Updated 4 months ago
- Accepted by Transactions on Machine Learning Research (TMLR)β137Oct 5, 2024Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Modelsβ63Apr 10, 2024Updated last year
- β489Aug 15, 2024Updated last year
- Run evaluation on LLMs using human-eval benchmarkβ427Sep 12, 2023Updated 2 years ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)β174Aug 15, 2025Updated 6 months ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generationβ323Feb 24, 2025Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,476Jun 7, 2025Updated 8 months ago
- Open Source WizardCoder Datasetβ164Jul 12, 2023Updated 2 years ago
- Code for the paper "Evaluating Large Language Models Trained on Code"β3,137Jan 17, 2025Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRAβ629Feb 4, 2024Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agentsβ555Oct 28, 2023Updated 2 years ago
- A multi-programming language benchmark for LLMsβ298Jan 28, 2026Updated last month
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLMβ1,481May 1, 2025Updated 10 months ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructionsβ25Aug 8, 2024Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".β266Oct 30, 2024Updated last year
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.β708Dec 30, 2024Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training dataβ126Apr 11, 2024Updated last year
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluationβ166Oct 11, 2024Updated last year
- AllenAI's post-training codebaseβ3,592Updated this week
- Salesforce open-source LLMs with 8k sequence length.β725Jan 31, 2025Updated last year
- β672Nov 1, 2024Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.β62Oct 21, 2024Updated last year
- Fine-tune SantaCoder for Code/Text Generation.β196Apr 11, 2023Updated 2 years ago
- β232Dec 3, 2025Updated 2 months ago
- Home of StarCoder: fine-tuning & inference!β7,530Feb 27, 2024Updated 2 years ago
- β1,504May 12, 2023Updated 2 years ago
- Scaling Data-Constrained Language Modelsβ342Jun 28, 2025Updated 8 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.β121Aug 16, 2023Updated 2 years ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srwβ64Oct 4, 2024Updated last year
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Modelsβ1,660Mar 8, 2024Updated last year
- Generate textbook-quality synthetic LLM pretraining dataβ509Oct 19, 2023Updated 2 years ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instructβ2,076Nov 1, 2024Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.β5,536May 21, 2025Updated 9 months ago
- β159Aug 27, 2024Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neurβ¦β558Jan 21, 2025Updated last year
- General technology for enabling AI capabilities w/ LLMs and MLLMsβ4,289Dec 22, 2025Updated 2 months ago
- A repository to perform self-instruct with a model on HF Hubβ32Sep 29, 2023Updated 2 years ago