π OctoPack: Instruction Tuning Code Large Language Models
β479Feb 5, 2025Updated last year
Alternatives and similar repositories for octopack
Users that are interested in octopack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for the evaluation of autoregressive code generation language models.β1,032Jul 22, 2025Updated 8 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Modelsβ63Apr 10, 2024Updated 2 years ago
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024β1,709Oct 2, 2025Updated 6 months ago
- β492Aug 15, 2024Updated last year
- Run evaluation on LLMs using human-eval benchmarkβ430Sep 12, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Accepted by Transactions on Machine Learning Research (TMLR)β135Oct 5, 2024Updated last year
- A multi-programming language benchmark for LLMsβ301Jan 28, 2026Updated 2 months ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generationβ323Feb 24, 2025Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".β269Oct 30, 2024Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)β177Aug 15, 2025Updated 7 months ago
- Code for the paper "Evaluating Large Language Models Trained on Code"β3,188Jan 17, 2025Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training dataβ130Apr 11, 2024Updated last year
- Open Source WizardCoder Datasetβ166Jul 12, 2023Updated 2 years ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructionsβ25Aug 8, 2024Updated last year
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,477Jun 7, 2025Updated 10 months ago
- β1,505May 12, 2023Updated 2 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluationβ169Oct 11, 2024Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.β62Oct 21, 2024Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agentsβ557Oct 28, 2023Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLMβ1,480May 1, 2025Updated 11 months ago
- Code for fine-tuning Platypus fam LLMs using LoRAβ628Feb 4, 2024Updated 2 years ago
- Fine-tune SantaCoder for Code/Text Generation.β197Apr 11, 2023Updated 3 years ago
- A repository to perform self-instruct with a model on HF Hubβ32Sep 29, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β236Feb 28, 2026Updated last month
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- Home of StarCoder: fine-tuning & inference!β7,522Feb 27, 2024Updated 2 years ago
- AllenAI's post-training codebaseβ3,677Updated this week
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.β714Dec 30, 2024Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neurβ¦β565Jan 21, 2025Updated last year
- Reproducing R1 for Code with Reliable Rewardsβ302May 5, 2025Updated 11 months ago
- β57May 28, 2024Updated last year
- Ongoing research training transformer models at scaleβ396Aug 20, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- β676Nov 1, 2024Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srwβ64Oct 4, 2024Updated last year
- Scaling Data-Constrained Language Modelsβ343Jun 28, 2025Updated 9 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instructβ2,091Nov 1, 2024Updated last year
- Salesforce open-source LLMs with 8k sequence length.β726Jan 31, 2025Updated last year
- Heuristic filtering framework for RefineCodeβ83Mar 13, 2025Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.β5,594May 21, 2025Updated 10 months ago