OctoPack: Instruction Tuning Code Large Language Models
★479 · Feb 5, 2025 · Updated last year
Alternatives and similar repositories for octopack
Users that are interested in octopack are comparing it to the libraries listed below.
- A framework for the evaluation of autoregressive code generation language models. ★1,045 · Jul 22, 2025 · Updated 9 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ★63 · Apr 10, 2024 · Updated 2 years ago
- Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024 ★1,745 · Oct 2, 2025 · Updated 7 months ago
- ★494 · Aug 15, 2024 · Updated last year
- Run evaluation on LLMs using human-eval benchmark ★431 · Sep 12, 2023 · Updated 2 years ago
- Accepted by Transactions on Machine Learning Research (TMLR) ★135 · Oct 5, 2024 · Updated last year
- A multi-programming language benchmark for LLMs ★304 · Apr 12, 2026 · Updated last month
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ★323 · Feb 24, 2025 · Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ★273 · Oct 30, 2024 · Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ★177 · Aug 15, 2025 · Updated 9 months ago
- Code for the paper "Evaluating Large Language Models Trained on Code" ★3,227 · Jan 17, 2025 · Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training data ★134 · Apr 11, 2024 · Updated 2 years ago
- Open Source WizardCoder Dataset ★166 · Jul 12, 2023 · Updated 2 years ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ★25 · Aug 8, 2024 · Updated last year
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ★9,483 · Jun 7, 2025 · Updated 11 months ago
- ★1,510 · May 12, 2023 · Updated 3 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation ★170 · Oct 11, 2024 · Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages. ★64 · Oct 21, 2024 · Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ★556 · Oct 28, 2023 · Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ★1,480 · May 1, 2025 · Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA ★626 · Feb 4, 2024 · Updated 2 years ago
- Fine-tune SantaCoder for Code/Text Generation. ★196 · Apr 11, 2023 · Updated 3 years ago
- A repository to perform self-instruct with a model on HF Hub ★32 · Sep 29, 2023 · Updated 2 years ago
- ★241 · Feb 28, 2026 · Updated 2 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models ★15 · Mar 8, 2023 · Updated 3 years ago
- Home of StarCoder: fine-tuning & inference! ★7,511 · Feb 27, 2024 · Updated 2 years ago
- AllenAI's post-training codebase ★3,726 · Updated this week
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024. ★716 · Dec 30, 2024 · Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ★569 · Jan 21, 2025 · Updated last year
- Reproducing R1 for Code with Reliable Rewards ★308 · May 5, 2025 · Updated last year
- Ongoing research training transformer models at scale ★395 · Aug 20, 2024 · Updated last year
- ★674 · Nov 1, 2024 · Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw ★66 · Oct 4, 2024 · Updated last year
- Scaling Data-Constrained Language Models ★343 · Jun 28, 2025 · Updated 10 months ago
- ★57 · May 28, 2024 · Updated last year
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct ★2,093 · Nov 1, 2024 · Updated last year
- Salesforce open-source LLMs with 8k sequence length. ★727 · Jan 31, 2025 · Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning. ★5,636 · May 21, 2025 · Updated last year
- Heuristic filtering framework for RefineCode ★85 · Mar 13, 2025 · Updated last year