π OctoPack: Instruction Tuning Code Large Language Models
β479Feb 5, 2025Updated last year
Alternatives and similar repositories for octopack
Users that are interested in octopack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for the evaluation of autoregressive code generation language models.β1,048Jul 22, 2025Updated 10 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Modelsβ63Apr 10, 2024Updated 2 years ago
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024β1,760Oct 2, 2025Updated 8 months ago
- β494Aug 15, 2024Updated last year
- Run evaluation on LLMs using human-eval benchmarkβ429Sep 12, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Accepted by Transactions on Machine Learning Research (TMLR)β135Oct 5, 2024Updated last year
- A multi-programming language benchmark for LLMsβ308Apr 12, 2026Updated last month
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generationβ322Feb 24, 2025Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".β272Oct 30, 2024Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)β179Aug 15, 2025Updated 9 months ago
- Code for the paper "Evaluating Large Language Models Trained on Code"β3,253Jan 17, 2025Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training dataβ134Apr 11, 2024Updated 2 years ago
- Open Source WizardCoder Datasetβ166Jul 12, 2023Updated 2 years ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructionsβ25Aug 8, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,483Jun 7, 2025Updated last year
- β1,512May 12, 2023Updated 3 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluationβ169Oct 11, 2024Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.β64Oct 21, 2024Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agentsβ556Oct 28, 2023Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLMβ1,477May 1, 2025Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRAβ626Feb 4, 2024Updated 2 years ago
- Fine-tune SantaCoder for Code/Text Generation.β197Apr 11, 2023Updated 3 years ago
- A repository to perform self-instruct with a model on HF Hubβ32Sep 29, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β241Feb 28, 2026Updated 3 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- Home of StarCoder: fine-tuning & inference!β7,505Feb 27, 2024Updated 2 years ago
- AllenAI's post-training codebaseβ3,746Updated this week
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.β715Dec 30, 2024Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neurβ¦β571Jun 2, 2026Updated last week
- Reproducing R1 for Code with Reliable Rewardsβ310May 5, 2025Updated last year
- Ongoing research training transformer models at scaleβ396Aug 20, 2024Updated last year
- β675Nov 1, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srwβ66Oct 4, 2024Updated last year
- Scaling Data-Constrained Language Modelsβ342Jun 28, 2025Updated 11 months ago
- β57May 28, 2024Updated 2 years ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instructβ2,095Nov 1, 2024Updated last year
- Salesforce open-source LLMs with 8k sequence length.β727Jun 2, 2026Updated last week
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.β5,659May 21, 2025Updated last year
- Heuristic filtering framework for RefineCodeβ85Mar 13, 2025Updated last year