bigcode-project / octopackLinks

🐙 OctoPack: Instruction Tuning Code Large Language Models

☆471

Alternatives and similar repositories for octopack

Users that are interested in octopack are comparing it to the libraries listed below

Sorting:

abacaj / code-eval
Run evaluation on LLMs using human-eval benchmark
☆421Updated 2 years ago
microsoft / CodeT
☆666Updated 11 months ago
Leolty / repobench
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
☆174Updated last year
nlpxucan / evol-instruct
☆274Updated 2 years ago
bigcode-project / bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
☆985Updated 3 months ago
nickrosh / evol-teacher
Open Source WizardCoder Dataset
☆161Updated 2 years ago
bigcode-project / selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆317Updated 7 months ago
xlang-ai / DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
☆256Updated 11 months ago
loubnabnl / santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
☆193Updated 2 years ago
reasoning-machines / pal
PaL: Program-Aided Language Models (ICML 2023)
☆511Updated 2 years ago
Zyq-scut / RLTF
Accepted by Transactions on Machine Learning Research (TMLR)
☆132Updated last year
princeton-nlp / intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
☆227Updated last year
bigcode-project / bigcode-dataset
☆475Updated last year
declare-lab / instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
☆548Updated last year
amazon-science / cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆159Updated 2 months ago
shuyanzhou / docprompting
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
☆249Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆531Updated last year
OpenLMLab / LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
☆389Updated last year
nuprl / MultiPL-E
A multi-programming language benchmark for LLMs
☆278Updated 2 months ago
my-other-github-account / llm-humaneval-benchmarks
☆83Updated 2 years ago
FSoft-AI4Code / CodeCapybara
Open-source Self-Instruction Tuning Code LLM
☆169Updated 2 years ago
theblackcat102 / evol-dataset
evol augment any dataset online
☆60Updated 2 years ago
jayelm / gisting
Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆296Updated 8 months ago
conceptofmind / toolformer
☆371Updated 2 years ago
TIGER-AI-Lab / MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
☆377Updated last year
Re-Align / URIAL
☆312Updated last year
GammaTauAI / leetcode-hard-gym
A hard gym for programming
☆161Updated last year
facebookresearch / cruxeval
CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
☆154Updated last year
haoliuhl / chain-of-hindsight
Simple next-token-prediction for RLHF
☆226Updated 2 years ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year