loubnabnl / santacoder-finetuningView external linksLinks
Fine-tune SantaCoder for Code/Text Generation.
☆196Apr 11, 2023Updated 2 years ago
Alternatives and similar repositories for santacoder-finetuning
Users that are interested in santacoder-finetuning are comparing it to the libraries listed below
Sorting:
- A framework for the evaluation of autoregressive code generation language models.☆1,020Jul 22, 2025Updated 6 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆479Feb 5, 2025Updated last year
- Accepted by Transactions on Machine Learning Research (TMLR)☆137Oct 5, 2024Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- ☆23Jul 10, 2023Updated 2 years ago
- LLM Workshop by Sourab Mangrulkar☆401Jun 16, 2024Updated last year
- Home of StarCoder: fine-tuning & inference!☆7,533Feb 27, 2024Updated last year
- ☆126Apr 22, 2023Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark☆427Sep 12, 2023Updated 2 years ago
- A multi-programming language benchmark for LLMs☆299Jan 28, 2026Updated 2 weeks ago
- 4 bits quantization of SantaCoder using GPTQ☆51Jun 6, 2023Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago
- ☆22Jan 25, 2023Updated 3 years ago
- ☆490Aug 15, 2024Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Mar 1, 2023Updated 2 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆63Apr 10, 2024Updated last year
- C++ implementation for 💫StarCoder☆459Sep 9, 2023Updated 2 years ago
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,173Oct 27, 2025Updated 3 months ago
- Repository for analysis and experiments in the BigCode project.☆128Mar 20, 2024Updated last year
- Ongoing research training transformer models at scale☆395Aug 20, 2024Updated last year
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,687Oct 2, 2025Updated 4 months ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆65Apr 18, 2022Updated 3 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- Open Source WizardCoder Dataset☆164Jul 12, 2023Updated 2 years ago
- ☆41Jun 19, 2024Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆177Jul 31, 2023Updated 2 years ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆183Nov 6, 2025Updated 3 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆280Jul 11, 2024Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Apr 3, 2023Updated 2 years ago
- ☆564Nov 20, 2024Updated last year
- A tqdm bar progress that works with MongoDB instead of console.☆11Feb 21, 2022Updated 3 years ago
- ☆14Oct 12, 2024Updated last year
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆56Mar 21, 2024Updated last year
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆558Jan 21, 2025Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training data☆126Apr 11, 2024Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆323Feb 24, 2025Updated 11 months ago
- Implements RNNPool and SoftPool for CNNs.☆14Jan 29, 2021Updated 5 years ago