Fine-tune SantaCoder for Code/Text Generation.
β196Apr 11, 2023Updated 2 years ago
Alternatives and similar repositories for santacoder-finetuning
Users that are interested in santacoder-finetuning are comparing it to the libraries listed below
Sorting:
- A framework for the evaluation of autoregressive code generation language models.β1,020Jul 22, 2025Updated 7 months ago
- π OctoPack: Instruction Tuning Code Large Language Modelsβ478Feb 5, 2025Updated last year
- Accepted by Transactions on Machine Learning Research (TMLR)β137Oct 5, 2024Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β15Oct 16, 2023Updated 2 years ago
- β23Jul 10, 2023Updated 2 years ago
- β126Apr 22, 2023Updated 2 years ago
- Generative model for code infilling and synthesisβ315Sep 9, 2023Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmarkβ427Sep 12, 2023Updated 2 years ago
- A multi-programming language benchmark for LLMsβ298Jan 28, 2026Updated last month
- A pre-trained GPT model for Python code completion and generationβ282Jun 12, 2023Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQβ51Jun 6, 2023Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023β251Dec 15, 2023Updated 2 years ago
- β23Jan 25, 2023Updated 3 years ago
- β491Aug 15, 2024Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.β308Mar 1, 2023Updated 3 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Modelsβ63Apr 10, 2024Updated last year
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.β5,170Oct 27, 2025Updated 4 months ago
- Repository for analysis and experiments in the BigCode project.β128Mar 20, 2024Updated last year
- Ongoing research training transformer models at scaleβ395Aug 20, 2024Updated last year
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024β1,693Oct 2, 2025Updated 5 months ago
- Source codes for paper βReACC: A Retrieval-Augmented Code Completion Frameworkββ65Apr 18, 2022Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β96Feb 9, 2023Updated 3 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.β25Feb 9, 2024Updated 2 years ago
- Just a bunch of benchmark logs for different LLMsβ119Jul 28, 2024Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate codeβ45May 31, 2023Updated 2 years ago
- Open Source WizardCoder Datasetβ164Jul 12, 2023Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ23Mar 12, 2024Updated last year
- β41Jun 19, 2024Updated last year
- Pipeline for pulling and processing online language model pretraining data from the webβ177Jul 31, 2023Updated 2 years ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the userβ¦β184Nov 6, 2025Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clustersβ282Jul 11, 2024Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallelβ24Apr 3, 2023Updated 2 years ago
- β565Nov 20, 2024Updated last year
- [ISSTA'24] A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testingβ12Jan 7, 2025Updated last year
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.β13Aug 15, 2021Updated 4 years ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for soβ¦β56Mar 21, 2024Updated last year
- β14Oct 12, 2024Updated last year
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"β10Nov 2, 2024Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neurβ¦β561Jan 21, 2025Updated last year